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I. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application, U.S. Patent Application Serial No. 09/941,992 recorded 
November 16, 2001, at Reel 012176 and Frame 0450. 

II. RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to a polypeptide referred to 
herein as "PR0341." There exists one related patent application: (1) U.S. Patent Application 
Serial No. 09/941,992, filed August 28, 2001 (containing claims directed to PR0341 
polypeptides), which is also under final rejection from the same Examiner. Based upon the 
similar outstanding rejections, an appeal of the final rejection is being pursued independently and 
concurrently herewith. 

III. STATUS OF CLAIMS 

Claims 1 19-123 are pending in this application. 
Claims 1-118 and 124 have been canceled. 

Claims 1 19-123 stand rejected and Appellants appeal the rejection of these claims. 
A copy of the rejected claims in the present Appeal is provided in Section VIIL 

IV. STATUS OF AMENDMENTS 

A summary of the prosecution history for this case is as follows: 
Previously, in response to a Final Office Action mailed on February 8, 2005, a Notice of 
Appeal was filed on August 3, 2005 and an Appeal Brief was filed on November 3, 2005. An 
Examiner's Answer was mailed on January 11, 2006 which cited new references; hence, a Reply 
Brief, a Petition for Designation as New grounds of Rejection and a request to withdraw finality 
of the rejection was filed under 37 C.F.R. §1.181 on March 9, 2006. The Decision on the 
Petition granting the Appellants' request to have the finality withdrawn was mailed on April 18, 
2006, and the Examiner's Answer was deemed as a non- final Office Action. Therefore, a 
Supplemental Response submitting supportive references to Appellants' arguments was filed on 
September 25, 2006. A Final Office Action was mailed on November 6, 2006 (with new 
references cited by the Examiner). A Notice of Appeal was filed on April 5, 2007. 
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No claim amendments have been submitted after the last final rejection of November 6, 

2006. 

V. SUMMARY OF CLAIMED SUBJECT MATTER 

The invention claimed in the present application is related to an isolated antibody that 
specifically binds to the polypeptide of SEQ ID NO: 20 (Claim 119), referred to in the present 
application as "PR0341." The invention is further directed to monoclonal antibodies 
(Claim 120), humanized antibodies (Claim 121), antibody fragments (Claim 122), and labeled 
antibodies (Claim 123) that specifically bind to the polypeptide of SEQ ID NO: 20. The 
PR0341 gene was shown for the first time in the present application to be significantly amplified 
in human lung cancers as compared to normal, non-cancerous human tissue controls 
(Example 170). 

Support for the preparation and uses of antibodies is found throughout the specification, 
including, for example, pages 390-395. The preparation of antibodies is described in 
Example 144, while Example 145 describes the use of the antibodies for purifying the 
polypeptides to which they bind. Isolated antibodies are defined in the specification at page 315, 
line 3 1 . Support for monoclonal antibodies is found in the specification at, for example, 
page 390, line 17, to page 392, line 3. Support for humanized antibodies is found in the 
specification at, for example, page 392, line 4, to page 393, line 6. Support for antibody 
fragments is found in the specification at, for example, page 314, line 30 onwards. Support for 
labeled antibodies is found in the specification at, for example, page 316, lines 3. 

The polypeptide of SEQ ID NO:20 is designated PR0341, and its amino acid sequence is 
shown in Figure 12, while the encoding nucleic acid sequence (SEQ ID NO: 19) is shown in 
Figure 11. Page 288, lines 14-17 of the specification provides the description for Figures 12 and 
11. The cDNA for PR0341 was deposited under ATCC accession number 209792. The 
isolation of cDNA clones encoding PR0341 of SEQ ID NO:20 is described in Example 107. 
Examples 140-143 and page 376, line 12 onwards describe the expression of PRO polypeptides 
in various host cells, including E. coli, mammalian cells, yeast and Baculovirus-infected insect 
cells. 
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Finally, Example 170, in the specification at page 539, line 19, to page 555, line 5, sets 
forth a 'Gene Amplification assay' which shows that the PR0341 gene is amplified in the 
genome of certain human lung cancers (see Table 9A, page 550, third column). The profiles of 
various primary lung tumors used for screening the PRO polypeptide compounds of the 
invention in the gene amplification assay are summarized on Table 8, page 546 of the 
specification. 

VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

1 . Whether Claims 1 19-123 satisfy the utility requirement under 35 U.S.C. 
§§101/112, first paragraph. 

VII. ARGUMENTS 
Summary of the Arguments 
Issue 1: Utility/ Enablement 

As a preliminary matter, Appellants note that the Examiner refers to the microarray assay 
in several instances in several Office Actions throughout prosecution, and cites references like 
Lilley et al 9 King et al 9 Wildsmith et al 9 Chen et al etc. which use and analyze the microarray 
assay. On the other hand, Appellants rely upon the gene amplification data of the PR0341 gene 
for patentable utility of antibodies to the PR0341 polypeptide in the present application, and 
would like to distinguish between the gene amplification and the microarray assay. The gene 
amplification assay measures the level at which a certain gene (i.e. DNA) is amplified in the 
genome, whereas the microarray assay measures the level of expression of a mRNA encoding for 
a certain polypeptide in a sample. Throughout prosecution, the Examiner fails to distinguish 
between these two techniques, but Appellants submit that the two assays, although similar, are 
not the same. Appellants would like to bring to the Examiner's attention a recent decision in a 
microarray case by the Board of Patent Appeals and Interferences (Decision on Appeal No. 
2006-1469). In its decision, the Board reversed the utility rejection, acknowledging that "there is 
a strong correlation between mRNA levels and protein expression, and the Examiner has not 
presented any evidence specific to the PR01866 polypeptide to refute that." (Page 9). 
Appellants submit that, in the instant application, the Examiner has likewise not presented any 
evidence specific to the PR0341 polypeptide or its antibodies to refute Appellant's assertion of a 
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correlation between DNA levels, mRNA levels and protein expression. Appellants add that they 
analyze the microarray assay only in response to the Examiner's cited references. 

Appellants rely upon the gene amplification data of the PR0341 gene for patentable 
utility of the PR0341 polypeptides and their antibodies. This data is clearly disclosed in the 
instant specification in Example 170 which discloses that the gene encoding PR0341 showed 
significant amplification, ranging from 2.173 fold to 2.514-fold amplification in three lung 
tumors . Appellants have submitted, in their Response filed October 24, 2003, a Declaration by 
Dr. Audrey Goddard, which explains that a gene identified as being amplified at least 2-fold by 
the disclosed gene amplification assay in a tumor sample relative to a normal sample is useful as 
a marker for the diagnosis of cancer , and for monitoring cancer development and/or for 
measuring the efficacy of cancer therapy. Therefore, such a gene is useful as a marker for the 
diagnosis of lung cancer , and for monitoring cancer development and/or for measuring the 
efficacy of cancer therapy. 

Appellants have also submitted, in their Responses filed November 15, 2004, and 
September 25, 2006, ample evidence to show that, in general, if a gene is amplified in cancer, it 
is more likely than not that the encoded protein will be expressed at an elevated level. For 
instance, the articles by Orntoft et al, Hyman et al, and Pollack et al. collectively teach that in 
general, gene amplification increases mRNA expression . Second, the Declarations of Dr. Paul 
Polakis: (Polakis I and II), shows that, in general there is a correlation between mRNA levels 
and polypeptide levels . Third, Appellants further submit that even if there were no correlation 
between gene amplification and increased mRNA/protein expression, (which Appellants 
expressly do not concede to), a polypeptide encoded by a gene that is amplified in cancer would 
still have a specific, substantial, and credible utility. Appellants submit that, as evidenced by the 
Ashkenazi Declaration and the teachings of Hanna and Mornin (both made of record in 
Appellants' Response filed November 15, 2004), simultaneous testing of gene amplification and 
gene product over-expression enables more accurate tumor classification , even if the gene- 
product, the protein, is not over-expressed. This leads to better determination of a suitable 
therapy for the tumor, as demonstrated by a real-world example of the breast cancer marker 
HER-2/neu. 
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Moreover, the Declarations of Dr. Paul Polakis (Polakis I- made of record in Appellants' 
Response filed November 15, 2004; and Polakis II- made of record in Appellants' Response filed 
September 25, 2006), principal investigator of the Tumor Antigen Project of Genentech, Inc., the 
assignee of the present application, shows that, in general there is good correlation between 
mRNA levels and polypeptide levels . 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip arrays in 2004. Clearly, the research community believes that the 
information obtained from these chips is useful (i.e.,' that it is more likely than not informative of 
the protein level). Therefore, as a general rule, one skilled in the art would find it more likely 
than not that PR0341 and antibodies binding to the PR0341 polypeptides are useful as a 
diagnostic tools for detecting lung tumors. 

Accordingly, Appellants submit that when the proper legal standard is applied, one 
should reach the conclusion that the present application discloses at least one patentable utility 
for the claimed PR0341 polypeptides and its antibodies thereof. Accordingly, one of ordinary 
skill in the art would also understand how to make and use the recited antibodies for the 
diagnosis of lung cancer without any undue experimentation. 

These arguments are all discussed in further detail below under the appropriate headings. 

Response to Rejections 

ISSUE 1. Claims 119-123 are Supported bv a Credible, Specific and Substantial Asserted 
Utility, and Thus Meet the Utility Requirement of 35 U.S.C. S101 and the "How to Use 
Prong" of the Enablement Requirement of 35 U.S.C. §112, First Paragraph 

The sole basis for the Examiner's rejection of Claims 1 19-123 under these sections is that 
the data presented in Example 170 of the present specification is allegedly insufficient under 
applicable legal standards to establish a patentable utility under 35 U.S.C. §101 for the presently 
claimed subject matter, and further, since a patentable utility has not been established, one would 
not know how to use the claimed invention. 

Appellants strongly disagree and respectfully traverse the rejection. 
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A. The Legal Standard For Utility Under 35 U.S.C. §101 

According to 35 U.S.C. §101: 

Whoever invents or discovers any new and useful process, machine, manufacture, 
or composition of matter, or any new and useful improvement thereof, may obtain 
a patent therefor, subject to the conditions and requirements of this title. 
(Emphasis added). 

In interpreting the utility requirement, in Brenner v. Manson, the Supreme Court held 
that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent applicant disclose a "substantial utility" for his or 
her invention, i.e., a utility "where specific benefit exists in currently available form." The 
Court concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 

3 

commerce rather than the realm of philosophy." 

4 

Later, in Nelson v. Bowler, the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may 
not establish a specific therapeutic use. The Court held that "since it is crucial to provide 
researchers with an incentive to disclose pharmaceutical activities in as many compounds as 
possible, we conclude adequate proof of any such activity constitutes a showing of practical 

utility." 5 

In Cross v. Iizuka? the C.A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "in vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 

1 Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 

2 Id. at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 

5 Id. at 856, 206 U.S.P.Q. (BNA) at 883. 

6 Cross v. lizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 
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results with the particular pharmacological activity are generally predictive of in vivo test results, 
i.e. 9 there is a reasonable correlation there between." 7 The Court perceived, "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 

8 

practical utility." 

The case law has also clearly established that Appellants* statements of utility are usually 

9 

sufficient, unless such statement of utility is unbelievable on its face. The PTO has the initial 

10 

burden to prove that Appellants' claims of usefulness are not believable on their face. In 
general, an Appellants assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in 

11 12 

the art to question the objective truth of the statement of utility or its scope." ' 

13 

Compliance with 35 U.S.C. §101 is a question of fact. The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 

14 

totality of the evidence under consideration. Thus, to overcome the presumption of truth that 
an assertion of utility by the Appellant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Appellant. The issue will then be decided on the totality of evidence. 

7 Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 

8 

Id. 

9 In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 

10 

Ibid. 

H In re Langer, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In reJolIes, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 F.2d 974, 144 
USPQ 351 (1965); In re Sichert, 566 F.2d 1154, 1159, 196 USPQ 209, 212-13 (C.C.P.A. 1977). 

13 Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) cert, denied, 469 
US 835 (1984). 

14 In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 1992). 
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The well established case law is clearly reflected in the Utility Examination Guidelines 
("Utility Guidelines")/ 5 which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when 
it is particular to the subject matter claimed. For example, it is generally not enough to state that 
a nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the 
public" or similar formulations used in certain court decisions to mean that products or services 
based on the claimed invention must be "currently available" to the public in order to satisfy the 
utility requirement. "Rather, any reasonable use that an applicant has identified for the invention 
that can be viewed as providing a public benefit should be accepted as sufficient, at least with 
regard to defining a 'substantial' utility.'" 16 Indeed, the Guidelines for Examination of 

17 

Applications for Compliance With the Utility Requirement, gives the following instruction to 
patent examiners: "If the Appellant has asserted that the claimed invention is useful for any 
particular practical purpose . . . and the assertion would be considered credible by a person of 
ordinary skill in the art, do not impose a rejection based on lack of utility." 

B. Proper Application of the Legal Standard 

Appellants submit that the evidentiary standard to be used throughout ex parte 
examination of a patent application is a preponderance of the totality of the evidence under 
consideration. Thus, to overcome the presumption of truth that an assertion of utility by the 
Appellant enjoys, the Examiner must establish that it is more likely than not that one of ordinary 
skill in the art would doubt the truth of the statement of utility. Only after the Examiner has 



15 66 Fed. Reg. 1092 (2001). 

16 M.P.E.P. §2107.01. 

17 M.P.E.P. §2107 11(B)(1). 
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made a proper prima facie showing of lack of utility, does the burden of rebuttal shift to the 
Appellant. 

Appellants respectfully submit that the data presented in Example 170 starting on page 
539 of the specification and the cumulative evidence of record support a "specific, substantial 
and credible" asserted utility for the presently claimed invention. 

Patentable utility for the PR0341 polypeptides and their antibodies is based upon the 
gene amplification data for the gene encoding the PR0341 polypeptide of SEQ ID NO: 20. 
Example 170 describes the results obtained using a very well-known and routinely employed 
polymerase chain reaction (PCR)-based assay, the TaqMan™ PCR assay, also referred to herein 
as the gene amplification assay. This assay allows one to quantitatively measure the level of 
gene amplification in a given sample, say, a tumor extract, or a cell line. It was well known in 
the art at the time the invention was made that gene amplification is an essential mechanism for 
oncogene activation. Appellants isolated genomic DNA from a variety of primary cancers and 
cancer cell lines that are listed in Table 9 (pages 539 onwards of the specification), including 
primary colon cancers of the type and stage indicated in Table 8 (page 546). The tumor samples 
were tested in triplicates with Taqman™ primers and with internal controls, beta-actin and 
GADPH in order to quantitatively compare DNA levels between samples (page 548, lines 33- 
34). .As a negative control, DNA was isolated from the cells of ten normal healthy individuals, 
which was pooled and used as a control (page 539, lines 27-29) and also, no-template controls 
(page 548, lines 33-34). The results of TaqMan™ PCR are reported in ACt units, as explained in 
the passage on page 539, lines 37-39. One unit corresponds to one PCR cycle or approximately 
a 2-fold amplification, relative to control, two units correspond to 4-fold 5 3 units to 8-fold 
amplification and so on. Using this PCR-based assay, Appellants showed that the gene encoding 
for PR0341 was amplified, that is, it showed approximately 1.12-1.33 ACt units which 
corresponds to 2 1 - 12 -2 *- 33 - fold amplification or 2.173 fold to 2.514-fold amplification in 
three lung tumors. 

Appellants submitted a Declaration by Dr. Audrey Goddard which provides a statement 
by an expert in the relevant art that "fold amplification" values of at least 2-fold are considered 
significant in the TaqMan™ PCR gene amplification assay. Appellants particularly draw the 
Board's attention to page 3 of the Goddard Declaration which clearly states that: 
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It is further my considered scientific opinion that an at least 2-fold increase in 
gene copy number in a tumor tissue sample relative to a normal (i.e., non-tumor) 
sample is significant and useful in that the detected increase in gene copy number 
in the tumor sample relative to the normal sample serves as a basis for using 
relative gene copy number as quantitated by the TaqMan PCR technique as a 
diagnostic marker for the presence or absence of tumor in a tissue sample of 
unknown pathology. Accordingly, a gene identified as being amplified at least 2- 
fold by the quantitative TaqMan PCR assay in a tumor sample relative to a normal 
sample is useful as a marker for the diagnosis of cancer, for monitoring cancer 
development and/or for measuring the efficacy of cancer therapy. (Emphasis 
added). 

Accordingly, the 2.173 fold to 2.514-fold amplification in three lung tumors would be 
considered significant and credible by one skilled in the art, based upon the facts disclosed in the 
Goddard Declaration. 

Further, as discussed in detail below, Appellants have provided ample evidence in the 
form of articles from the art, like Orntoft et al 9 Hyman et ai, Pollack et al, and over a 100 
references (see Evidence List items 16-147) and Declarations by experts in the field of oncology 
and gene expression, i.e.: the Declarations by Dr. Paul Polakis (I and II) and by Dr. Avi 
Ashkenazi , to show that, in general, if a gene is amplified in cancer, it is "more likely than not" 
that the encoded protein will also be expressed at an elevated level. 

C. A prima facie case of lack of utility has not been established 

As discussed above, the increase in DNA copy number for the PR0341 gene is 
significant. Further, the evidentiary standard to be used throughout ex parte examination of a 
patent application is a preponderance of the totality of the evidence under consideration. Thus, 
to overcome the presumption of truth that an assertion of utility by the applicant enjoys, the 
Examiner must establish that it is more likely than not that one of ordinary skill in the art would 
doubt the truth of the statement of utility. 

Accordingly, it is not a legal requirement to establish a necessary correlation between an 
increase in the copy number of the DNA and protein expression levels that would correlate to the 
disease state or that it is imperative to find evidence that DNA amplification is " necessarily " or 
"always" associated with overexpression of the gene product. Appellants respectfully submit 
that when the proper evidentiary standard is applied, a correlation must be acknowledged. Only 
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after the Examiner has made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Appellant. 

Previously, the Examiner has indicated based on references Pennica et al, Konopka et al 
and Haynes et al to show that gene amplification data cannot reliably predict protein levels. 
Appellants have argued the references in great detail throughout prosecution, and these 
arguments are incorporated by reference herein for brevity. Appellants summarize the rejections 
and the arguments submitted below. 

The teachings of Pennica et al are specific to WISP genes, a specific class of closely 
related molecules. Pennica et al showed that there was good correlation between DNA and 
mRNA expression levels for the WISP-1 gene but not for WISP-2 and WISPS genes. WISPs 1- 
3 have no structural relationship to the PR0341 polypeptides of the present application. The 
apparent finding that for two out of three specific molecules, that are related to each other but 
have no relationship to PR0341, there was no correlation between gene amplification and the 
level of mRNA/protein expression does not establish, in general, that it is more likely than not 
that such correlation does not exist, and has no bearing whatsoever on determining the question 
whether such correlation is likely to exist between PR0341 gene amplification and 
mRNA/protein expression levels. As discussed above, the standard is not absolute certainty . 
Pennica et al has no teaching whatsoever about the correlation of gene amplification and protein 
expression for genes in general or PR0341 or related molecules in particular. 

Similarly, in Konopka et al, the Examiner has generalized a very specific result disclosed 
by Konopka et al to cover all genes. Konopka et al actually state that "[p]rotein expression is 
not related to amplification of the abl gene but to variation in the level of bcr-ab\ mRNA 
produced from a single Phi template." (See Konopka et al Abstract, emphasis added). The 
paper does not teach anything whatsoever about the correlation of protein expression and gene 
amplification in general and provides no basis for the generalization that apparently underlies 
the present rejection. The statement of Konopka et al that "[p]rotein expression is not related to 
amplification of the abl gene ..." is not sufficient to establish a prima facie ' case of lack of 
utility. Therefore, the combined teachings of Pennica et al and Konopka et al are not directed 
towards genes in general but to a single gene or genes within a single family and thus, their 
teachings cannot support a general conclusion regarding correlation between gene amplification 
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and mRNA or protein levels. In addition, the abl gene has no structural relationship to the 
PR0341 gene of the present application and thus Konopka et al provides no information of 
specific relevance to the question whether for PR0341 there is a reasonable expectation that 
correlation between gene amplification and mRNA/protein expression levels is likely to exist. 

o The Examiner also cited Haynes et al to show that transcript levels and protein levels do 
not correlate. However, Appellants had shown that Haynes themselves admit that " there was a 
general trend, although no strong correlation between protein [expression] and transcript levels" 
(see Figure 1 and page 1 863, paragraph 2.1, last line). Therefore, when the proper legal standard 
is used, Haynes clearly supports the Appellants 1 position that in general, a positive correlation 
exists between mRNA and protein expression levels . Since accurate prediction is not the 
standard , a prima facie case of lack of utility has not been met based on the cited references 
Pennica et al, Konopka et al. and Haynes et al Appellants respectfully submit that, contrary to 
the Examiner's assertion, none of the cited reference conclusively establish a prima facie case 
for lack of utility for the PR0341 molecule. 

Appellants have already discussed references Hu et ai, et al, Chen et al, Lian et ai, 
Fessler et al, Greenbaum et al, in great detail in their previous responses (at least see Appeal 
brief filed November 3, 2005), and these arguments are hereby incorporated by reference for 
brevity. Appellants have further discussed references LaBaer et al, Gygi et al, in great detail in 
their Reply brief filed March 9, 2006), and these arguments are also hereby incorporated by 
reference for brevity. 

Appellants discuss some of the references below. Briefly, the analytical methods utilized 
by Hu et al have certain statistical drawbacks, as the authors themselves admit, and Hu et a/.'s 
conclusions only apply to a specific type of breast tumor (estrogen receptor (ER)-positive breast 
tumor) and cannot be generalized to breast cancer genes in general, let alone to cancer genes in 
general. 

Regarding Lian et al , Appellants respectfully submit that Lian et al only teach that 
protein expression may not correlate mRNA level in differentiating myeloid cells, and not about 
genes in general . In fact, the authors themselves admit that there were a number of problems 
with their data . For instance, at page 520 of this article, the authors explicitly express their 
concerns regarding the methods they utilized and the interpretation of their data stating that 
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" [fjhese data must be considered with several caveats: membrane and other hydrophobic 
proteins and very basic proteins are not well displayed bv the standard 2DE approach, and 
proteins presented at low level will be missed. In addition, to simplify MS analysis, we used a 
Coomassie dye stain rather than silver to visualize proteins, and this decreased the sensitivity of 
detection of minor proteins. 11 (Emphasis added). Besides, Lian et al 's conclusions are based on 
the Coomassie dye staining method, which is not a very sensitive method of measuring protein. 

Similarly, in Fessler et al., examined lipopoysaccharide-activated neutrophilins, in 
response to LPS stimulation. Fessler et al also used the Coomassie Blue dye staining method, 
and concede that it is known to have a limited protein binding range and a non-linear curve for 
protein detection. Protein identification in their study was also done using two-dimensional 
PAGE, which is, by their own admission, limited only to well-resolved regions of the gel , and 
therefore, may have performed less well with hydrophobic and high molecular weight proteins 
(see page 31301, col. 1). 

In addition, Chen et al also concede that there are problems with 2D gel protein 
detection and therefore, cannot accurately predict protein levels. For instance, Chen et al says 
that, "(i)t is apparent that without prior enrichment only a relatively small and highly selected 
population of long-lived, highly expressed proteins is observed. There are many more proteins in 
a given cell which are not visualized by such methods. Frequently it is the low abundance 
proteins that execute key regulatory functions 11 (page 1870, col. 1, emphasis added). Thus, Chen 
et al. 9 concede that by selecting proteins visualized by 2D gels, they are likely to have excluded 
in their analysis many key regulatory proteins which could be candidate cancer markers. 

Further, the microarray analysis provided by Chen et al in fact support the Appellants 
general proposition that, even if protein levels cannot be accurately predicted (which is not 
required by the utility standard), in the majority of the proteins studied, it is most likely than not 
that an increase in gene amplification or mRNA levels generally correlates well with increased 
protein levels. In Table 1, which lists 66 genes [the paper incorrectly states there are 69 genes 
listed] for which only one protein isoform is expressed, Chen et al show that 40 genes out of 66 
had a positive correlation between mRNA expression and protein expression , which clearly 
meets the standard for "more likely than not". Similarly, in Table II , 22 genes out of 30 [again 
the paper incorrectly states there are 29] and at least one isoform showed a positive correlation 
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between mRNA expression and protein expression. Furthermore, 12 genes out of 29 showed a 
strong positive correlation [as determined by the authors] for at least one isoform. Therefore, the 
authors of the Chen paper clearly agreed that microarrays provided a reliable measure of the 
expression levels of the gene and could be used to identify genes whose overexpression is 
associated with tumors. 

In summary, Hu et al, Lian et al Chen et al, and Fessler et al do not conclusively teach 
that, in general, protein levels cannot be accurately predicted from mRNA/ gene amplification 
levels. These authors concede that either due to insensitive protein detection methods or due 
their methodologies utilized in their protocols, some protein species may have been 
underrepresented over others. Therefore, the teachings of these references cannot be relied upon 
to establish a prima facie showing of lack of utility. On the other hand, as noted even in Haynes 
et al and Chen et al, most genes showed a positive correlation between increased gene 
amplification, mRNA and translated protein. 

Alberts and Lewin et al (Items 17 and 18 of the Evidence List) 

The Examiner acknowledges that "the teachings of Alberts and Lewin (which) disclose 
that initiation of transcription is a common point for a cell to regulate the gene expression" but 
asserts that "it is not the only means of regulating gene expression". (Final Office Action of 
November 6, 2006). 

Appellants respectfully disagree and submit that the utility standard is not absolute 
certainty. Therefore, Appellants do not need to establish that the production of RNA is 
inevitably equated with production of protein in order to meet the utility standard. Instead, as 
long as it is more likely than not that a change of the transcription level of a gene gives rise to a 
change in translation level of a gene, the utility standard is met. 

Orntoft et al (Item 105 of the Evidence List) 

The Examiner alleges that "Orntoft et al do not appear to look at gene amplification, mRNA 
levels and polypeptide levels from a single gene at a time. . ..Orntoft et al. concentrated on 
regions of chromosomes with strong gains of chromosomal material containing clusters of genes 
(p.40). This analysis was not done for PR0341 in the instant specification. That is, it is not 
clear whether or not PR0341 is in a gene cluster in a region of a chromosome that is highly 
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amplified. Therefore, the relevance, if any of Orntoft et al is not clear." (page 12; Final Office 
Action of November 6, 2006). 

Appellants respectfully point out that in Orntoft et al, 1,800 genes that yielded an 
increase or decrease in mRNA expression in two invasive tumors compared to the two non- 
invasive papillomas were then mapped to chromosomal locations. The chromosomes had 
already been analyzed for amplification by hybridizing tumor DNA to normal metaphase 
chromosomes (CGH). Orntoft et al used CGH alterations as the independent variable and 
estimated the frequency of expression alterations of the 1,800 genes in the chromosomal areas. 
Orntoft et al found that in general (77% and 80% concordance) areas with a strong gain of 
chromosomal material contained a cluster of genes having increased mRNA expression (see 
page 40). Orntoft et al state, "For both tumors TCC733 (p<0.015) and TCC827 (p<0.00003) a 
highly significant correlation was observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array based technology" (see page 41, 
column 1). Orntoft et al, also studied the relation between altered mRNA and protein levels 
using 2D-PAGE analysis. Orntoft et al state, "In general there was a highly significant 
correlation (p<0.005) between mRNA and protein alterations.. . . 26 well focused proteins whose 
genes had a known chromosomal location were detected in TCCs 733 and 335, and of these 19 
correlated (p<0.005) with the mRNA changes detected using the arrays." (See page 42, column 
2 to page 34, column 2). Accordingly, Orntoft et al clearly support Appellants' position that 
proteins expressed by genes that are amplified in tumors are useful as cancer markers. The 
Examiner has stated that Appellants have not indicated whether PR0341 is in a gene cluster 
region of a chromosome. Appellants fail to see how this is relevant to the analysis. Orntoft et al 
did not limit their findings to only those regions of amplified gene clusters. 

Meric et al (Item 94 of the Evidence List) 

The Examiner asserts that Meric teaches that "gene expression is quite complicated and is 
also regulated at the level of mRNA stability, mRNA translation, and protein stability" (Final 
Office Action of November 6, 2006). 

Appellants respectfully submit that Meric simply summarizes the translation regulation of 
cancer cells. Meric indicates that translation initiation is regulated in response to nutrient 
availability and mitogenic stimulation and is coupled with cell cycle progression and cell growth. 
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Meric further discusses that alteration in translation control occur in cancer. For example, 
variant mRNA sequences can alter the translational efficiency of individual mRNA molecule. ' 
(See Abstract). Meric further teaches that the changes of the translational efficiency of a mRNA 
transcript depend on the mutation of a specific mRNA sequence. (Page 973, column 2 to 
page 91556, column 1). Meric never suggest that the translation of a cancer gene is suppressed 
in cancer in general and therefore, an increased mRNA levels will not yield an increased protein 
levels. To the contrary, Meric teaches that the translation efficiency of a number of cancer genes 
is enhanced in cancer cells compared to its normal counterpart. For instance, in patient with 
multiple myeloma, a C-T mutation in the c-myc IRES was identified and found to cause an 
enhanced initiation of translation. (Page 91556, column 1). Therefore, the level of proteins 
encoded by these genes increases in cancer cells at an even higher magnitude than the mRNA 
level. 

Appellants emphasize that it is not a legal requirement to establish an absolute correlation 
between an increase in the mRNA level and protein expression levels that would correlate to the 
disease state nor is it imperative to find evidence that protein levels can be accurately predicted. 
Therefore, the Examiner has misinterpreted the teaching of Meric and applied improperly high 
evidentiary standard. Therefore, the Examiner has failed to establish a prima facie showing of 
lack of utility. 

Godbout et ah (Item 48 of Evidence List) 

The Examiner asserts that of the references cited by Appellants in their IDS filed 
September 25, 2006, only "Godbout is drawn to the predictability of protein levels based upon 
genomic DNA amplification" (Final Office Action of November 6, 2006). 

Appellants respectfully submit that, as discussed in the Response of September 25, 2006, 
there are several other references; for instance, Bea et al. investigated gene amplification, mRNA 
expression, and protein expression of the putative oncogene BMI-1 in human lymphoma 
samples, and supports Appellants' assertion that gene amplification is correlated with both 
increased mRNA and protein expression. 
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The Examiner further asserts that unlike Godbout et aL, the instant specification does not 
teach structure/ function analysis and the Examiner questions whether the level of genomic 
amplification of DDX1 gene is comparable to the disclosed PR0341 . 

Appellants respectfully submit that it was never claimed that PR0341 is similar in any 
way to the DDX1 gene of Godbout et aL; in fact the Godbout reference was submitted to show 
good correlation between protein levels based upon genomic DNA amplification, which the 
Examiner clearly agrees with. Structure/ function data, which the Examiner requests, is not a 
requirement for the utility requirement, hence this rejection is improper. 

Lillev et aL, King et al. and Wildsmith et aL 

The Examiner cites Lilley et al to show that "the extrapolation that changes in transcript 
level will also result in corresponding changes in protein amount or activity cannot always be 
made." (Final Office Action of November 6, 2006). 

Appellants repeat that it is not a legal requirement for utility, to establish a necessary 
correlation between an increase in the mRNA level and protein expression levels, or to show that 
changes in transcript level should always result in corresponding changes in protein amount or 
activity. Accordingly, the question is not whether a correlation between an increase in mRNA 
and protein expression levels always exists, rather if it is more likely than not that a person of 
ordinary skill in the pertinent art would recognize such a positive correlation. Nowhere in the 
Lilley paper does the author suggest that it is more likely than not that altered mRNA levels 
does not correlate with altered protein levels. On the contrary, the statement that "changes in 
transcript level will also result in corresponding changes in protein amount or activity cannot 
always be made" implies that the mRNA/protein correlation exists in most cases. 

The Examiner cites King et al to show that "mRNA levels do not necessarily correlates 
with protein levels." (Final Office Action of November 6, 2006). 

First of all, Appellants note that the instant application is not directed to the microarray 
assay as in King et aL . As discussed above, the utility standard is not absolute certainty. 
Therefore, Appellants should not be required to show a "necessary correlation" between mRNA 
and protein levels in order to establish a patentable utility. King never indicates that it is more 
likely than not that a general correlation between the mRNA and protein levels for a gene does 
not exist, this paper alone does not suffice to establish a prima facie showing of lack of utility. 
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Appellants also note that the author discussed numerous advantages of the microarray 
technology, which offers tremendous advantages in the study of human diseases. For instance, 
on page 2287, the author states that "microairays can be expected to prove extremely valuable as 
tools for the study of the generic basis of complex diseases. The ability to measure expression 

profiles across entire genomes provides a level of information not previously attainable 

Microairays make it possible to investigate differential gene expression in normal vs. diseased 
tissue, in treated vs. non-treated tissue, and in different stages during the natural course of the 
disease, all on a genomic scale. Gene expression profiles may help to unlock the molecular basis 

of phenotype, response to treatment, and heterogeneity of disease " Therefore, if anything, 

the King reference supports the use of the microarray in the diagnosis of human diseases, which 
silently assumes that, most probably, increases in mRNA levels correlate well with increases in 
protein levels which in turn impacts disease. 

Similarly, Wildsmith never indicated that it is more likely than not that a general 
correlation between the mRNA and protein levels for a gene does not exist. Therefore, this paper 
is not sufficient to establish a prima facie showing of lack of utility. In fact, the Wildsmith paper 
discusses examples of a number of successes of microarray applications in the detection of 
human diseases (see Page 284). For instance, the author has pointed out that "one area of rapid 
progress using microarray technology is the increased understanding of cancer. Molecular 
pathologies are subgrouping cancers of tissues such as blood, skin, and breast, based on 
differential gene expression patterns. For example, within a small group of breast cancer tissue 
samples, Perou et al distinguished two broad subgroups representing those expressing or 
alternatively lacking expression of the oestrogen receptor- y- gene. The work was not 
conclusive, but never has progress in this field been so rapid when compared with the previous 
methods of gene amplification. Another example of the impact of this technology is in the 
identification of two biomarkers for prostate cancer, namely hepsin and PIM1 (Dhanasekaran et 
a/., 2001). Microarray technology has also accelerated the understanding of the molecular events 
surrounding pulmonary fibrosis. Specially, two distinct clusters of genes associated with 
imflammation and fibrosis have been identified in a disease where, for years, the pathogenesis 
and treatment have remained unknown (Katsuma et al, 2001)." 
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Therefore, collectively, the references Lilley et ai, Wildsmith et al, and King et al 
references cited by the Examiner show that the art indicates that, generally, if a mRNA is 
overexpressed in cancer, it is more likely than not that the encoded protein will also be expressed 
at an elevated level. 

Madoz Gurpide et aL and Celis et al 

The Examiner cites Madoz-Gurpide et al and Celis et al to show that one cannot 
accurately predict protein levels based on mRNA levels (made of record by the Examiner in the 
Final Office Action mailed November 6, 2006). 

.Madoz-Gurpide et al explains that mRNA expression alone does not provide information 
regarding the "activation state, post-translational modification or localization of 
corresponding proteins" (emphasis added; page 168, col. 1). That is, Madoz-Gurpide et al 
explain that mechanisms are not apparent from mRNA expression alone. Madoz Gurpide et al 
further state that, it is "unclear" how well the reported RNA levels correlate with protein levels. 
In support of this assertion, the authors cite only a single reference, namely, the Chen et al, 
which was discussed above and in detail in the Reply Brief filed March 9, 2006. Madoz Gurpide 
et al also acknowledge that the DNA microarray studies, such as those carried out by Beer et al 
(specifically cited by the authors at page 52), "justify the use of this technology for uncovering 
patterns of gene expression that are clinically informative" (emphasis added; page 53). Thus, 
while Madoz-Gurpide et al note that it is "more difficult to develop an understanding of 
disease at a mechanistic level using DNA microarrays," (emphasis added; page 53), Appellants 
respectfully point out that that "understanding of a disease at the mechanistic level" is not 
relevant to Appellants' assertions of utility, as discussed above. Accordingly, a prima facie case 
cannot be made based on the teachings within the Madoz-Gurpide et al reference. 

Like Madoz-Gurpide et al, Celis et al discuss that mechanisms are not apparent from 
mRNA expression alone. Celis et al note that "proteomics addresses problems that cannot be 
approached by DNA analysis, namely, relative abundance of the protein product, post- 
translational modification, subcellular localization, turnover, interaction with other proteins as 
well as functional aspects" (page 6, col. 2). However, in their discussion, Celis et al cite Orntoft 
et al (Item 103 of Evidence List) and note that "in most cases there was a good correlation 
between transcript and protein levels." Celis et al further explain that those few cases which 

-20- 

On Appeal to the Board of Patent Appeals and Interferences 

Appellants' Brief 
Application Serial No. 09/990,71 1 
Attorney's Docket No. 39780-2730 P1C2 



showed apparent discrepancies may have been due to other causes, such as post-transcriptional 
processing or degradation of the protein, or the choice of methods used to assess protein 
expression levels. Celis et al further note that the observation that there is often more change in 
mRNAs as compared to the proteins may be due to the fact that current technologies detect 
mainly high abundance proteins, while most of the changes affecting protein levels may involve 
low abundance proteins. Thus, the correlation between mRNA and protein levels may be even 
higher than typically observed, given these factors. Celis et al explain that proteomics is useful 
in combination with arrays "for the entire process of drug development and evaluation." (page 6; 
col. 1). Appellants further submit that significant correlations between gene and protein 
expression are most likely to be observed for genes associated with cancer, since as Celis et al 
note, "transformation resulted in the abnormal expression of normal genes, rather than in the 
expression of new ones" (page 1 1, col. 1). Accordingly, alterations in gene amplification or 
expression are more likely to be associated with altered protein expression in the case of cancer 
than in other cases where DNA microarrays are used, because, as explained by Celis et al., the 
alterations in expression levels of certain normal proteins are part of the process that leads to 
cancer. 

Accordingly, a prima facie case cannot be made based on the teachings within the Celis 
et al reference; In fact, Appellants submit that, Celis et al support the Appellants contention 
that it is more likely than not that changes in mRNA levels reflect changes in protein levels, in 
general. 

Nagaraia et al. Waghrav et aL Saganaliev et al 

The Examiner also asserts that "[c]omprehensive studies where significantly large 
numbers of transcripts and proteins were examined report that increases in mRNA and protein 
samples are not correlated." (Final Office Action of November 6, 2006). In support of this 
assertion, the Examiner cites three new references, by Nagaraja et al, Waghray et al, and 
Sagynaliev et al 

The Examiner cites Nagaraja et al as allegedly teaching that "the proteomic profiles 
indicated altered abundance of fewer proteins as compared to transcript profiles." (Final Office 
Action of November 6, 2006). 
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Appellants respectfully submit that the fact that many more transcripts than proteins were 
found to be differentially expressed does not mean that most mRNA changes did not result in 
correlating protein changes, but merely reflects the fact that expression levels were only 
measured at all for many fewer proteins than transcripts . In particular, the total number of 
proteins whose expression levels could be visualized on silver-stained gels was only about 300 
(page 2332, col. 1), as compared to the approximately 14,500 genes on the microarray chips for 
which mRNA levels were measured (page 2336, col. 1). Since the expression levels of so many 
fewer proteins than transcripts were measured, it is hardly surprising that a smaller absolute 
number of proteins than mRNAs were found to be overexpressed, because the protein products 
of most of the overexpressed mRNAs would not have been among the small number of proteins 
identified on the gels. 

The Examiner next cites Waghray et al, to the effect that "for most of the proteins 
identified, there was no appreciable concordant change at the RNA level," and that "[t]he change 
in intensity for most of the affected proteins identified could not be predicted based on the level 
of the corresponding RNA." (Final Office Action of November 6, 2006). Appellants reiterate 
that they need only show that there is a correlation between mRNA and protein levels, such that 
mRNA overexpression generally predict protein overexpression. A showing that mRNA levels 
can be used to "accurately predict" the precise levels of protein expression is not required . 

Appellants also emphasize that Appellants are asserting that a measurable change in 
mRNA level generally leads to a corresponding change in the level of protein expression, not 
that changes in protein level can be used to predict changes in mRNA level. Waghray et al did 
not take genes which showed significant mRNA changes and check the corresponding protein 
levels . Instead, the authors looked at a small and unrepresentative number of proteins, and 
checked the corresponding mRNA levels. Waghray et al. acknowledge that only "[a] relatively 
small set of genes could be analyzed at the protein level, largely due to the limited sensitivity of 
2-D PAGE" (page 1337, col. 1). In particular, while the authors examined the expression levels 
of 16,570 genes (page 1329, col. 2), they were able to measure the expression levels of only 
1031 proteins (page 1333, col. 2). Waghray et al does not teach that changes in mRNA 
expression were not correlated with changes in expression of the corresponding protein. All 
Waghray et al state is that "for most of the proteins identified, there was no appreciable 
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concordant change at the mRNA level" (page 1337, col. 2). This statement is not relevant to 
Appellants' assertion of utility, since Appellants are not asserting that changes in mRNA levels 
are the only cause of changes in protein levels. Waghray et aL do not contradict Appellants' 
assertion that changes in mRNA expression, in general, correspond to changes in expression of 
the corresponding protein. 

Lastly, the Examiner cites Sagynaliev et aL, as allegedly teaching that "it is also difficult 
to reproduce transcriptomics results with proteomics tools." In particular, the Examiner notes 
that according to Sagynaliev et aL, of 982 genes found to be differentially expressed in human 
CRC, only 177 (18%) have been confirmed using proteomics technologies. (Final Office Action 
of November 6, 2006). 

The Sagynaliev et aL reference, titled " Web-based data warehouse on gene expression in 
human colorectal cancer" (emphasis added), drew conclusions based upon a literature survey of gene 
expression data published in human CRC , and not from experimental data. While a literature survey 
can be a useful tool to assist researchers, the results may greatly over-represent or under-represent 
certain genes, and thus the conclusions may not be generally applicable. In particular, Appellants 
note that, as evidenced by Lian et aL, Nagaraja et aL, and Waghray et aL, discussed above, the 
number of mRNAs examined in transcriptomics studies is typically much larger than the number of 
proteins examined in corresponding proteomics studies , due to the difficulties in detecting and 
resolving more than a small minority of all expressed proteins on 2D gels. Thus the fact that only 
18% of all genes found to be differentially expressed in human CRC have been confirmed using 
proteomics technologies does not mean that the corresponding proteins are not also differentially 
expressed, but is most likely due to the fact that the corresponding proteins were not identified on 2D 
gels, and thus their expression levels remain unknown. 

The authors of Sagynaliev et aL acknowledge the many technical problems in finding 
proteomic data for CRC that can be matched to transcriptpmic data to see if the two correlate. The 
authors state that "results have been obtained using heterogeneous samples in particular cell lines, 
whole tissue biopsies, and epithelial cells purified from surgical specimens." However, "Results 
obtained in cell lines do not allow accurate comparison between normal and cancer cells, and the 
presence/absence of proteins of interest has to be confirmed in biopsies." (Page 3072, left column.) 
In particular, the authors specifically note that "only a single study [1] provided differential display 

-23- 

On Appeal to the Board of Patent Appeals and Interferences 

Appellants 9 Brief 
Application Serial No. 09/990,71 1 
Attorney's Docket No. 39780-2730 P1C2 



protein expression data obtained in the human patient, using whole tissue biopsy." (Page 3068, left 
column, second paragraph; see also, Table 2.) 

Appellants further note that Table 2 shows that 6 out of 8 published proteomics studies were 
done using 2-D PAGE. However, the authors state that "2-D PAGE or 2-D DIGE have well-known 
technological limitations . . . even under well-defined experimental conditions, 2-D PAGE parallel 
analysis of paired CRC samples is hampered by a significant variability." (Page 3077, left column, 
third paragraph.) Therefore, Appellants respectfully submit that it is well known in the art that there 
are problems associated with selecting only those proteins detectable by 2D gels. 

Li et ai 

The Examiner cites new reference Li et al as teaching that "68.8% of the genes showing 
over-representation in the genome did not show elevated transcript levels." (Final Office Action 
of November 6, 2006). 

Appellants respectfully point out that Li et al acknowledge that their results differed 
from those obtained by Hyman et al and Pollack et al (of record), who found a substantially 
higher level of correlation between gene amplification and increased gene expression. The 
authors note that "[t]his discordance may reflect methodologic differences between studies or 
biological differences between breast cancer and lung adenocarcinoma" (page 2629, col. 1). For 
instance, as explained in the Supplemental Information accompanying the Li article, genes were 
considered to be amplified if thev had a copy number ratio of at least 1 .40 . In the case of 
PR0341, as discussed in previously filed responses and in the Goddard Declaration (of record), 
an appropriate threshold for considering gene amplification to be significant is a copy number of 
at least 2.0 (which is a higher threshold). The PR0341 gene showed significant amplification of 
2.173 fold to 2.514-fold in three lung tumors, and thus fully meets this standard. It is not 
surprising that, in the Li et al reference, by using a lower threshold of 1 .4 for considering gene 
amplification, a higher number of genes not showing corresponding increases in mRNA 
expression were found. Nevertheless, the results of Li et al do not conclusively disprove that a 
gene with a substantially higher level of gene amplification, such as PRQ34L would be expected 
to show a corresponding increase in transcript expression. 

Therefore, the Patent Office has failed to meet its initial burden of proof that Appellants' 
claims of utility are not substantial or credible. The arguments presented by the Examiner based 
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on references Pennica et aL 9 Konopka et aL 9 Haynes et aL, Hu et aL 9 Chen et al. 9 Lian et aL 9 
Fessler et al. 9 Lilley et al. 9 King et a/., Wildsmith et al 9 Nagaraja et al 9 Sagynaliev et aL, 
Waghray et al 9 Madoz-Gurpide et aL, Celis et al. 9 and Li et al. 9 do not provide sufficient reasons 
to doubt the statements by Appellants that PR0341 has utility as a diagnostic marker for colon 
cancer. Appellants once again remind the Examiner that only after the Examiner has made a 
proper prima facie showing of lack of utility, does the burden of rebuttal shift to the Appellant. 
Based on the above discussions, such a showing has not been made. Accordingly, the instant 
rejection should be withdrawn for the Examiner's lack of establishment of a prima facie 
showing. 

In summary, Appellants maintain that even though there are certain instances where a 
correlation, between DNA/mRNA and protein levels do not exist, in most cases , there is 
generally good correlation between them, and this was collectively demonstrated in the more 
than 100 references submitted by the Appellants in the IDS filed September 25, 2006. 

D. The Gene Amplification Data Establishes Credible, Substantial and Specific 
Patentable Utility for the PRQ341 Polypeptide and its Antibodies 

On the other hand, as discussed throughout prosecution, Appellants submit that Example 
170 of the specification further discloses that, "(amplification is associated with overexpression 
of the gene product, indicating that the polypeptides are useful targets for therapeutic 
intervention in certain cancers such as lung, colon, breast and other cancers and diagnostic 
determination of the presence of those cancers" (Emphasis added). Appellants have also 
submitted ample evidence to show that, in general, if a gene is amplified in cancer, it is "more 
likely than not" that the encoded protein will also be expressed at an elevated level. 

Besides, the Declaration by Dr. Paul Polakis (Polakis I - made of record in Appellants* 
Response filed November 15, 2004), principal investigator of the Tumor Antigen Project of 
Genentech, Inc., the assignee of the present application, explains that in the course of Dr. 
Polakis 1 research using microarray analysis, he and his co-workers identified approximately 200 
gene transcripts that are present in human tumor cells at significantly higher levels than in 
corresponding normal human cells. Appellants submit that Dr. Polakis' Declaration was 
presented to support the position that there is a correlation between mRNA levels and 
polypeptide levels. The second Declaration by Dr. Polakis (Polakis II) presented evidentiary 
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data in Exhibit B. Exhibit B of the Declaration identified 28 gene transcripts out of 31 gene 
transcripts (i.e., greater than 90%) that showed good correlation between tumor mRNA and 
tumor protein levels. As Dr. Polakis' Declaration (Polakis II) says "[a]s such, in the cases where 
we have been able to quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed that in the vast majority of cases, there is a 
very strong correlation between increases in mRNA expression and increases in the level of 
protein encoded by that mRNA." Accordingly, Dr. Polakis has provided the facts to enable the 
Examiner to draw independent conclusions regarding protein data. Appellants further emphasize 
that the opinions expressed in the Polakis Declaration, including in the above quoted statement, 
are all based on factual findings. For instance, antibodies binding to about 30 of these tumor 
antigens were prepared and mRNA and protein levels were compared. In approximately 80% of 
the cases , the researchers found that increases in the level of a particular mRNA correlated with 
changes in the level of protein expressed from that mRNA when human tumor cells are 
compared with their corresponding normal cells . Therefore, Dr. Polakis 1 research, which is 
referenced in his Declaration, shows that, in general there is a correlation between increased 
mRNA and polypeptide levels . Hence, one of skill in the art would reasonably expect that, based 
on the gene amplification data of the PR0341 gene, the PR0341 polypeptide is concomitantly 
overexpressed in lung tumors studied as well. 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip® arrays in 2004. Clearly, the research community believe that the 
information obtained from these chips is useful (i.e., that it is more likely than not that the results 
are informative of protein levels). 

Thus, based on the asserted utility for PR0341 in the diagnosis of lung tumors, the 
reduction to practice of the instantly claimed protein sequence of SEQ ID NO: 20 in the present 
application, the disclosure of the step-by- step protocols for making chimeric PRO polypeptides, 
including those wherein the heterologous polypeptide is an epitope tag or an Fc region of an 
immunoglobulin in the specification (at page 374, lines 24 to page 375, line 9), the disclosure of 
a step-by-step protocol for making and expressing PR0341 in appropriate host cells (in 
Examples 140-143 and page 376, line 12), the step-by-step protocol for the preparation, isolation 
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and detection of monoclonal, polyclonal and other types of antibodies against the PR0341 
protein in the specification (at pages 390-395) and the disclosure of the gene amplification assay 
in Example 170, the skilled artisan would know exactly how to make and use the claimed 
polypeptide and its antibodies for the diagnosis of lung cancers. Appellants submit that based on 
the detailed information presented in the specification and the advanced state of the art in 
oncology, the skilled artisan would have found such testing routine and not 'undue.' 

Therefore, Appellants respectfully request reconsideration and reversal of this 
outstanding rejections under 35 U.S.C. §101 and§112, First Paragraph to Claims 119-123. 



For the reasons given above, Appellants submit that present specification clearly 
describes, details and provides a patentable utility for the claimed invention. Moreover, it is . 
respectfully submitted that based upon this disclosed patentable utility, the present specification 
clearly teaches "how to use" the presently claimed polypeptide. As such, Appellants respectfully 
request reconsideration and reversal of the outstanding rejection of Claims 1 19-123. 

Please charge any fees, including fees for extension of time or other fees, or credit any 
overpayment to Deposit Account No. 08-1641 referencing Attorney's Docket 



HELLER EHRMAN LLP 

275 Middlefield Road 
Menlo Park, California 94025-3506 
Telephone: (650) 324-7000 
Facsimile: (650) 324-0638 
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Date: September 5, 2007 
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VIII. 

Claims on Appeal 

119. An antibody that specifically binds to the polypeptide of SEQ ID NO: 20. 

120. The antibody of Claim 119 which is a monoclonal antibody. 

121. The antibody of Claim 1 1 9 which is a humanized antibody. 

122. The antibody of Claim 1 19 which is an antibody fragment. 

123. The antibody of Claim 1 19 which is labeled. 
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X. RELATED PROCEEDINGS APPENDIX 

None- no decision rendered by a Court or the Board in any related proceedings identified 

above. 
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Abstract 

Lung cancer is the most common cause of cancer death 
in developed countries. The prognosis is poor, with less. than 
15% of patients surviving 5 years after diagnosis. The poor 
prognosis is attributable to lack of efficient diagnostic meth- 
ods for early detection and lack of successful treatment for 
metastatic disease. Most patients (>75%) present with stage 
111 or IV disease and are rarely curable with current ther- 
apies. Within the last decade, rapid advances in molecular 
biology, pathology, bronchology, and radiology have pro- 
vided a rational basis for improving outcome. These ad- 
vancements have led to a better documentation of morpho- 
logical changes in the bronchial epithelium before development 
of clinical evident invasive carcinomas. This has changed our 
concept of lung carcinogenesis and emphasized the multislep 
carcinogenesis approach on several levels. Combined with 
the technical developments in bronchoscopic techniques, 
e.g., laser-induced fluorescence endoscope (UFE) bronchos- 
copy, we now have improved methods to localize preinvasive 
and early-invasive bronchial lesions. With the LIFE bron- 
choscope, a nevr morphological entity (angiogenic squamous 
dysplasia) has been recognised, which might be an impor- 
tant biomarker and target for antiangiogenic chemoprevc »- 
tive agents. To reduce the mortality of lung cancer, these 
new technologies have been taken into the clinic in different 
scientific settings. The use of low-dose spiral computed to- 
mography in the screening of a high-risk population has 
demonstrated the possibility of diagnosing small peripheral 
tumors that are not seen on conventional X-ray. A shift in 
the therapeutic paradigm from targeting advanrcd clinically 
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manifest lung cancer toward asymptomatic preinvasive and 
early- invasive cancer is occurring. The present article re- 
views the recent advances in the diagnosis of preinvasive and 
early- invasive cancer to identify biomarkcrs for early detec- 
tion of Jung cancer and for chemoprevenlion studies. 

Introduction 

Lung cancer is the most common cause of cancer deaths in 
the countries of North America and other developed countries, 
accounting for 29% of all cancer deaths and more deaths than 
from prostate, breast, and colorectal cancer combined in the 
United States (1). Lung cancer, will be diagnosed in —170,000 
new patients in the United States in the year 2000, and < 15% of 
them will survive 5 years after diagnosis f 1). The prognosis for 
lhe patients wiih lung cancer is strongly condoled to the stage 
of the disease at the lime of diagnosis. Whereas patients with 
clinical stage IA disease have a 5-year survival of about 60%, 
the clinical stage 1I-IV disease 5-year survival rale ranges from 
40% to less than 5% (2). Over two-thirds of the patients have 
regional lymph-node involvement or distant disease at lhe lime 
of presentation (3). The poor prognosis is largely attributable to 
the lack of effective early detection methods and the inability to 
ewe metastatic disease. The unsatisfactory cure rates supports 
efforts aimed at early identification and inierveniion in lung 
cancer. 

Historically, the only diagnostic tests available for the 
detection of Jung cancer in its early slages were chest radiogra- 
phy and sputum cytology. The efficacy of these tests as mass 
screening tools was evaluated in controlled dials sponsored by 
lhe NCI 3 and conducicd at Johns Hopkins University, Memorial 
Sloan-Kctlciing Cancer Center, and the Mayo Clinic during lhe 
1970s (4-6). The principal goal of these studies was to deter- 
mine whether a reduction in lung cancer mortality could be 
achieved by adding sputum cytology testing to annual scteening 
by chest radiography. Results horn these trials showed thai both 
tests could detect piesympiomatic, eatly-stage carcinoma, par- 
ticularly of squamous cell type. Kesectabilrty and survival rates 
were found to be generally higher in Ihc study gioups than in the 
control gioups. However;, improvements in readability and 
survival did nOi lend to a reduction in overall lung cancer 
mortality, the most criiical end point. A subsequent study of 
6346 Czcchosloyakian male smokers also found no reduction in 
hint 1 cancer mortality aftei duu! screening by chest radiogtaphy 
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and sputum cytology (7). The negative results from these 
1 screening studies lead the NCI and other health policy and 
research groups to conclude that mass screening programs in- 
volving periodic sputum cytological evaluation and chest radio- 
graphs couJd not be justified. However, controversies in the 
methodology and interpretation of the data from these studies 
have later been extensively discussed (8, 9). One additional 
study of annual chest X-ray screening is currently being con- 
ducted by lhe NCI; The Prostate-, Lung-, Colorectal-, and Ovar- 
ian (PLCO) screening trial This trial includes individuals 55-74 , 
. years old, but they are not selected for this trial on the basis of 
high risk for lung cancer (e.g., smoking history with >20 
pack-years). 

The failure of clinical trials to demonstrate the efficacy of 
sputum cytology and chest radiography as mass screening tools 
has resulted in a search for better diagnostic approaches for 
early lung cancer detection that take advantage of recent devel- 
opments in molecular biology, gene technology, and radiology 
(10). Furthermore, as has been the case for mammography 
screening for breast cancer, it has also been important to identify 
risk groups for rung cancer. 

Although, much is known about predisposing factors, nat- 
ural history, and the outcome based on histology and stage, our 
understanding remains very incomplete in many areas. What are . 
the early premaJignant changes molecularly, biochemically, and 
morphologically? Which changes are reversible and which are 
not? What research tools are available to provide answers to 
these questions? The identification of preinvasive lesions allows 
for developing promising methods for early intervention (1J). 
The therapeutic paradigm and focus are today shifting from 
targeting only clinically verified lung cancer as previously to- 
ward targeting, the premalignant and early- malignant Jesions. 
Furthermore, the prospect of lung cancer screening has today 
become more meaningful as a consequence of recent develop- 
ments in biology and radiology and better possibilities to define 
high-risk populations most suitable for lung cancer screenine 
(12). ^ 

The present article will focus on the clinical perspectives of 
our biological knowledge of premalignant and early- malignant 
lesions and the potential of the recent technological advance- 
ment for early diagnosis of lung cancer. 

Pathology, of Preinvasive and Early Invasive 
Bronchial Lesions 

Most of the efforts to classify lime cancer have been 
directed toward invasive carcinoma (13). However, better un- 
• dcrstanding of the pathogenesis of lung cancer aroused renewed 
interest in morphological abnormalities thai fall short of inva- 
sive carcinoma but may indicate initiation of carcinogenesis. 
These morphological abnormalities are referred to as prernva- 
sive lesions and are shown in Fig. I. The last editron of the 
WHO classification of lung rumors included the classification of 
preinvasive lesions as a separate section. Numerous recent stud- 
ies have indicated that long cancer is not the result of a sudden 
transforming event in lhe bronchial epithelium but a multiMcp 
process in which era dually accruing sequential genetic and 
cellular changes result in the formation of an invasive (:.e.. 
malignant) tumor. Mucosal changes in the taicc airways that 



may precede or accompany invasive squamous carcinoma in- 
clude hyperplasia, metaplasia, dysplasia, and'CIS (14): Hyper- 
plasia of the bronchial epithelium and squamous metaplasia 
have generally been considered reversible, and not premalignant 
. in the sense of squamous dysplasia and CIS (15). 

Squamous metaplasia is a common finding, especially as a 
response to cigarette smoking. Peters et at. ( 1 6) studied bron- 
choscope biopsies from six sites in 106 heavy "cigarette smok- 
ers; Squamous metaplasia was noted at one or more biopsy sites 
in approximately two- thirds of the group, and one- fourth 
showed squamous metaplasia in three or more biopsy sites. The 
incidence of squamous metaplasia increased with smoking his- 
tory and was highest in individuals who had smoked more than 
two packs of cigarettes a day. Auerbach et al (17) noted similar 
. findings in autopsy tissues: basal cell hyperplasia and squamous 
metaplasia are increased in smokers in proportion to smoking 
history. Hyperplasia and metaplasia are believed to be reactive 
changes in the bronchial epithelium, as opposed to true preneo- 
plastic changes (17, 18). The reasons for this include: (a) they 
are frequently found in association with chronic inflammation, 
and may be induced by mechanical trauma; (b) they spontane- 
ously regress after smoking cessation; (c) in chronic smokers, 
the molecular changes present in these lesions are similar lo 
those present in histologically normal epithelium; and (d) there 
are no reports linking their presence to increased risk for devel- 
oping lung cancer. Jn contrast, moderate-to-severe dysplasia and 
CIS lesions seldom regress after smoking cessation (19). 

Dysplasia and CIS are changes thai frequently precede 
squamous cell carcinoma of the lung. Saccomanno el ai (20) 
studied more than 50,000 samples from 6,000 men, many of 
whom had worked in the uranium mining industry. Both smok- 
ing and uranium mining (radon exposure) .were found to be 
associated with increased incidence of dysplasia, CIS, and in- 
vasive cancer. The studies of Saccomanno et of. established that 
increasing degrees of sputum atypia may be recognized an 
average of 4-5 years before the development of frank lung 
carcinoma. 

Another question is: which grades of spurum atypia pro- 
gress to cancer? From the Johns Hopkins cohort of the NCI 
chest X-ray/sputum screening trial, we know that. among indi- 
viduals with moderate atypia on sputum screening. —10% de- 
veloped known cancer up to 9 years later. Among individuals 
with severe atypia on the spurum screening. >40% developed 
known cancer during the same time period (21). Although there 
are data in the literature showing the relationship between 
sputum atypia and subsequent invasive cancer, there is still very 
little information about the histological progression in the bron- 
chial mucosa in the high risk populations. In a recent publica* 
tion. nine patients with CIS were followed with autoftuorcs- 
cence bronchoscopy at regular intervals, and 5 (56%) had 
progression to invasive cancer despite endobronchial therapy 
(22). 7 be number of invasive cancers might even have been 
higher if treatment bad not been not given. Ongoing studies of 
high- 1 is k subjects {e.g., the Colorado sputum cohort study) 
including serial follow-up bronchoscopies wit) provide evidence 
related to the frequency of development of invasive lung cancer 
as ii relates rn smoking history, air Dow- obstruction, and sputum 
atypia. 

Since the previous WHO- c lassi ft cation wai published in 
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Fig. J J f squamous metaplasia. The 
cells are widely dispersed, wilb a reg- 
ular maturation from (he basai region 
lo the lop. There is keratini2ation, and 
(he nuclei/cyr.oplasmic ratio is low. B, 
moderate dysplasia with ASD. Hypei- 
celhifarity of the epithelium with in- 
complete maturation and micropapil- 
lary invasion of capillaries are seen. 
The nuclei/cyioplasmic ratio is high. 
C, severe dysplasia. There is marked 
pleomorphism of ihe cells with irregu- 
larity and prominent nucleoli. 
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J 98), two nonsquamous lesions have been added to ibe WHO 
classification of premalignant lesions: atypical . alveolar hyper- 
plasia and diffuse idiopathic neuroendocrine cell hyperplasia 
(J 3). Both of these lesions are diagnosed rarely. The former 
consists of lesions <5 mm in diameter and composed of a 
peripheral epithelial cell proliferation with minimal cytologic a) 
atypia or stromal response and resembles bronchioloalveolar 
carcinoma. The lesion has been seen in Jung specimens resected 
for hing cancer, but no prospective significance, of this lesion 
has been reported. However, this morphological lesion may play 
a role for the pathogenesis of peripheral lung adenocarcinomas 
(23, 24). The resolution of spiral CT (currently about 3 mm) 
approaches the diameter of these lesions, and it is anticipated 
that atypical alveolar hyperplasia will be increasingly encoun- 
tered in subjects undergoing Ibis procedure (25). Di0use idio- 
pathic neuroendocrine cell hyperplasia consists of a patchy 
increase in the number of well-differentiated neuroendocrine, 
cells in the bronchioles. This process may result in the formation 
of small carcinoid tumors, and for this reason it is considered 
, "preinvasive." To date, small cell carcinomas have not been 
associated with this lesion (13). 

Recently, the use of fluorescence bronchoscopy (see be- 
low) has increased the recognition of dysplastic lesions in : lbe 
large airways and a new morphological entity, ASD, was iden- 
tified (26). Dysplasia of bronchial epithelium in "micropapiJlo- 
matosis" and the possible link between angiogenesis and prein- • 
vasive bronchial epithelial dysplasia were recognized as early as 
1983 by Muller and Muller (27), who also described the ultra- 
structure of these lesions. Jl has been suggested that ibis angio- 
genesis, which is recognized as capillary loops projecting into 
the dysplastic bronchial lining, is responsible for the reduced 
fluorescence seen in dysplastic lesions by LIFE bronchoscopes 
(Figs. 1 and 3; Ref. 26). Future prospective studies will show 
whether this morphological entity is correlated with a progres- 
sion to lung cancer so as to be a target for the use of antiangio- 
genie agents for chemoprcvention. 

In general, there are several questions/problems relating to 
premalignant lesions, which will be addressed in future studies: 
{a) The morp ho logical criteria for premalignant and early- 
malignant changes, both on sputum cytology and in bronchial 
biopsies, have to be validated for intra- and interobserver repro- 
ducibility. 

{h) Uniform and reproducible morphological/eyiological 
criteria have to be published more extensively, and a traininc set 
of slides should be available. By the use of Internet technology, 
this could be more easily facilitated (28). 

(c) 7 he correlation ot" sputum atypia and histological 
•changes in the bronchi in high-risk population is not well 

defined. 

[d) The natural course ol preinvasive changes in die bron- 
chi from the high risk subjects needs to be clarified through 
longitudinal, prospective studies with reference to histological 
changes in ihc bronchi. Ongoing longitudinal studies with flu- 
orescence bronchoscopy ami multiple biopsies with histology 
and other biomarkers will define the ability of these markers to 
assess for risk. 

(r) What is the pathology /biology of the small, often pe- 
ripherally I oca red. imm>r*; (3 mm m diameter V which :nc more 



often diagnosed with newer radiological techniques low- 
dose spiral CT)? 

{/) Optimization of the tissue procurement and processing 
. techniques are important. Distinction of reactive from neoplastic 
processes is usually straightforward, but diagnostic difficulties 
may arise in the case of (a) inadequate or poorly prepared 
histological materia] to evaluate and (b) the presence of etio- 
logical atypia in epithelium, stimulated by inflammation, viral 
infection, radiation, or chemotherapy. 

ig) DNA array analyses of gene expression: will it be 
useful? How to collect proper mRNA? Can mRNA extracted 
from microdissected cells obtained at bronchoscopy be globally 
amplified and still remain representative of RNA present in sttttl 

Biology of Lung Carcinogenesis and Potential Early 
Detection Markers. 

Lung cancer is the end-stage of multiple-step carcinogen- 
esis, in most cases driven by genetic and epigenelic damage 
caused by chronic exposure to tobacco carcinogens. The genetic 
instability in human cancers appears to exist at two levels: at the 
chromosomal level, including large scale losses and gains; and 
at the nucleotide level including single or several base changes 
(29). Lung cancers harbor many numerical chromosomal abnor- 
malities (aneupjoidy) and structural cytogenetic abnormalities 
including deletions and nonreciprocal translocations (30). At 
least three classes of cellular genes are involved: proto-onco- 
genes, TSGs, and DNA repair genes. Oncogenic activation often 
occurs via point mutations, gene amplification, or chromosomal 
rearrangement, whereas TSGs are classically inactivated by the 
loss of one parental allele combined with a point or small 
mutation or aberrant methylation of a target TSG in the remain- 
ing allele. Additionally, dysregulated gene expression (either 
increased or decreased expression) can occur by other, as yet 
unknown, mechanisms (30). Present studies have not yet con- 
firmed a prominent role for abnormalities of DNA repair genes 
in lung cancer. 

Preneoplastic cells contain several molecular genetic ab- 
normalities identical to some of the abnormalities found in overt 
lung cancer cells (Fig. 2). These include allele loss at several 
loci {3p, 9p. Sp, and f7p), mrc and ras up-regulation, cyclin Dl . 
overexpression, p53 mutations, and incieased immunoreactiv- 
ity, be 1-2 overexpression and DNA aneupjoidy (31-35). AHcJo- 
ryping of precisely microdissected.. preneoplastic foci of cells 
suggests that the earliest changes in the bronchial epithelium is 
allele loss at chromosome regions 3p. then 9p, Sp, J7p : 5q. and 
then ras mutations (36-39).Thc biological meaning of LOH is ' 
only vaguely understood. Recent evidence suggests that LOH 
may be a consequence of mitotic recombination, that there is 
only infrequent physical loss of genetic loci, and thai LOH 
probably precedes chromosomal duplication MO). Allelic loss 
would thus be significant primarily in the presence of mutation 
in the retained allele, and gene dosage would not be expected to 
exert a phenotypic effect in LOH. Some reports have indicated 
thai ms activation occurs at early carcinoma singes (31). His- 
tologically norma) bronchial epithelium adjacent io carreers has 
also been shown to have certain genetic Ios.se?. Atypical ade- 
nomatous hyperplasia,, the potential precursor lesion of adeno- 
carcinomas, often have Ki-rat ruinations (<U'i 
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Fig. 2 Sequential changes during lung cancer pathogenesis. Although multiple genetic markers are abnormal in lung cancers, their appearance during 
the lengthy preneoplastic process varies. The liming of the appearance of these changes has been investigated in bronchial prcneoplasia, because 
sequential sampling of the peripheral lung is technically difficult. Several alterations have been described in histologically normal bronchial epithelium 
of smokers. OtbeT changes have been detected in slightly abnormal epithelium {hyperplasia, metaplasia) which we do not consider to be true 
premalignanr lesions. These changes are regarded as early changes. Molecular changes delected frequently in dysplasia are regarded as intermediate 
in liming, whereas those usually detected ai (he CIS or invasive stages are regarded as late changes. It should be stressed that although there is a usual 
order, exceptions regarding the timing of onset may occur. Some changes are progressive, such as chromosome 3p deletions. Thus small discrete 
chances ate present early, progressively become more extensive during pathogenesis, and frequently involve all or almost all of the arm in CIS 
samples. Although allelic loss at the TP53 locus may precede the onset of mutations, data on this sequence are scant. Dysregulalion of the RNA 
component of telomerase (with its appearance in noobasal cells) is an early event, whereas upregulation of the gene is a relatively late event. 



Molecular changes have been found nol only in the limes 
of patients with lung cancer, but also in the lungs of current and 
former smokers without hmg cancer (18. 42. 43). These obser- 
vations are consistent with the niuJtistep model of carcinogen- 
esis and "field cancerization*' process, whereby the whole i eg ion 
is icpeaicdly exposed to carcinogenic damage (tobacco smoke) 
and is at risk for developing multiple, separate, clonal! v unre- 
lated foci of neoplasia. The widespread ancuploidy that occurs 
throughout the respiratory tree of smokers supports this theory 
(4-1). However, the ptesence of the same somatic pi>3 point 
mutation at widely dispersed preneoplastic lesions in a smoker 
without invasive lung cancer indicates that expansion of a single 
progenitor clone may spread throughout the lespiiaforv tree 
These molecular alterations might thus be important 
targets for use in the catty detection of lung cancer and tot use 
as surrogate biomarkers in the follow- up of chemoptcvention 



studies. Detection of these mutant cells should be possible with 
the different molecular techniques in accessible specimens The 
prospects of diagnosing these biological abnormalities in mul- 
tiple types of clinical specimens are discussed below. 

Specimens for CI in ten I Testing: Sputum 

Since the 1930s, cytological examination, of sputum has 
been used for the diagnosis of Jung cancer (4(f). Cytological 
examination of sputa, especially multiple samples, is helpful for 
the detection of central rumors arising fiom the latger bronchi 
(e.£., squamous celt- and small cell carcinomas). Exfoliated 
cells from peripheral tumors, such as adenocatcinomas. arisinn 
from the smaller airways (small bronchi, bronchioles, and ahT- 
oli). especially those less than 2 cm in diameter . can be detected 
only occasionally m sputum samples. This has become of 
gicaier importance because the changes in cigatt-tte exposition 
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(filters and decreased nicotine content) have created an increase 
in adenocarcinomas and a decrease in squamous carcinomas 
(47-49). The sensitivity of sputum cytology for early lung 
cancer is only in the 20%-30% range from screening studies, 
but by adhering to proper specimen collection, and processing 
and interpreting criteria, the yield can be substantially improved 
(50, 5 1 ). The data on the reliability of the sputum are conflicting 
(52^54). Browman et at. (52) reported interobserver agreement 
of 68% for exact and 82% for within - 1- category. Holliday et 
at. (54) reported low agreement within observers (27- 60%) and 
across observers (13-50%). Within - 1 - category intraobserver 
agreement underwent a two- or 3-fold increase in agreement, 
which was also the case for interobserver agreement. The var- 
iation in intra- and interobserver agreement seems to depend on 
experience among the cy totechnicians/cytopathologists and the 
composition of categories studied. A higher degree of agreement 
is obtained for higher grades of dysplasia (54). Risse et at. (55) 
showed that the ability to detect premalignant conditions is 
dependent on the number and type of cells present in the deeper 
airways, suggesting a mode of improvement that is umelaled to 
observer reliability. MacDbugall et at. (56) concluded thai spu- 
tum cytology was loo insensitive and insufficiently accurate, to. 
be included in the routine work-up of any patient suspected of 
having lung cancer. To improve the reliability of sputum cytol- 
ogy examinations a simplification of the diagnostic, categories 
from .6 (normal; squamous metaplasia; mild, moderate, and 
severe atypia; and carcinoma) to 2-3 categories have been 
proposed (54). Future clinicopalhological studies will be re- 
quired to validate this concept. 

To' improve the sensitivity of sputum examination as a 
population- screening tool for the detection of early lung cancer, 
several approaches are currently under development. 

Immunostaining. Annual sputum specimens obtained 
from individuals screened at Johns Hopkins were obtained, and 
the patients were monitored for 8 years (57). Because the 
clinical outcome of these patients was known. archival sputum 
specimens were screened for the presence of btomaikers that 
could indicate the presence of Jung tumors in an early, preinva- 
sive stage. In an attempt lo distinguish the pattern of marker 
expression Tockman et at. (58) studied two monoclonal anti- 
bodies. Positive staining predicted subsequent lung cancer ap- 
proximately 2 years' before clinical recognition of the disease, 
with a sensitivity of 91% and a specificity of 88% (58). One of 
these antibodies (703 04) had a higher sensitivity and was later 
identified as recognizing hnRNP A 2/131 (59). The 'role of 
hnRMP A2/B1 ovcrexpression for detecting preclinical Jung 
cancer has been studied in a large high-risk population including 
6000 Chinese tin miners who were heavy smokers and who had 
an extraordinary rate of lung cancer (60). The results from, this 
study indicated thai detection of hnRNP A2/BI over expression 
in sputum epithelium cells was 2- to 3- fold mote sensitive for 
detection of lung cancer than standard chest X-ray and sputum 
cytology methods. The method was particularly- effective in 
identifying early disease (60). The sensitivity was 7*1% verMts 
7)% for cytology and 42% for chest X-ray. However, the 
biomnt kei had a lower specificity (70%) compared wjih cviol- 
ogy (100%) and chest radiograph (90%). An ongoing clinical 
(rial is evaluating the performance of the A2/BI protein as :t 
biomaifeci foi the caily detection of SP1.C The patients ;ti n*.fc 



for SPJLC have the highest incidence of lung cancer (2-5%) 
among asymptomatic populations (61). In this trial, 13 SPLCs 
were identified by A2/B), and the sensitivity and specificity 
were 77-82%. and 65-81%, respectively. Among the cases 
. identified as positive by rmmunocytc*chemisrry and image cy- 
tometry, 67% developed SPLC within 1 year (62). Whereas the 
previous immunocytochemistry studies on materia) from the 
older screening materia) from the NQ-supporled screening 
studies were made on sputum cells cytologically' classified with 
moderately or gravely atypical metaplastic appearance, the latter 
studies have been done on cytologically . "normal appearing" 
cells. More recently Sueoka et at. (63) reported the confirmation 
of the value of overexpression of hnRNP A2/BJ lo detect 
preclinical lung cancer in Japan. Efforts to improve the sensi- 
tivity of hnRNP markers are ongoing (64). . 

PCR Techniques. PCR techniques have been used for 
the evaluation of molecular biomarkers for early . rung. cancer 
detection. In a pilot study with selected patients from the Johns 
Hopkins Lung Project (JHLP), 8 (53%) of 15 patients with 
adenocarcinoma or large cell carcinoma were detected by mu- 
tations in sputum cells from 1 to 13 months before clinical 
diagnosis (65). However, the method seemed to be less sensitive 
than the protein marker described above, and the identification 
of specific gene abnormalities is further limited by the need to 
know i be specific mutation sequence with which to probe the 
sputum specimens. Currently, this approach is not practical for 
screening undiagnosed individuals. Future advances in gene' 
chip technology may permit testing for all possible mutations of 
common oncogenes and TSGs in clinical specimens of asymp- 
tomatic individuals (62). • 

Microsatellite markers are small repeating DNA sequences 
found in the noncoding regions of a gene. PCR amplification of 
these repeat sequences provides a rapid method for assessment 
of LOH and facilitates the mapping of suppressor genes (66, 
67). Microsatellite alterations are extension or deletions of these 
repeared elements. Detection of microsatellite alterations in 
histological or cytological specimens may facilitate the detec- 
tion of clonal preneoplastic or neoplastic cell populations. Al- 
though the detection of microsatellite alterations does not indi- . 
cate the specific genetic change in the rumor, detection of clonal 
cell populations might serve as a cancer screening marker (65). 
Identical alterations have been found in lung cancers and cor- 
responding sputum samples demonstrating minimal atypia (68). 
The pJ6 gene is located on the short arm of chromosome 
9(9p2 1 ) and is frequently mutated or inactivated in tumors and 
cell lines derived from lung cancer (69. 70). Belinsky et at. (71) 
measured hypcnnethylaijon of the CpG islands in the sputum of 
lung cancer patients and demonstrated a high correlation with 
early stages of non-smalJ cell lung cancer, which indicated that 
p)6 CpG hyperruethylation could be useful in the prediction of 
future lung cancer. However, prospective studies are needed to 
evaluate the role of p)6 hype rmethy la lion as a marker for early 
lung cancer detection. Multiple other genes are inactivated by 
hypennethylation in lung cancer (72), and the detection of 
hyper niethylaiion may be useful for risk assessment and early 
diagnosis. 

Compitirt -assisted ln»a«e Analvsis. Computer- assisted 
image analysis was initially used to detect malignancy- associ- 
ated changes (e.g., subvisual or nonobviom. changes in the 
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distribution of DNA in the nuclei of histologically normal cells 
in the vicinity of preinvasive or invasive cancer; 73). In a 
retrospective analysis of sputum cytology slides, malignancy- 
associated changes alone correctly identified 74% of the sub- 
jects who later developed squamous cell carcinoma (74). The 
technique has been improved, and recent data snowed sensitiv- 
ities of 75% for stage 0/1 lung cancer and 85% for adenocarci- 
nomas wiih a specificity of 90% (75). This quantitative micros- 
copy technique allows the examining of thousands of cells per 
slide within a relative short time. Similar techniques have been 
approved in the United States for cervical cancer screening,, and 
might, in the future, play a role for lung cancer screening. 
However, no prospective clinical studies has evaluated this 
technology in a larger lung cancer screening setting: 

High Throughput Technology. With future advances in 
^gene chip technology, it might become feasible to probe for 
expression of multiple genes in sputum specimens of asymp- 
. tomatic individuals. However, this requires a large amount of 
undegraded RNA from respiratory tract cells. Wiih the high 
throughput technology, a higher sensitivity might be achieved 
by using multiple markers at the cost of achieving a lower 
specificity, which would be undesirable for a screening study. 

In conclusion, we need to reevaluate the ToJe of sputum 
cytology for screening and early detection of lung cancer be- 
cause of advances in biomarkers and technology. Ongoing stud- 
ies with standard and biomarker analysis in high-risk groups 
might change the previous negative attitude and provide a new 
perspective on spunim cytology as a mass screening tool when 
applied in a high-risk population. Adding different molecular 
diagnostic tests gives the possibility for early diagnosis far in 
advance of clinical presentation. However, validation of the 
tests in larger prospective studies is necessary, and the individ- 
ual tests have to be compared wiih each other lo define the role 
of early diagnosis in the overall management of high-risk sub- 
jects. Furthermore, health economic issues have to be consid- 
ered. 

Specimens for Clinical Testing: BAL 

BAL involves the infusion and rc3spiration of a sieiilc 
saline solution in distal segments of the lung via a fibcioptic 
bronchoscope. Ahrendt et al.. (76) examined a series of 50 
resected nomSCLC tumor patients and compared the rumor and 
BAL with regard to molecular inaikers including p53 mutations, 
K-ras mutation, the metbylation status of the CpG island of the 
pJ6 gene, and microsatelhte alteration (Tables I and 2). With 
the possible exception of the test for microsatellite aJreiatjon. all 
of the tests had relatively high sensitivity and could deled 
mutant cells in the presence of a large excess of normal cells. 
The frequencies of these changes in the rumors ranted from 
27% (for K-ras mutations) to 56% ffoi p53 mutations). As 
expected, p53 mutations were mote frequent. in central (piedom- 
inanilysqnamous-cell) tumors, anil K-ras mutations were more 
frequent in peripheral (predominantly adenocarcinoma) tumor?,. 
The specificity was high (nearly 100%) because, with the ex - 
ception of microsatellite alterations, the same genetic change in 
13 A L sample as in minors was always found, but the sensimiry 
was low, and in only !»3% of tumoi s that contained molecular 
lesions were ihc same abnormalities detected in corresponding 
BAI. fluids. Specifically, the test? were least helpful "in the 



group of patients in whom improved diagnostic abilities are 
most needed, those with small, peripherally.Iocated tumors (77). 
Unfortunately, the investigators were not able to compare the 
molecular tests with routine cytopathological analysis of the 
BAL specimens. The sensitivity of the molecular tests in BAL 
specimens has to be improved, and we need to know the results 
from subjects at increased risk (current and former smokers 
without rung cancer and survivors of previous cancer of the 
upper respiratory tract) and subjects with chronic lung diseases 
as well as results from healthy never smokers. 

A European group has previously, shown that genetic al- 
terations detected in DNA from bronchial lavage of individuals 
with lung cancer Were also found, in individuals with no evi- 
dence of malignant disease (78), which raises the question about 
the specificity of such molecular damage in neoplastic condi- 
tions. To improve the sensitivity and specificity of detecting 
allelic imbalance in lung rumors, high throughput PCR : based 
microsatellite assays have been established (79). In a recent 
study by Fielding et al. (80), the up- regulation of hnRNP A2/B 1 
was found to be a promising marker in BAL for the detection of 
premalignanl and malignant bronchial lesions with a diagnostic 
sensitivity of 96% and a specificity of 82%. 

It is too early yel to make conclusions as to whether BAL 
examinations will add to other pathological/molecular biologi- 
cal clinical studies. To obtain diagnostic material for BAL 
bronchoscopy is required, and we do not have any data that 
compare BAL examinations with biopsies. Thus, we do not 
know whether BAL is a valuable adjunct to the biopsies taken 
under the same bronchoscopy procedure, 

Specimens for Clinical Testing: Peripheral Blood 

For many years scientists have searched for a rung cancer- 
specific tumoi marker that could be detected in peripheral blood. 
Optimism was raised in the "eaily" immunocytochemistry era 
by the use of monoclonal antibodies raised against more-or-less 
specific epithelial epitopes. In the search for epithelial cells in 
peripheral blood and bone marrow, monoclonal antibodies 
against cytokeralin have been used. However, these reactions 
are clearly not cancer-specific, and some antibodies have been 
shown to cross- react with normal blood or bone marrow ele- 
ments (81, 82). Another explanation could'be that celts from the 
macrophage/monocyte system may contain proteins derived 
from the primary tumor' that have undergone .necrosis and 
apopiosis and (hat these processed proteins are recognized by 
the antibodies (82). On the basis of "traditional" immnnocyio- 
chemistry, no markets have been able to detect premalignartt or 
early- malignant disorders based on a peripheral blood sample. 
However, with the development of DNA technologies, new 
possibilities have been raised, and. with the use of PCR tech- 
niques, some promising reports have been published. 

Nanogram quantities of DNA circulating in blood arc pres- 
ent in healthy individuals (83, 84). Tumoi DNA is also released 
into the plasma component in increased quant i lies (S3, So). 
Thus, ihe plasma and serum of cancel patients is enriched in 
DNA, an average four times the amount of free DNA as com- 
pared with normal controls (87). In a study by Chen et a I. (88). 
a comparison of microsatellite alterations in tumor and plasma 
DNA was done in SCI.C patients, and 93% of the patients with 
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Table 1 Tissues and other resources for the study of molecular markers 



Specimen 



Rcf. 



Comments 



Nonmalignani (issue horn patients 
ami from cancer- free tclatives 



Numerous 



65, 68. 71, 74 



Tumor tissue 



Sputum 



Surrogate organ 



Bronchial brush/wash 



Bronchial tissues 



BAL fluids 



Blood components 



Tissue for molecular staging 



Tumor cell lines 



Cultures of nonmalignani tissues li<t. 155 



MI, 142, 143 



42, 43, 45, 144, 145 



76, 78, 146, 147, 148 



72, 92, 149 



156, 157, 15$ 



Mixture of ce)J types, may require microdissection (139). Extensively used 
for most. studies. Alcohol- fixed fine- needle aspirates may be used for 
mutational and other studies. 

Respiratory celb usually in small minority. Most samples fixed in. 
.'Saccoroanno's fixative. Several studies have been performed on these 
specimens many years later. 

Predominantly squamous epithelial cells. Buccal smears; brushmgs of 
tongue or tonsil may be explored as potential surrogate organs resulting 
from the field effect of tobacco damage of the entire upper ., 
aerodigesrive tract. This concept needs to be confirmed. 

Predominantly respiratory cells. Fresh, frozen, or alcohol-fixed samples are 
suitable for multiple studies including FISH." 

Usually from bronchial biopsies, but may be obtained from surgical 
resection specimens. Formalin fixation and paraffin embedding required 
for histological diagnosis, although EASI preps may permit 
identification and isolation of subpopulations. Paraffin sections may be 
used for genotyping polymorphisms,' for allelotyping, and for in situ 
hybridization. 

BAL fluids are useful for examining the peripheral airway cells, which are 
the precursor cells of most adenocarcinomas. Numerous mononuclear 
cells present. Enrichment of epithelial cells desirable. 

Analysis of circulating tumor cells and genetic material shed by dying 
tumor cells into the plasma component may yield useful biological and 
diagnostic information. Gene mutations and presence of epithelial cell 
markers have been used to detect circulating tumor cells. Gene 
mutations, allelic loss, microsaiellile alterations, and aberrant 
metbylation have been used to identify tumor cell DNA released into 
the fluid compartment. 

Although little data exists for lung cancers, regional lymph nodes, sentinel . 
lymph nodes, and surgical resection margins have been used in other 
rumor types for molecular staging. 

Provide an unlimited self-replicating source of high-quality molecular 
reagents and have been used for numerous studies. Cell lines may or 
may not reflect the properties of (he tumors from which they were 
derived (26). although tbey probably represent cellular subpopulations 
(27). Aggressive metastatic tumors are more likely to be successfully 
cultured (28) resulting in skewed daia.' 

Epithelial cultures may be useful for studying molecular changes during 
multistage pathogenesis. Temporary as well as a few immortalized 
cultures from nonmalignani epithelial cells have been established. 
B- lymphoblastoma cultuies are useful for linkage analysis, for genetic 
suspeciibiliry studies, and for allelotyping corresponding tumots. 

Tissues such as buccal smears, tumor- free lymph nodes, and peripheral 
blood mononuclear cells arc useful as controls for linkage analysis, for 
genetic susceptibility studies, and for alleloryping corresponding tumors. 



"FISH, fluorescence m situ hybridization; EASI,' epithelial aggregate separation and isolation. 



mictosatelJile alterations in tumor DNA also had modifications 
in the plasma DNA. However, some patients had I OH only in 
the rumor DNA. Because most of the microsatcllite alterations 
were similar in rumor DNA and plasma DNA. they concluded 
thai some of the DNA circulating in ihc blood comes from the 
rumor. Thus, modifications of circulating DNA can be used as 
an early detection marker. Detection of a ben ant DNA me thy I- 
stion in serum DNA in patients with non-SCl.C lias been 
reported (72). All hough the number of pa tit* ms was small and 
the hypermeilryiated DNA was found in all stripes, it opens up 
for ihc possibility to be used as an eaily lung cancer detection 
marker. Furthermore. p}3 and ras pene mutations have been 



detected in the plasma and serum of patients with colorectal 
cancers (89-91). pancreatic carcinomas (92. 93). and hemato- 
logical malignancies (94). 

In. conclusion, the limited direct accessibility of lung car- 
cinomas has led to efforts to identify rumor- associated soluble 
markers in serum oi plasma. Many of the currently recognized 
soluble markers were fust identified as "lumor markers but r 
when evaluated in nonneoplastic tissue, have often been found 
in normal t ells as well as in tumors. Hoi early detection of lung 
cancer, we need mote clinical data evaluating these new molec- 
ular biological markers from multiple sites, especially in high- 
risk croups. 
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Table 2 Molecular approaches for lung cancer investigation 



Specimen 



Ref. 



Comment 



Gene, mutations 



Allelotyping 



Gene expression al RNA 
and protein level 



159, 160, 161 



18, 158 



145, 162, 163, 164, 165, 166 



Molecular cytogenetics 



Comparative genomic 
hybridization 

MoTphometric studies 



40, 167, 163. 169. 170 

171, 172 
74, 173; 174 



Widely used technique, especially for p5 3 and ras genes: Often used 
to determine the r61e of a newly discovered gene in the 
pathogenesis of tung cancer. May be of diagnostic and prognostic 
significance. Multiple methodologies available. ... 

Useful as a partial substitute for mutational analysis and for , 
determining the chromosomal locations of putative tumor 
suppressor" genes. Widely used lo study multistage pathogenesis. 
Readily performed on formalin-fixed.and microdissecled tissues. 
Increasing use of gcnolyping using automatic sequencers. 

Northern blotting and reverse transcripiion-PCR are widely used to 
investigate gene expression. Western blotting often used for. 
detection of protein expression. M situ hybridization for message 
expression can be performed on paraffin-embedded tissues and, 
thus, can be used to investigate multistage pathogenesis. * ■ 
Microarray techniques offer promise of examining all or most of 
the genome but currently require relatively large amount of high-, 
quality RNA from purified cell populations Sage technique useful 
for investigation and identification 'of expressed genes. Similarly, 
advances m proteomics will permit simultaneous detection of 
multiple proteins. Numerous immunohistochemical studies of 
. oncogene expression have been used to study multistage 
pathogenesis. Of particular interest, bnRNP expression on 
exfoliated epithelial cells in sputum samples may predict for 
development of cancer. 

In situ hybridization studies of fixed materials or using smears has 
provided considerable information about numerical and structural 
changes. 

Useful for detection of gene amplifications. Less sensitive for the 
detection of regions of allelic loss. 

May be applied to paraffin-embedded tissues. Useful for determining 
aneuploidy and for measuring a number of nuclear and 
cytoplasmic parameters. 



Specimens for Clinical Testing: Bronchoscopy 

WLB is the most commonly used diagnostic tool for ob- 
taining a definite histological diagnosis of lung cancer. Bron- 
choscopy has major dingnpstic limitations for premalignant le- 
sions. Because these lesions are only a few cells thick (0.2- J 
mm) and have a surface diameter of only a few millimeters, they 
rarely are observed as visual abnormalities. Woolner (95) re- 
ported that squamous cell CIS was visible to experienced bron- 
choscopes is in only 29% of cases. To address this limitation, 
fluorescence bronchoscopy was developed. Early studies of 
fluorescence bronchoscopy entailed the use of fluorescent drugs 
(hemaloporphyrin dyes) that were preferentially retained in ma- 
lignant tissue (96). Although, studies evaluating this approach 
did. in fact, show that eailv invasive ;md in situ cancers could be 
localized, the detection of dysplasia remained problematic (97- 
100). Furthermore, the development of photodynnmic diagnos- 
tic systems was hampered by problems including skin photo- 
sensitizing and interference with tissue autofiuorescence. To 
overcome these problems, a new laser photodynamic diagnostic 
system was developed (101). This system detected rumor- 
specific drug II note secure at 630 nm wavelength, which is far 
from normal tissue antofiuoiescence (500-5SO nm). and inter- 
ference by antoiluorcscence horn normal tissue should then 
have been eliminated, but it remained a significant problem 
(102). 



Another approach was developed by Palcic et oi (103), 
who noticed the lack of autofluorescence in the tumor lesions by 
using blue light (442 nm) rather than white light to illuminate 
the bronchial surface. They amplified the difference in antbftuo- 
rescence between normal, piemalignant, and tumor tissue for 
clinical use (I03 r 104). Using a high-quality-charge coupled 
device and special algorithm, the LIFE was developed, taking 
advantage of the principle that dysplastic and malignant tissues 
reduce 3uiofluoTCSceni signals compared with normal tissue 
(Fig- 3). 

Several studies have been performed comparing the diag- 
nostic specificity and sensitivity of LIFE bronchoscopy versus 
WJLB in diagnosing preinvasive and early-invasive lesions 
(105-109; Table 3). Most of the studies reported a higher 
diagnostic sensitivity of LIFE bronchoscopy in the detection 
pjemalignant and caily- malignant lesions at the cost of tower 
specificity {i.e.. more false-positive results). In most of these 
studies, lesions with moderate dysplasia or worse were the target 
of the study and rated as "positive.'' The prevalence of prein- 
vasive and early lung cancer varies widely from one study to 
another, from 20.2% (105) to 65.8% (102). The e*plan3tlon 
might be beyond the risk profile of genetic variations or differ- 
ent levels of experience among the endoscopists as well as the 
pathologists involved. Furthermore, there seem:, to be a naming 
cflecr in using ihe LIFE bronchoscope, which has been demon 
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straied by Venmans a oi ( J 07). In their study, the diagnostic 
sensitivity increased from 67 to 80% when comparing the first 
and the second half of the study, the use of the LIFE device in 
conjunction wnh WLB improved the detection i ate of preneo- 
plastic lesions and OS significantly ( fable 3). Kurie ei al. (106) 
looked for more subtle tissue transformation, but ibeir studv 
included few patients ' wjih moderate dysplasia or worse. No 
improvement in the evaluation of metaplasia rnde* was ob- 
served by the use of LIFE bronchoscopy. Thus, differences in 
the study population might explain the diffeicnt conclusion. 
There aie still no clinical studies with sufficient long-term data 
showing iliat moderate dysplasia is the most relevant clinical 
predictot of eventual ma lie nancy. Limitations in making con- 
clusions fiom (be twisting studies rue also the potential meth- 
odological bias r elated to I be order in which ihc diffeient bron- 
choscopy procedures are done and whether the same examiner 
has per formed both proredures. To address these issues, a 



prospective randomized study between LIFE bronchoscopy and 
WLB was done al the University of Colorado Cancer Center. 
The study design 1 included a randomization with regard to the 
order of procedure as well as the order of the individual biorr- 
choscopist (10°). The order of the procedure and of the indi- 
vidual bronehoscopist did not affect the results. The study also 
demons 1 1 a ted a significantly higher sensitivity in detecting pie- 
malignant lesions visualized by the LIFE, but at the cost of a 
lower specificity (10°). The reason for the tow diagnostic spec- 
ificity found with the LIFE bronchoscopy in ibe different studies 
might be attributable to the visualization of more abnormal foci 
with the LIFE bronchoscope, with the consequence that a larger 
mini be i oJ biopsies were taken and. thus, there was a hrgber risk 
of more false- positive results. The use of LIFE bronchoscopy 
has led to the identification of a new morphological entity, ihe 
ASD. which is described above. In a lecem morphological studv 
rsngtodysplastir changes* we re frequently found m preneoplastic 
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Fig. 4 Seventy- one- year- old man .wilh a spicular nodule in upper left 
lobe demonstrated on low- dose helical CT (picture), but not visible on 
chest X- radiography. CT- guided biopsy showed adenocarcinoma. 



and early-malignant lesions in the bronchi (26). The morpho- 
logical entity has been confirmed in preneoplasias among smok- 
ers, and the perspectives of this finding have been extensively 
discussed (110). The prognostic significance of this morpholog- 
ical entity is currently studied in ongoing long-term follow-up 
studies. Future studies have to evaluate the role of ASD as a 
biomarker for early lesions and whether it can be used as a 
marker for treatment effect or therapeutic target for chemopre- 
vention. 

The LIFE bronchoscope may play an important role in the 
screening and follow-up of subjects at high risk of developing 
lung cancer. At this stage, however, it is unknown whether the 
LIFE bronchoscope will lead to a reduction in lung cancer 
mortalily. There are also no data on cost-effectiveness and 
cost-benefit analyses available for this new diagnostic proce- 
dure. The use of the LIFE bronchoscope may also in the future 
be extended to other indications, e g., patients staged as having 
resectable lung cancer on one side. Whether LIFE bronchoscopy 
of ihe contralateral lung will disclose abnormalities, which 
would change the therapeutic decision, is not yet reported.. 

Recent At) van res in Radiology 

The previous NCI- sponsored screening trials failed to dem- 
onstrate any reduction in the lung cancer mortality by sputum 
cytology and yearly chest radiography as mass screening tools 
for lung cancer scicening. Limitations of design and execution 
of the studies, however, have been discussed extensively (3. 
Ill, 112). An extended follow-up (median. 20.5 'years) of the 
Mayo Lung Project was recently published (113). There was 
still no diffeience in lung cancer mortality between the inter- 
vention arm and the control arm (4.-1 it;..si/j 3.9 deaths per 1000 
person-years), flowevei. the median survival for patients with 
resected early- stage disease was 16.0 yea is in the intervention 
arm versus 5.0 years in the usual- care :inn [P < U.0>). The la net 
findings have raised the question as K> whether some small 
lesions wiih limited clinical relevance may have been identified 
in the intervention arm. and the question of "overdiagnosis" was 
discussed in accompanying editorials MM) 



Mass screening for lung cancer has been performed in 
Japan for many years and has been performed in over 500,000 
. people in about 80% of the local communities (1)5). Sobue et 
ah (J 16) observed that annua! clinic-based chest X-ray screen- 
ing for lung cancer in Japan showed reduced lung cancer mor- 
tality by about one-fourth among individuals who. underwent 
screening once a year. In this screening program, the relative 
odds ratio of dying from lung cancer within 12 months was 
0.535 and in the 12-24-month period was 0:638 (117). How- 
ever, many studies have focused on the pitfalls in the detech'on 
of abnormalities by radiography (118-122). The limit of chest 
radiographic sensitivity for nbduJe detection is roughly 1 cm in 
diameter, by which time the tumor has over JO 9 cells and may 
already have violated bronchial epithelium and vascular epithe- 
lium. CT has been shown, to be more effective in the detection 
of peripheral hmg lesions compared wilh plain radiography or 
conventional tomography of the whole lung (123, 124). 

Spiral CT scan is a relatively new technology with the 
. ability to continuously acquire data resulting in a shorter scan- 
ning time, a lower radiation exposure, and improved diagnostic 
accuracy compared with those of plain radiography (125-127). 
Spiral CT allows the whole chest to be imaged in one or two 
breath- holds, reducing motion artifacts and eliminating i expira- 
tory misregistration or missing nodu J es. Although there is 
greater radiation exposure wilh CT than with chest radiography, 
low-dose techniques (lower mA of 30-50 compared with 200 
for conventional CT) have achieved calculated exposure doses 
that are 17% that Of conventional CT and 10 times that of chest 
radiographs. Further reduction in radiation dose while maintain- 
ing diagnostic accuracy is a topic of current research. Further- 
more, for the baseline screening, low- dose spiral- CT- scan i.v. 
contrast is not administered. Nodules as small as J-5 mm can be 
shown with modern spiral CT technology (25, 128). The obvi- 
ous advantages with this new technology led some groups in 
Japan and in the United States to look to Jow-dose spiral CT as 
a tool for screening (Refs. 129-131; Tables 4 and 5). 

In a Japanese report, spiral CF scans and chest radiographs 
were done twice a year in 1369 individuals (J 29). Peripheral 
lung cancer was detected in 15 (0.3%) of 3*157 examinations, 
and, among the 15 Jung cancer cases detected, the results of 
chest X-ray were negative in J J (73%), and the tumors were 
detected only by low-dose spiral CT. The detection rates of 
low-dose spiral CT and chest X-ray were 0.43% (15 of 3457 
examinations) and 0.12% (4 of 3457 examinations), respec- 
tively. Furthermore, 14 (93%) of the J 5 lung cancers were stage 
I disease. The histology showed that II of the 15 lung cancer 
cases were adenocarcinoma, and 4 had squamous cell carci- 
noma. The effective exposure dose with spiral CT scan in that 
study was calculated to about one- sixth that of conventional Cf. 

The ELCAP in New York was designed to determine: in) 
the frequency with which nodules weie detected: (/>) the fre- 
quency with which detected nodules represent malignant dis- 
ease: and fr) the frequency with which malignant nodules are 
curable | 131). In the ELCAP smdy. 27 lung cancers were found* 
among 1000 subjects screened. Among the 27 patients with 
cancer. $5% had stage 1 disease (Table 5). 

Another population- bared study on low- dose CT screening 
has been published by Sonc cr ni. (130). using a mobile low- 
dose spied CT scanner. The detection rale was 0.-18% {i.e., 4-5 
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Sensitivity' 




Specificity 






' Predictive values 












Relative 




Relative 


PPV° 


NPV 








No. of 


LIFE+ 




sensitivity 


LIFE+ 


specificity 


LIFE+ 


LJFE+ 


PPV NPV PPV 


NPV 


Author 


biopsies 


WLB 


LIFE WLB LIFE+WLB 


WLB LIFE WLB LIFE+WLB 


WLB 


WLB 


LIFE LIFE WLB WLB 


Lam etal (105) 


700 


0.67 


NR 0.25 


63(2.77 . 


0.66 NR 0.90 


NR 


0.33 


0.89 


NR NR 0.39 


0.83 


Kurie et al b (106) 


234 


NR 


0.38 NR 


NR 


NR 036 NR 


NR 


NR 


NR 


0.16 0.81 NR 


NR 


Venmans etal (107) 


139 


NR 


0.89 0J8 


L43 


NR 0.61 0.88 


NR 


0.20 


NR 


0.14 0.99 0.32 


0.98 


Vermulen etal (108) 


172 


0.93 


NR 0.25 


3.75 


0.21 NR 0.87 


NR 


0.13 


0.96 


NR NR 019 


0.90 


Kennedy et ai (109) . 


394 


0.79 


0.72 0.18 


4.4 


0.3 0.43 0.78 


0.38 


0.21 


0.85 


0.25 0.87 .0.17 


0.80 



* PPV, positive predictive value; NPV, negative predictive value; NR, not reported. 
b Based on reference pathologist. 

* If invasive caicinoma is inchided. 



Table 4 Results from three population-based screening studies with low-dose spiral CT (LDCT) 













Detection rate % 




Age incl. . 




No. of individuals 


True 


Fake 


Predictive 








Authors 


studied 


positive n 


positive" % 


value % 


LDCT 


. X-ray 


Pack-yr 


yr 


. Kaneko « o/. (129) 


1369 . 


,15 


15.6 


6.6 


0.43 


0.12 


>20 


>50 


Sonc et ai (130) 


• 3967 


19 


5.0 


8.8 


0.46-0.5 




>30* 


40-74 


Henschke etal (131) 


1000 


27 


20.J 


J 1.6 


: 2.7 


0.70 


>W 


>60 



" Defined as individuals wilh "test-positive,'* in whom further workup gave no suspicion of malignancy. 
* The study also included a group of nonsmokers. 
** Average = 45 (not reported in the other studies). 



Table $ Histology, stage, and size of primary lung cancer detected by low- dose spiral CT 





No. of cancers/ 
No, screened 


Histology % 






TNM % 






Size (mm) 




Author 


Adeno" Squam. 


Other 




II III 


IV 


Average 


Range 


<I0 


11-20 >2I 


Kaneko.ff ai (12?) 
Sone et at. (130) 
Hcnschke et ui (131) 


15/1369 (1.1%) 
19/5483 (0.3%) 
27/1000(2.7%) 


73 ' 17. 
63 5 
67 . 3 


32 
30 


93 
84 
85 


7 

4 n 


16 


12 
17 


8-18 
6-47 


4 
15 


M 3 
8 4 



* Adeno, adenocarcinoma; Squam.. squomous cell carcinoma; TNM, rumor-node-nielastasis. 



cases per 1000 examinations). Surprisingly, there was no dif- . 
ference in the detection rale among smokers (0.52%) versus 
nonsmokers (0.46%). The results from the three population- 
based studies are summarized in Tables 4 and 5. The conclusion 
from these studies is thai 85% of the. lung cancers defected by 
low -dose Ct were in .si age I, offering improved possibility for 
curative treatment and better prognosis in general. However, the 
issue of "false-positive" scans has to be taken into consideration. 
Thus far, up to 20% of the participants with nodules on the scan 
had no malignancy during the follow-up period. T,be possibility 
that the cancers found represent incidental cancers as in the 
Mayo Lung Project must also be considered (1 14). The results 
from these studies confirm the expectation that low- dose CT 
increases the detection of small noncalcified nodules and f that 
lung cancer at an earlier and more curable stage are detected. 
The mobile CT screening srudy by Sone et nl. (130) showed that 
low -dose CT increased ihc likelihood of detection of malignant 
disease 10 times :•■$ compared with radical aphy. The overall rate 
of malignant disease was lower in the Japanese studies (129. 
130) compared with rhe HLCAP siudv tkrf J 3 J : detection rates 
0.43-0.48% \rrsus 2.7%). T his could be because the Japanese 
studies screened individuals from the general population ages 



40-74, whereas ELCAP screened people at high risk, ages 5 60, 
with a tobacco history of at least 10 pack- years. Thus, as 
expected, the risk of the population to be screened affects the 
rate of cancer detection. 

Questions remaining to be answered include: (a) what are 
the diagnostic sensitivity and specificity of this procedure: and 
(b) does screening reduce lung cancer mortality? The spiral CT 
has not been as sensitive for small central cancers as it is for 
small peripheral cancers (129- 131)1 Minute nodules of lung 
cancer that are near the threshold of. detectabiliry may be over- 
looked at spiral CT screening f 132). A prospective study of the 
diagnostic sensitivity of spiral CT has recently shown that the 
diagnostic sensitivity exceeded the sensitivity- of conventional 
CT in previous reports (25). However, there were limitations in 
the detection of intrapnlnionary nodules smaller thai? 6 mm and 
of pleural lesions. Compared with surgery (thoracotomy . with 
palpation of deflated lung, resccrion. and histology), the sensi- 
ii vi ry of spiral CT was 60% lor intrapulrnonary nodules ol <b 
mm and 95% lor nodules of >6 mm and was 100% for neo- 
plastic lesions ^6 mm. Furthermore, a marked difference in ihe 
sensitivities of lwr> independent observers was found for nod- 
ules smaller than 6 mm, whereas agreement was much better for 
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6^I0-mm nodules (25). Given these promising preliminary clin- 
ical results, further research is needed to determine the optimal 
technique for spiral CT screening, which includes colli mat ion, 
reconstruction interval, pilch, and viewing methods. Decreasing 
the slice thickness to 3 mm, monitoring the viewing of exami- 
nations, and computer-aided diagnosis have been used to im- 
prove the diagnostic capability of spiral CT in the detection of 
pulmonary nodules (J33-I36). 

Future large scale randomized studies have to confirm 
whether in fact spiral CT screening will lead to a reduction in 
hrog cancer mortality. In a randomized study, the following 
questions arise: (a) what is the optimal high-risk group to study 
and what should be the control arm? (b) what should be the end 
points (goals) of the studies? The ultimate goal is to reduce the 
hmg cancer mortality. However, although this is a long- term 
goal, intermediate end points from such studies should be eval- 
. uated. The change to more curable stages at diagnosis for the 
Jung cancer patients is one such immediate goal; (c) what is the 
optimal workup and the morbidity of this program? (d) what is 
the cost, of such a screening program? and {e) what is the 
false-positive rate of the screening findings? Incorporation of 
smoking cessation programs should be included in the future 
design of screening studies because it has been shown that 
screening with low- dose CT in participants who are still smok- 
ing provides substantial motivation for smoking cessation (137). 

The studies with spiral CT-scan have demonstrated the 
superior diagnostic ability in the detection of small peripherally 
located rumors, most of the malignant ones of adenocarcinoma 
type of histology, the diagnostic sensitivity of spiral CT for 
more centrally located rumors (mostly squamous cell carci- 
noma) is significantly lower than for the peripherally located 
ones. Through these spiral CT studies^ we will leam about the 
biology, pathology, and clinical course of these small tumors, 
which might be different from what we know about clinically 
more evident rumors detected routinely in previous studies. 

Because lung cancer is so common, the introduction of any 
new screening technique in this area has to be underpinned by 
careful definition of ihe cost implications and must be justified 
by compelling evidence. The cost-effectiveness of the spiral 
CT approach should be assessed by evaluating the rate of 
over- diagnosing nonmalignnnt. relatively common abnormali- 
ties and comparing CT imagine to other diagnostic technologies. 

PET with FOG has recently emerged 3S a practical and 
useful imaging modality in the preoperative staging of patients 
with lung cancer. However, whereas CT is most frequently used 
to provide additional anatomical and morphological information 
about lesions, the FDG PET imaging provides physiological and 
metabolic information thai characterizes lesions that ate inde- 
terminate by CT. fOG PET imaging takes advantage of the 
increased accumulation of fDG in transformed cells and is 
sensitive f~95%) for the detection of cancer in patients who 
have indeterminate lesions on CT 1 138). The specificity (- 85%) 
of PET imaging is slightly less than its sensitivity because some 
.inflammatory processes avidly accumulate FDG. The high neg- 
ative predictive v;»Jue of PET suggests 'hat lesions considered 
negative on the srmlv are benign, biopsy is not needed, and 
radiographic follow.'- up is recommended Several studies have 
documented the increased accuracy of PET compared with CT 
in the evaluation of the hilar and mediastinal lymph node s tarns 



in patients with lung cancer (138). However, the PET resolution 
is sufficient only for nodules ^6 cm and will not be helpful in 
detecting the very small nodules. Compared with low-dose 
spiral CT, the FDG PET scan is more expensive and time 
consuming- The role of PET scan in early diagnosis of lung 
cancer in an asymptomatic high-risk population is not yet eval- 
uated However, future studies have to include PET evaluation 
to define its role in a population screening setting. 

Conclusion 

Recent advances in molecular biology and pathology have 
led to a better understanding and documentation of morpholog- 
ical changes in the bronchial epithelium before development of 
clinical evident lung carcinomas. Combined with technical de- 
velopments in radiologic a) and bronchoscope techniques, these 
procedures offer great promise in diagnosing lung cancer, far in 
advance of clinical presentation. Any of these individual proce- 
dures could be incorporated into the routine management of 
individuals at risk for developing primary or secondary lung 
cancer,, and for several of these methods, clinical studies are 
under way. Preliminary reported data are very promising for the 
early detection of lung cancer. Future studies must incorporate 
the different methods in a multidisciplinary scientific setting to 
evaluate the role of the individual method in the Overall man- 
agement for individuals at high risk for developing lung cancer. 
Several of these tests might diagnose the disease at the stage of 
clonal expansion before invasive carcinoma has developed. A 
management and intervention strategy appropriate to that stage 
of disease have to be developed. Preliminary studies of chemo- 
prcvention agents are reported, and new agents based on other 
biological mechanisms are under development and ready for 
clinical trials. It is now time to plan clinical trials that evaluate 
both diagnostic and therapeutic approaches to access their im- 
pact on the incidence of clinical lung cancer. 
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Localization of tissue inhibitor of metalloproteinases 1 (TTMIM) iu human 
colorectal adenoma and adenocarcinoma. 
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Tissue inhibitor of matrix metalloproteases 1 (TIMP-1) inhibits the proteolytic activity of 

not l° P f 63568 ^ hCreby P fCVents °aneer invasion. However T £ M alio 
possesses other functions such as inhibition of apdptosis, induction of ma Zn 

ssr^sss? ° f c t we ^ i^-rfiSS that 

nmL \!L I fr °' n co,orec ' al cancer patients and that high TIMP-1 levels 

pred.ct poor prognose To clarify the role of TIMP-I in colorectal tumfrigenesis the 

an , or ia cases of colorectal adenocarcinoma TIMP-1 mRNA was detected bv in *ih. 

was3h? nt eu ^ SCen onl y ^ically in normal mucosa. No TIMP 1 nX 
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Tissue inhibitor of malm metalloproteases 1 (HMP-1) inhibits 
tteproteoJytk activity of matmWbnoprotUrjd^ 
Parents cancer invasion. However, TIMP-1 also possesses otheJ 



onsfr | demonstrated that TIMP-l is elevated tob^fr^n^ 
rectal cancer patients and that high T1MP-J Jeye^DredW ™£ 
prognosK To clarify the role ot A-T^ 00!^^^ 
^^e mress.on pattern of TIMP-1 in benign and^SS 
colorectal tumorswas studied. In all of 24 cases of colorectal 
pdenocarpnoma TIMP-1 mRNA was detected by instohS 

cells located at the invasive front but was seen only sporadically in 
normal mucosa. No TIMP-1 mRNA was seen in any^SSS 
£Z£ £1 ma ' , 6» aD «.*P i *?>» c«)k, in vascular cells £ s^SS 
muscle ceMs. Comparison of sections processed for TIMP-1 write 
^{^ at " >n ^ S ^! i0ns i»™»»onfarochennca^lSneJ Twffc 
.TIMP-1 showed good common between 
It^JST^ and inimnnoreacUvity. Combining. TTMP-1 />. situ 
hybridizalion w,lh umnunohislocbenikal sta3 for a-srooott 

ETitat* C ^ Ssh0 ^ UMP-lniRNA inmyofioSb 
but not m macrophages. TIMP-1 mRNA was detected in 2 of 7 
adenomatous polyps in the adenoma area: in both cas£ ^ociated 

Don wtn invading colon cancer cells. . * W>VUJ 



S;my1S 

J*J*'T? 1 ? fW Cancer ce)1 invasi0D metastasis is the 

sucfltZ ?T ? - ta £ eB media,ed b * F^MySc «W 
such as the matrix metalloproteinases (MMP)." Under normal 
phy^ologzc conditions, the tissue degrading activiti^ cH™ 

S n r,* eP ' by of ^ naturally occurring 

inhibitors: tissue inhibitors of metalloproteinases (TIMP). T1MP f 
a 2X kDa glycoprotein demonstrated to be present in most bodilv 

manner." Overexpression of TIMP-1 in various cancer models 
has shown a suppressive role in the malignant progression ?W 
ever, as opposed .0 this anti-invasive role of TO4P-1 seveTal 
recem studies have demonstrated quite different functions of ^ 
MMP-,nh>bitor mcludmg stimulation of cell growth raalLant 
Wormatioi, and inhibition' of apoptosis, suggest a Pos^We 

clay a dual ^ * * ecu, «« , ** ™P-1 may actually 
play a dual role in cancer progression and metastasis » 

n^M.„?V^^ ? pTO,em are S] Snincamly increased in various 
malignan, leases and that such MMP elevations are correlated 

I^v InVaS ' OD ' me ' aS,asis and *"«« P atie "< survival ■' . 3 

h^ddiuon many reports have described similar overexpression of 
JlMP-1 mRNA and protein m several cancer types '«-*> More- 
over, we and others have demonstrated that moment of £ 
creased plasma levels of TIMP-1 by immunoassay serais as a 
sUong maike, or shon survival and .ecunence of rtsele in 
pafents with coloieca) cancer SimjIaiIv _ , ^ ™ 



between ^gh protein levels and poor prognosis is known for the 
rype-1 plasminogen activator inhibitor (PAI-1)>«J Qmsiderin. 
the pro tease inhibiting fu nction of these inhibitors, these finding 
it^ujMlroVersial; htTWEve*. alternative functions have been 

In order to better onderstand the role of TJMP-1 m colwectal 
cancer, histochemical analyses may provide some indications: A 
number of studies of the localization of TIMP-1 in colorectal 
cancer have-been published; however, the results of these S 

T^T^r ,nU8ct0,y ' N f r" and c °^S"e^«pc*ed*at 
miP-1 mRNA was expressed both in invasive aYienc^norna, 
carcinoma w , m* and adenoma and that the expression was T 
served m both the stromal as well as the epithelial compartment bf 
*a. S f d SJ/ B C ° DtraSt - ^ «> ^eagnes«5o lepor J 
* a '™ p -. 1 b*KA was expressed only in the stromal compait- 
ment of colorectal adenocarcinomas in spindle-shaped cells sur- 
rounding the mvasive cancer cells. The results of irnmunohisto- 
. chemical studies of TIMP-1 in colon are also confbeting; Hewi^ 
and colleagues" reported that . TIMP-1 was expressed in the cot- 
necuve tissue and basement membrane in both normal mucosa, 
adenomas and adenocarcinomas with only little stainine of the 
neoplastic epiftelium. On the other hand, Tomita and coneanues^ 
reported that TJMP-1. was expressed in both stromal and epithelial 
cells in colonic polyps and adenomas; as well as in adenocarcino- 
mas, ,n which the neoplastic cells were strongly hnmunoreactive. 

hi order to resolve these inconsistencies, we undertook our study 
^ H ,1",™%*^*°* and iriimMohistochemistry demon- 
strated that TIMP-] is expressed in myofibroblasts in the stroma at 
the invasive front of colorectal adenocarcinomas. Because TIMP-1 
was virtually absent from normal colorectal epithelium, we eval- 
uated the possibility of using TIMP-1 as a diagnostic tool to 
differentiate colorectal adenomas from Dukes' stage A colorectal 
adenocarcinomas. . 0 



Material and methods * 

Tissue samples 

■ "V J l iS . S , Ue raa?eria ' 'Deluded was obtained from University Hos- 
pital of Hvidovre (Copenhagen, Denmark) in accordance with a 

mSSSS ?*" by )ocal scientific ethica ' =ommit.ee (KF 
k 1 ,? Fo 1 u^ee, ? aichival samples (formalin fixed and paraffin 
embedded) colleaed from 1989 .0 1993 included Dukes' stage A 
colorectal adenocarcinomas (n = g) and colorectal adenomatous 
POOPS I" - 6, j were pedunculated (1 with mild and 2 with 
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moderate dysplasia) and 3 were sessile (1 with moderate and 2 
with focally severe dysplasia)]. Samples from 16 colorectal ade- 
nocarcinomas (1 Dukes* stage A, 6 Pukes' stage B, 8 Dukes*, stage 
C and I Dukes* stage D), 1 villous adenoma and 1 malignant colon 
lymphoma were prospectively collected during 1999-2000. These 
prospectively collected tissue specimens were dissected so that 
samples contained both normal mucosa and tumor tissue and were 
obtained within 30 min following surgical bowel resection. The 
specimens were immediately fixed in 4% neutral buffered formalin 
for 20-24 hr and then paraffin embedded. The 14 archival samples 
had also been formalin fixed and paraffin embedded. 

Generation of rionoverlapping T1MP-1 cDNA fragments by PCJR. 

The full length TTMP-1 cDNA (GenBank NM_003254) cloned 
in pSP64 vector 29 was used as template to generate 2 nonoverlap- 
ping PGR fragments for in vitro transcription, and named f 104 (bp 
56-378) and f!06 (bp 398-680). First, the whole insert (-780 bp) 
was on out by digestion with Hind 111 and tfamtti ana punned 
after agarose gel electrophoresis using the Qiaex II gel extraction 
kit (Qjagen, Crawley, prated Kingdom). To generate nonoverlap- 
ping antisense probes and the corresponding sense probe, 2 PCR 
fragments were generated using, upstream primers flanked by a 
linker sequence containing an EcoKL restriction enzyme site (un- 
derlined nucleotides) and a T3 polymerase binding sequence 
(boldface) 5 gagaatt cartaacccicactaaagEgaga)-3 ' , and down- 
stream primers flanked by a linker sequence containing a BamBl 
restriction enzyme site and a T7 polymerase binding sequence 
S'- fogatcc taatocgactcactatagggagJ-j 1 9. The TTMP-1 specific up- 
. stream primers were 5'- acccacxatggccccctttg -3' for f 104 and 5'- 
(linker) r gcaggatggacteugcaca -3' for f!06, and the downstream 
primers were 5'.- linker- actcctcgctgcggttgtgg -3' for f!04 and 5' - 
(linker) - tatctgggaccgcagggact -3; for fl 06. PCR using the 2 f!04 
primers or the 2 f 106 primers was done as previously described. 30 

The PCR products were purified by column chromatography 
using S-200HR microspin columns (Amersham Pharmacia Bio- 
tech, Inc., Piscatway, NJ), and their size tested by agarose gel 
electrophoresis. Both migrated as —300 bp fragments in accor- 
dance with the predicted size (322 and 282 bp, respectively). An 
ABI. PRISM 310 genetic analyzer was employed for DNA se- 
quencing analysis and was performed according to the" manufac- 
turer's instructions (Perkin Elmer, Applied Biosystems, Foster. 
City, CA) using the primers specified above. The DNA sequences 
obtained were confirmed by comparison with the specific TTMP-1 
cDNA nucleic acid sequence (GenBank NMJD03254). 

Plasmids containing human MMP-2 cDNA (pCol7201, bp 647- 
1284) and human MMP-9 cDNA (pCol9202, bp 1751-2326) have 
been described elsewhere. 31 

lh vitro transcription 

Antisense and sense riboprqbes were labeled with j5 S UTP 
(NEN, Boston, MA) by in vitro transcripiion using T7 and T3 
RNA polymerases (Roche, Basel, Switzerland). The DNA tem- 
plate was digested with DNase (Promega, Madison, W3). Nonin- 
corporated S:> S UTP and DNA was removed by column chroma- 
tography using S-200HR microspin columns (Amersham 
Pharmacia Bioiech, Inc., Piscatway, NJ). The 35 S activity was 
adjusted for every probe by dilution to 500,000 cpm/uJ. 

In siiu hybridization 

In situ hybridization was performed essentially as described 
previously. 32 hi brief, 3 u.m paraffin sections were deparaf finized 
in xylene^, hydrated with graded ethanol and boiled in a microwave 
oven for 10-12 min in 10 mM citrate buffer, pH 6.0. After 
additional 20 min at loom temperature, the sections were dehy- 
drated with graded eihanol and the 55 S labeled probes (2X JO 6 cpm 
in 20 ixl hybridization mixture 5 1 per slide) incubated overnight at 
55°C in a humidified chamber. Sections were washed in Ilellen- 
dahl chambers with SSC buffers containing 0.1% SDS and 10 mM 
DTr at 150 rpm at 55°C using a Buhler incubation shaker (Jo- 
hanna Otto GmbH, Hechingen. Germany) for 10 min in 2XSSC, 



10 min in 0.5XSSC, and 10 min in 0.2XSSC. Sections were then 
RNase A treated for 10 min to remove nonspecificalfy bound 
riboprobe. Subsequent wash was performed in 0.2 X SSC as spec- 
ified above. Sections were dehydrated and soaked Into an autora- 
diographic emulsion (Qford), exposed for 5r7 days if not otherwise 
stated and finally developed. Sections were counterstained with 
haematoxylin and eosin. 

Jmmimoperoxidase staining 

Irnmunohistochermstry was performed essentially, as described 
previously.^ Five micrometer paraffin sections were deparaf- 
finized with xylene and hydrated through ethanol/water dilutions: 
Tissue pretreatment was performed with protease-K (5 pg/ml) 
digestion for 20 min. Sections were blocked for endogenous per- 
oxidase activity by treatment with 1% hydrogen peroxide for 
15 min. The sections were washed in 50 mM Tris 150 mM NaQ, 
pH 7.6, containing 0.5% Triton lX-100 (TBS-T). Incubation with 
-antibodies was-done overnight at 4°C. Sheep polyclonal antibodies 
(pAb) against TTMP^l and nonimmune goat IgG were used at a 
final concentration of 4J0 u.g/ml. Two monoclonal antibodies 
(MAb) against TTMP-1 , 33 NM4 {clone rHX6A > NeoMarkersv fie- 
mont, CA) and CalB2 (clone 147r6Dl 1, CalBiohern, Oncogene 
Res. Products, Cambridge, MA), and a MAb against trinitro- 
pbenyl (TOP) 34 were all incubated at 1.0 pg/ml (all 3 MAbs are 
IgGl). CaJB2 MAb recognizes both free TIMP-1 and TTMP-1 
in complex with MMPs. 33 NM4 MAb only recognizes free 
TIMP-1- 33 According to the manufacturer's descriptions, both 
MAbs are raised using recombinant human TTMP-1. The sheep 
polyclonal antibodies were raised by immunization with TTMP-1 
purified from human dermal fibroblasts. The IgG was obtained by 
triple precipitation using ammonium-surf ate and characterized by 
immunodiffusion and rocket i mmun oel ectrophor esis. 35 In adchV 
lion, we have shown that the pAb recognize both free and MMP- 
complexed TTMP-1. 36 Furtbennore, the specificity of the antibod- 
ies was analyzed by "Western blotting analysis against recombinant 
human TTMP-1 expressed in NSO mouse myeloma cells. Here, the 
antibody preparation recognizes a band of approximately 28 kDa 
in accordance with the molecular weight of TTMP-1. To certify 
that the pAb recognize TTMP-1 in colon tumors, the antibodies 
were immobilized on a seph arose column. Total protein extracted 
from 3 colon adenocarcinomas was passed through the cohimn 5 
times and the bound and subsequently eluted protein analyzed in a 
Western blot using a TTMP-1 monoclonal antibody (MAO 5). A 
single band of approximately 28 kDa was revealed in accordance 
with the molecular weight of TTMP-1 (results not shown). In 
immunohistochemistry, the sheep pAb were detected with biotin- 
ylated rabbit-anti-goat IgG, which' cross-react with sheep IgG 
(1:100, code E466, DakoCytornation) followed by horseradish 
peroxidase in complex with streptavidin (code K377, DakoCyto- 
marion). The MAbs were detected with the Envision- mouse re- 
agent (EnVision reagent, K4003, DakoCytoroption), followed by 
tyramine amplification, using biotinyl tyraroine substrate as-spec- 
ified by* the manufacturer (Nen, Boston, MA). Sections were 
developed with NoyaRed substrate as specified by the manufac- 
turer (Vector Laboratories, Burlingnaroe, CA) for 15 min. Finally, 
sections were counterstained in MayeTS haematoxyliri, dehydrated 
in eihanol and mounted. 

Combined in situ hybridization a/id immunohistochemistry 

Double labeling by combining m situ hybridization and imrou- 
nohisiochcmistry on paraffin sections has been described previ- 
ously. 32 In brief, using MAb against a-sm-actin (clone 1A4) 
diluted 1:1000. against cytokeratin (clone AE1/AE3) dijuted 
1:1000. or against CD68 (clone PGM1) diluted l:200 r secuon.s 
were incubated for 2 hours at room temperature and then detected 
I with anti-mouse-IgG/hoTse radish perbxidase-conjugated polymers 
(Envision-mouse reagent, DakoCytomation, Glostrup, Denmark). 
Sections were developed with diaminobenzidine (DAB) for 
7-10 min, and immediately dehydrated for in situ hybridization, 
which was performed as described above using the antisense 
probes of fl04. Sections were counterstained with haernaioxyhn. 
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Results 

Analysis of TIMP-1 probes and antibodies for in situ 
hybridization and immunohistochemistry 

Histopathologic^ diagnosis of prospectively collected speci- 
mens from 18 colorectal lesions revealed 16 colorectal adenocar- 
cinomas, 1 villous adenoma and 1 malignant lymphoma. ^S- 
labeled antisense and sense RNA probes were generated by in vitro 
U^nscnption from 2 nonoverlapping DNA sequences of the human. 
TTMP-1 cDNA and tested by in situ hybridization on adjacent 
secdons from 5 of the colorectal adenocarcinomas. The 2 antisense 
probes showed ah identical hybridization pattern in all the 5 cases, 
located an the stromal compartment surrounding the invading 
cancer cells, while no specific signal was seen with the 2 sense 
probes (F?g. 1). To test whether the TTMP-1 mRNA was accom- 
panied byHMP-l protein expression, iiranunohistoche'mistry was 
performed on 8 of the adenocarcinomas (including the 5 men- 
tioned abov e) and the malignant lympho ma usin g sheep anti- 
numan TlMFl polyclonal antibodies on sections adjacent to 
TTMP-1 in situ hybridized sections. The TCMP-1 rnRNA and 
imrounoreactiyity was observed in the same ceDs in aB of the 9 
casestested (Fig. 2A\ including the malignant lymphoma. The 
anti-TTMP-1 polyclonal antibodies did not react with other cell 
populations in all of 8 adenocarcinomas and the malignant lym- 
phoma apart from some normal and malignant epithelial cells that 
were weakly stained on the luminal apical surface. Two MAbs 
against TTMP-1 (CalB2 and NM4) required strong signal amplifi- 
cation but showed a staining partem similar to that of the poly- 
clonal antibody preparation (Fig. 2J3), with the only exception that 
neither: of the 2 MAbs stained the luminal apical surface of the 
normal and malignant epithelium; No signal was obtained with 
^ n ??- UnC S ° at SCrum 01 a MAb (° f san >e subclass as Calb2 and ■ 
' directed against the synthetic hapten trinitropheny) (TNP). 

TIMP- J mRNA expression patterns in colon cancer 

Expression ofTTMP-1 mRNA was then analyzed in I the remain- 
ing 9- colorectal lesions by in situ hybridization. TTMP-1 mRNA 



expression was in all the cases of colon adenocarcinoma (inc)udino 
those mentioned above) highly expressed in stromal fibroblast-lnV 
cells located at the invasive front (Fig. 3a,d). TIMP-1 mRNA 
signal was also observed in fibroblast-like cells located in the 
tumor stroma towards the colonic lumen in 8 of 10 cases where 
to nssue structure was present (data not shown). No or little 
TIMP-1 mRNA was detected in the central part of the Carcinomas, 
in 5 of the 16 colorectal adenocarcinomas, we observed TTMP-1 
mRNA signal in some fibroblast-like cells located around the * 
muscle layer, of some arteries located in the submucosa distant 
from the cancer area. The normal colonic mucosa, including the 
lamina propria that- was present in all samples tested, was generally 
negative (Fig. 3M- Only ^relatively weak TIMP- 1 mRNA signal 
was detected in stromal fibroblast-like cells surrounding one or a 
very few normal crypts (Fig. 3c % f) in 3 out of 6 cases tested with 
extended exposure time (10 days »x usually 5 days). In the villous 

.t^^i^^T: ° n]y 3 fCW TIMP ^ mRNA Positive cells asso- 
rted wjIL f^al fflflanunaTion (data hot shown). In the malignant 
lymphoma of the colon, TTMP-1 mRNA expressing fibroblast-like 
cells were, different from the adenocarcinomas, located in a dif- 
fuse pattern throughout the whole tumor. No. TTMP-1 mRNA 
signal was observed in any of the 18 cases in the cancer cells, 
smooth muscle cells or vascular cells. - . 




r Jl ,C fi URE ^ b J' bridiMt ion with 2 nonoverlappj Df > TTMP-1 

H^IpH r 0Mran ° I ? * eTC abated with 2 Donoverlapping «S- 
labded antisenseprobes for TIMP-1 mRNA [f!06 (a M andfHW (di\ 
and a concspondmg T1MP-J sense probe, f 06 (c). TT^ i situ hy 

The " mtk^n I"' ^^P 31 *™ in d ^field illumination (b~d). 
hXww ^ Pr w" Sh0W ^ samc hybridization partem and the 
hyhndization signal « seen in the same cells (arrows in a b and d) 

ihc TIMP-1 mRNA « located in the tumor so oma (indicated by 
Sr) surround.^ the mvasive cacccr cells (Ca) that are devoid of 
I1MJM m situ hybridization signal, (a^d): Bar =100 ixm. 



Characterization of TIMP-1 mRNA expressing cells 

To test whether the TO^l mRNA positive fibroblast-lilce cells - 
could be (myofibroblasts and/or macrophages, sections from 4 
colorectal adenocarcinomas and the malignant lymphoma were 
farst jmrmmohistochemicaJly stained with - antibodies directed 
Dt ;^" actiD [foT Section of myofibroblast/sroootb muscle 
cells.(SMC)] or CD68 (for detection of macrophages) and subse- 
quently incubated with a TIMP-1 mRNA antisense probe In 
normal colon tissue, a-sm-actin is expressed by vascular smooth 
muscle cells, smooth muscle cells of lamina muscularis mucosae 
and tunica muscularis as well as pericrypial myofibroblasts. 37 In 
colon tumors, a-sm-actin is expressed by tumor- associated fibro- 
blast-like cells located throughout the tumor stroma, which are 
defined as myofibroblasts. No TIMP-1 mRNA was detected in any 
a-sm-actin positive smooth muscle cells, including those of the 
vessels, the lamina muscularis mucosae and the tunica muscularis. 
In addmon, no TIMP-1 mRNA was detected in the a-sm-actin 
positive pencryptal myofibroblasts of the larnina propria in any of . 
the .5 lesions. TIMP- J mRNA signal was- in contrast seen in 
a-sm-actin-positive tumor associated myofibroblasts located at the 
invasive front of the colon cancers. In 3 of the adenocarcinomas, 
more than 80% of TTMP-1 mRNA-positive cells located close lo 
0>e invading cancer cells were a-sm-actin-positive (Fig 4). 
TTMP-1 nuWA positive fibroblast-like cells located more distant 
ftom the invasive cancer cells, towards the submucosa, expressed 
little or no a-sm-actin. In 1 adenocarcinoma and in the malignant 
lymphoma approximately 50% of the TIMPl mRNA positive cells- 
expressed a-sm-actin. Thus, the TTMP-1 mRNA expressing cells 
constitute a subpopulaiion of tumor-associated myofibroblasts lo- 
cated at the invasive front of the tumor. No TTMP-1 mRNA signal 
could be identified in any of the CD68-positive cejls (Fig. 4). 

Expression of TIMP-J and MMP-2 and 9 in colon cancer 

MMP-2 and MMP-9 are 2 type IV co])ao enases expressed m the 

^x^> 1V o e u CanCef: ,i5SUe of colorec,a l adenocarcinomas. Thus 
MMP-2 has been reported to be expressed by fibrobiast-Jilce cells 
jn the cancer stroma,'^* and MMP-9 by macrophages at the 
leading edge of the invasive cancer. 3S To dircctrv compare the 
expression patterns of MMP-2 and MMP-9 with that of TIMP- J, 
adjacent sections from 5 colorectal adenocarcinomas were hybrid- 
ized with probes for TIMP-1, MMP-2 and MMP-9 niRNAs. We 
found thai the expression of TIMP-1 mRNA was localized char- 
acteristically at the invasive hont of the growing tumor, whereas 
the expression of MMP-2 mRNA was most intense in the central 
areas, showing decreased expression towards the invasive front 
(Hg. Do). MMP-9 mRNA expressing cells were found nt the 
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2 ~Jn situ hybridization and immunohistochemistry for TIMFf- 1 in human colon cancer. (A) Two- adjacent sections from a ^u^ 3 "^^ 0 ™ 
inoma were incubated wiih polyclonal antibodies against TJMP-J {a t c) and a TIMP-1 mRNA antisense probe (2>). The TIMP-1 



FiGiaiE 1 

adenocarcinoma were incubated wiih polyclonal 0 . . ,_ F _, T _ _ 

immnnoreacrivity (red-brown color, arrows in a and c) and the TIMP-1 mRNA (silver grains, arrows in b) are identified in the same cells (arrows 
in a,b). Immunoperoxidase staining with the TIMP-1 pAb reveals the T1MP- 1 -positive cells as fibroblast- like cells (arrows in c) located in the 
stroma (St). No TIMP-] imrounoreactivity is seen in cancer cells (Ca). a,b: bais 50 pirn; c: bars =13 pin. (#) Four consecutive a^"*" 1 
sections were incubated with CalB2 MAb ami-TlMP-1 (a)," NM4 MAb (b\ sheep anti TIMP-1 pAb (c) or mouse anti TNP (d). The 3 MAbs 
were detected with Envision, reagent followed by TS amplification and the sheep pAb with biotinylated rabbit anti-goat followed by 
HRP-conjugated streptavidin (see Material and methods). The 3 T)MP- 1 antibodies react with the same cells (arrows). No irnrounore activity is 
seen when the sections are incubated with anti-TNP. 



invasive front like those expressing T1MP-J mRNA but with a 
distinctly different distribution. Foci wiih high expression of 
TJMP-1 mRNA were not accompanied with increased expression 
of MMP-9 mRNA and vice-versa (Fig. 5b). Thus, TIMP-1 mRNA 
expression is not coregulated with MMP-2 or MMP-9 mRNA 
expression. 

TJMP ]^ in adenomas and Dukes' siaze A carcinomas 

TIMf*- J antigen can readily .be measured ";r> blood and we have 
previously reported that levels of T1MP- 1 in blood ore significantly 



elevated in colorectal cancer patients compared to healthy donors 
and that high plasma TIMP-1 levels are associated with short 
survival of colorectal cancer patients. 21 - 3 * TTMP-1 has therefore 
been suggested to be a novel marker for detection of early stage 
colorectal cancer and for prognostic stratification- of colorectal 
cancer patients.- 1 - 39 These findings, together with the characteristic 
expression pattern of TIMP-1 at the invasive front of virtually all 
the colon cancers and die absence or minute TIM P-l expression in 
normal and benign colon mucosa, prompted the evaluation of 
TIM P-l expression as a marker for earjy invasive colon cancer. 
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«*- ~ Scions were firs, for immuno . 

^sx^^ ^sa&is^ £ esse 

TTMP-J mRNA signal (red arrows in o£ Bars = J3 ^m. e «- s ra-aclin-posmve myofibroblasts have little or no 



Ion fi \ "IT'" XIMP -' mRNA »P««i0D.in an addi- 
tional 6 colorectal adenomatous polyps with ibe expression in an 
addmonp, 8 Dukes' sta-e A colorectal adenocarcinomas. TJMP™ 

If » T de ' CC,ed in 2 °' 6 3denomas ' all 8 

DuXes stage A caronomas showed TIMP-1 mRNA sional ai ,be 

ST 6> ' ' POS " iVe adC "°" ,a Related type) 

Ik m A e< P ress ' on wa s confined to a single focus in 

; T'T h,5,0,0 S ical =»«'l>'sis of additional sections from 
h. satnple clearly revealed disruption of tbe dysplastic epithelium 
■ be same area 1„ tne olheI T J M p_, mRNA positive adenoma 
sess.le type) □ few TIMP-l mRNA expressing fibroblasts were 

"a IT"*? r nal - e "" n °' d ' reC,)y aSSOCia ' ed ' h ~ 

aiea (data nor shown). 



<h I i n S'^ a " of (hC 9 JDukes ' st3 S c A- carcinomas analyzed 
showed TIM P- mRNA egression in myofibroblasts located at 
he mvas.ve from of ibe rumors, whereas wpicssion was detected 
m only 3 of 7 adenomas, and in these was seen in fibioblast-Iike 
«Hs associated w«h focal inflammation at the epiibelial-s.romal 
interface in 2 of the cases and with arteries in the submucosa in I 



Discussion 

Our study was undertaken to clarify .he expression and cellular 
locabzunon pi the MMP inhibiior TJMP-i in human colon ade- 
nocarcinomas. Our studies were founded on the use of 2 specific 
anr.sense RNA probes derived from 2 oonbveijnppin" TIMP-1 
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Figure 5 - in situ hybridization for TTMP-1 , MMP-2 and MMP-9 in 
human colon cancer, (4) Adjacent sections were incubated with probes 
specific for TIMP-1 mRNA (o.c) and MMP-2 mRNA {b t d) y respec- 
tively, and is shown in brigbtfield (a t b) and darkficld illumination 
(c.d). The TCMP-1 mRNA signal increases towards the submucosa 
(Sm) whereas the MMP-2 mRNA signal decreases and is most intense 
in the central areas <Ca). (B) Adjacent sections were incubated with 
probes specific for TIMP-1 mRNA (a t c) and MMP-9 mRNA (b t d) r 
respectively, and is here shown in bright field (a,b) and darkfield 
illumination (c,d). Both the TIMP-1 mRNA signal and the MMP-9 
mRNA signal aje most intense ai the invasive fiont towards the 
submucosa (Sm), but their expression patterns are quite different, with 
MMP-9 showing the most restricted expression. Bars = 100 ixm. 



cDNA fragments and specific pAb and MAbs against human 
TIMP-1. The TIMP-1 mRNA signal in all colorectal adenocarci- 
nomas investigated was seen in fibroblast -like cells located in the 
tumor periphery. An identical hybridization pattern was observed 
with the 2 antisense TIMP-1 pTobe.s and application of comple- 
mentary sense probes on neighboring tissue sections as negative 
controls did not result -in any hybridization signal; therefore, we 
conclude that the hybridization signal generated with the antisense 
probes represents the genuine TTMP-1 mRNA. TIMP-1 imjinino- 
renciivity wns also distinctly located in fibroblast-like stromal cells 
injhejomor peripheiy, and these cells were identified to be the 
same cells as the TIMP-1 mRNA expressing cells. A preparation 



of sheep pAb against human TTMB-T 35 and 2 well-characterized 
MAbs stained the very same cells in the tumor stroma.. Weak 
staining of the apical surface of soine normal and malignant 
epithelial cells was observed with tbe pAb in some of the samples. - 
No staining was obtained when roe.Janti-TIMIM antibodies were 
substituted with nonirnmune goat serum or anti-TNP MAb isuxh 
! bated at the .sairie concentrations- These immunoWstCK^ermcal . 
findings strongly suggest that the- TIMP-1 antigen detected in the 
fibroblast- lilce cells represents the genuine TTMP-1 protein. 

In our study, we found TTMP-1 mRNA expression in stromal 
fibroblast-like cells located in the tmnbr periphery in all colorectal 
adenocarcinomas tested, whereas no expression was detected in . 
the. cancer ceils in.. any -of the. cases tested.* This finding is in 
agreement with studies by Zeng and colleagues, 12 ^ but is partly in 
i disagreement with findings by Newell and : colleagnes. 37 In addi- 
joori to TTMP-1 mRNA signal in fihroblast-Kke cells in the turner 
•iperi phery T Newell a nd colleagues 27 detected a weak TIMP-1 
mRNA signal iii both benign and malignalt epithelial cells. 27 This ' 
observation was, however,, based on the use' of probes: fiom a 
5ing|eTTMP-l cDNA subclone and no additional comrols;to verify 
the expression pattern. Tbe difference between out results and 
those of Newell and colleagues may be explained by methodolog- 
ical differences, since the procedure employed, by Newell and 
colleagues was in several steps different from the one used in the 
present study, e+g., Newell and Colleagues used 3 fl-labeled probes, 
whereas we used 35 S -Jabeled probes. It is in this context notewor- 
thy that in order to look for a low expression level of TTMP-1 
mRNA, we performed in situ hybridization experiments with pro- 
longed exposure time (10 days vs. usually 5 days) with both our 
TIMP-1 antisense probes and both TTMP-1 sense probes, but with 
this challenge we did not detect any TIMP-1 mRNA in any 
'epithelial cells. It cannot be excluded though that the TIMP-1 
IrnRNA is expressed in epithelial cells below the detection hnut of 
jour in situ hybridization procedure. 

An interesting observation in our study was the characteristic 
'intense TTMP-1 mRNA and protein expression in the tumor pe- 
riphery of all colon adenocarcinomas, while little or no expression 
jwas seen in the center of the carcinomas. Only in the colorectal 
lymphoma did we find TTMP-1 mRNA and protein expression in 
jfibToblast-like cells located throughout the tumor tissue. The 
iTTMP-l expression pattern in the colon adenocarcinomas is in 
jcontrast to the expression pattern reported by Hewitt and col- 
leagues, 19 who found that the TTMP-T staining in most colorectal 
'adenocarcinomas was equally intense in fibroblasts throughout the 
jiumors and thai some of the cases even showed decreased 71MP-1 
'signal intensity towards the rumor periphery. This difference may 
|be explained by possible cross-reactivity of the polyclonal anti- 
bodies employed by Hewitt and colleagues or that Hewitt and 
'colleagues employed cryostai sections, while we analyzed paraffin 
'sections. ; * 

I The TTMP-1 expressing cells had a fibroblast-like morphology 
jand using combined in situ hybridization for TTMP-1 mRNA and 
ijnmiunohistochemistry for a-srh-actin, we found that many, gen- 
erally more than 50%, of the TTMP-1 mRNA positive cells coex- 
pressed o>sm-actin. According to the cellular morphology of the 
TlMP^l expressing cells and their localization in the invasive- 
'front, we could conclude dial the cells were myofibroblasts and not 
smooth muscle cells. 

I The myofibroblast is a cell type present in the normal colon 
mucosa, that originally was described as a pericryptal fibro- 
blast 37 so and later was identified with antibodies against a-sm- 
'aciin. 37 In the lamina propria, !he myofibroblasts form a continu-. 
! ous cell layer just below the intestinal epithelium. The pericryptal 
myofibroblasts are phenotypically different from the neighboring 
'quiescent interstitial fibroblasts that do not express markers ol 
'smooth muscle cells/ 1 During early steps of colonic turn on genesis 
the number of myofibroblasts is significantly increased. 41 The 
fTlMP-3 expressing myofibroblasts may be generated after activa- 
tion of me pericryptal myofibroblasts and/or the quiescent inter- 
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sbtial fibroblasts. Adegboyega and colleagues 41 hypothesized that 
the- tumor-associated myofibroblasts originate from the quiescent 
interstitial fibroblasts of the lamina propria, rather than from peri- 
cryptal myofibroblast or smooth muscle cells, which may help to 
explain why we found some of the TIMP-1 mRNA expressing 
fibroblast-hke cells a-sm-actin-positive and some a>sro-actin-ne°- 
aove. ° 

Several MMPs including MMP-2, MMP-II and MMP- 14 are 
expressed by fibroblast-like cells in human colon cancer wi- 

W ^ Ch may indeed m y° fibroWa sts. The role of the (myo-) 
fibroblasts in colon cancer progression is not known. Since TIMP- 1 in 
human colon cancer appears only to be expressed by fibroblast-like 
cells most of which are myofibroblasts, and since high T1MP-1 levels 
measured in blood or tumor extracts from colon cancer patients are 
strongly associated with a poor prognosis*^ ^* ; t could be argued 
that the TIMP-1 expressing myofibroblasts play a tumoi -promobn o 
role, lrruriunohistochemical localization studies of proteins involved 
in the activation and regulation of the efficient serine protease plas- 
minogen, including urokinase plasminogen activator (uPA) and its 
specific inhibitor PAJ-] show that both are mainly expressed by 
myofibroblasts in human breast cancer.^ Hisb levels of tiPA and 
PAl-l^are strongly correlated with poor prognosis in breast can- 
cer, supporting (he assumption that the myofibroblast express a 
promoting role m cancer invasion. We recently reported that me 
predommam PA1- 1 expressing cell in human colorectal cancer also is 
the myofibroblast,^ and earlier studies indicated that elevated levels 
. r^J. ,n CoJon C3ncer Paiienw are associated with poor progno- 
sis- Together these findings indicate thai myofibroblasts are stron°lv 
contributing to the expression of proteins involved in the reflation" of 
extracellular mam* degrading proteases that facilitate cancer invasion 
and metastasis. 

A particularly interesting finding of the present smdy was the 
absence of TIMP- J. mRNA in 4 of 7 adenomas, whereas in all ot 
9 Dukes' . singe A carcinomas ihe TIMP- J mRNA was expressed in 
hbroblasi-bke cells along the invasive from. In the ? benign lesions 
m which .he TIMP- 1 mRNA wns seen in the 5denoma%ren the" 



TIMP-1 mRNA positive cells were confined to a single focus with 
locally increased inflammation related to the dysplastic epithelium. 
Evident disruption of the dysplastic epithelium was observed in the 
adenoma with most intense TJMP-1 mRNA signal. Intestinal in- 
flammation may be caused by disruption of the mucous epithelium 
that leads to focal leakage of mucinous colon material into the 
lamina propria. Increased intestinal permeability is a common 
deficiency in Crohn's disease and interestingly TIMP-j mRNA 
was found in the intestinal granulation tissue of Crohn's disease 50 
and is expressed by myofibroblasts isolated from Crohn's dis- 
ease. 5 ' Induction of TIMP-1 in myofibroblasts in a benisn or 
preinvasive tumor may also be a response to locally increased 
MMP activity or a response to the presence of a specific MMP in 
the local microenvironment. MMP-2 and MMP-9 mRNA expres- 
sion, however, did not appear to be cpregulated with TIMP-1 
mRNA expression in the colorectal adenocarcinomas. Specific 
MMPs may indeed be involved in the transition of noninvasive to 
invasive disease; in smdies of preinvasive lesions (ductal carcino- 
mas in situ) of the human breast we recently reported that MMP-13 
is specifically expressed in myofibroblasts associated with micro- 
invasive events. -2 Future studies may clarify whether TIMP-1 
expression in colorectal adenomas is coneJaied with expression of 
specific MMPs, cytokines and/or growth factors, such as TGF-p ) 
and TGF-0251, and whether TIMP-Ican be used as a histopaibo- 
Jogical marker for malignancy in colorectal tumors. 



Acknowledgements 

We thank Dr. G. Murphy for ihe TIMP- 1 cDNA, the polyclonal 
antibodies against T1MP-K MAC 15 and recombinant TIMP- 1. We 
are grateful to P.G. Knudsen and C: J_0nborg for their excellent 
technical assistance, and to L.R Engelholm for desisnin^ ihe 
primers. We appreciate the critical comments, to the article by Dr.._. 
G. KcHcrinann. - . 



Best Available Copy 



■> 



HMP-1 EXPRESSION IN THE COLON 

References 

I 



205 



31. 



1. . Liotta LA, Steeg PS, Steder-Steveason WG; Cancer mctastaas aod 

angiogenesis: an imbalance of positive and negative regulation. Cell 
1991;64:327-36 

2. Stetler-Stevenson WG, Yo AE. Proteases in invasion: matrix metal- 
loproteinasevSeiiuVCance^ . 

3J Murphy G, Kokliris P, Came AF. Dissociation, of tissue inhibitor of 
roetaBbproteinases (T1MP) from enzyme complexes yields fully ac-* 
tive mhibitor. Biocnem J 1989#6 1:103 1-4. 

4; Welgus HG, Srricklin GP. Human skin fibroblast collagenase inhibi- 
tor. Comparative studies in human connective tissues, scram, and 
amniotic fluid. J Bid Chero 1983*258:12259-64.. 

5. Khokha R, Waterhpnse P. The role of tissue inhibitor of metaDopTO- 
teinase-1 in specific aspects of cancer progression and reproduction. 
J HenrooncoV J$94;18:123-7. 

6\ - * Docberty AJ, Lyons A, Smith BJ, Wright EM, Stephens PE, Harris. 
Tf, Murphy G, Reynolds JJ. Sequence of human tissue inhibitor of 
metaUoproteinases and its identity to eryuVoid-potentiating activity. 
Nature 1985:31 8:66-9, • " 

7l Guedez L, Courtemanch L, Sleder-Stevenson Tissue inhibitor ofr 
metalloproteinase (TCMP)-1 induces differentiation and an antiapop- 
toticphenotype in germinal center B cells. Blood 1998;92:1342-9. 

8. Hewitt RE, Brown KB, Corcoran M, Stetler-Stevenson WG. Increased 
expression of tissue inhibitor of metaOoproteinases type 1 {71MP-I) 
in a more tumourigenic colon cancer cell line. J Patbol 2000:192: 
455-9.. • 

9. Guedez X, Stetler-Stevenson WG, Wolff L, Wang J, Fuknshima P, 
Marisoor A, Stetler-Stevenson M. In vitro suppression of programmed 
cell death of B cells by tissue inhibitor of metalloproteinases-1. J din 
Invest 1998;102:2002-J0. 

10. Li G» Fridman R, Km HR. Tissue inhibitor of meLalloproteinase-1 
v inhibits apoptosis of human breast epitheb'al cells. Cancer Res 1999; 

59:6267-75. . , 

11. Jiang Y, Goldberg ID, Shi YE. Complex roles of tissue inhibitors of 
metaUcproteinases in cancer. Oncogene 2002;21:2245-52. 

12. Zeng ZS, Guillem JG. Distinct pattern of matrix metalloproteinase 9 
and tissue inhibitor of metalloproteinase 1 mRNA expression in 
human colorectal cancer and liver metastases. Br J Cancer" 1995:72: 
575-82. 

13. Gomez DE» Alonso DF, Yoshiji H, Tborgeirsson UP. Tissue inhibi- 
tors of metaHoproteibases: structure, regulation arid biological func- 
tions. Eur J Cell Biol 1997;74:111-22. 

14. Guillem JG, Levy MF, Hsieb LL, Johnson MD, LoGerfo P, Forde 
KA, Wehistein IB. Increased levels of pborbin, c-myc, and ornithine 

^decarboxylase RNAs in human colon cancer. Mol Carcinog 1990J: 
68-74. 

15. Lu Xp, Levy M. Weinstein IB, Santella KM. Immunological quanti- 
tation of levels of tissue inhibitor of metalloproteinase- 1 in human 
colon cancer. Cancer Res 1991 $ 1 :623 1-5. 

16. Mimori K, Mori M, Shiraishi T, Fujie T, Baba K, Haraguchi M, Abe 
R, Ueo H, Akiyoshi X Clinical significance of tissue inhibitor of 
metalloproteinase expression in gastric carcinoma. Br J Cancer 1997: 
76:531-61 

17. Fong KM. TTMP1 and adverse prognosis in non-small ceD lung 
cancer. Clinical Cancer Research 1996;2:1369-72. 

18. Ree AH, Florenes VA, Berg JP, Maelandsmo GM, Nesland JM, 
Fodstad O. High levels of messenger RNAs for tissue inhibitors of 
meialloproteinases (T1MP-I and T1MP-2) in primary breast carcinc- 
mas are associated with development of distant metastases. Clin 
Cancer Res 1997;3:1623-8. 

19. Hewitt RE, Leach 1H, Powe DG, Clark 1M, Cawston TE, Turner DR. 
Distribution of collagenase and tissue inhibitor of metaUoproteinases 
(7TMP) hi colorectal tumours. Inl J Cancer 1991;49:666-72. 

20.. Zeng ZS, Cohen AM, Zhang ZF, Stetler-Slevenson W, Guillem JG. 
Elevated tissue inhibitoi of metalloproteinase 1 RNA in colorectal 
cancer -stroma correlates with lymph node and distant metastases. Clin 
Cancer Res l°95;J:899-906. 

21. Holten- Andersen MN. Stephens RW, Nielsen BJ, Murpby G, Chris- 

tensen I J, Stetler-Slevenson W, Bnmner N. High preoperative plasma I 
tissue inhibitor of metalloproteinase- 1 levels are associated with short j 
survival of patients with colorectal cancer. Clin Cancer Res 2000;6: 
4292-9. In. 

22 Oberg A. Hoyhtya M t Tavelin B, Stenling R, Lindmark G- Limited ' 
value of preoperative serum analyses of matrix meLalloproleinases 
. (MMP-2 f MMP-9) and tissue inhibitors of matrix metaUoproteinases I 
("J1MP-1, TIMP-2) in colorectal cancer. Anticancer Res 2000:20: 42. 
1085-9L 

23. Pellegrini P. Comasta 1, Bergbella AM, Gaigano E, Mammarella C, 
Adorno D. Simultaneous measurement of soluble carcmoembryonic 
antigen and the tissue inhibitor of metaUoproteinase T1MP1 serum 43. 
levels for use as markers of pre-invasjve to invasive colorectal cancer. | 
Cancer Immunol Imniunotlier 2000;49:388-94; i 



,24. 



Grondahl-Hansen J,.Cln^ensen jj^. Rosenqmst C; Bnmner N 
I Mouridseto HT, Dano K, Blichert-Toft Ml High Fev^s of nrokinase- 
i type plasminogen activator and its irmsbitor PAIrJ in cytosolic ex- 
• tracts of breast can^noxnas are associated with poor prognosis. Cancer 
Rcs 1993;53:2513-21.. : 1 . 

25. Nielsen HJ, Pappot H, ChristenicD H, Bttoiot^; ThbrJadns-Ussing 
O, Moesgaard F, Dano K, Grondahl-Hansen J. Association between 
-. plasma concentrations of plasminogen activator inhibitor- 1 and sur- 
I vival in patients with colorectal cancer. BMJ 1998^16:829-^30. 
126. Wind T, Hansen M, Jensen JKi Andrcasen PA* The molecular basis 
' ' for anti-proteolytic and non-proteoJytic fbnetions of plasminogen ac- 
| tivator inhibitbr type-1: roles of the reactive centre loop, the shutter 
j region, the flexible joint region and the sinaU.serpin fragment Biol 

/Cbem20(Kt383ai-36i-- * 
(27. Newell KJ, tWtty JP, Rodgers "WH, Matrisian. LM. .Expression and 
. localization of matrixrdegtading me^oprbteinases during colorectal 
j tpmoiigenesis. Mol i^rcinog 1994;10:199-206. 
'28. Tomita T, hvata K . Matrix metaBc^roteiMsei and tissue inhibitors of 



metaJlopiotemases in colonic adencm^-adenbfearcinbmas. Dis Colon 
j . Recfiam 1996^39:1 25^-64. . . . 

,29. O'Shea M, Willenbiock: F, TOhamson Cockett MI,- fteedman 
RB, Reynolds JJ, t)ocherty AJ, Murphy, G. Site^irected- mnUu" ons 
i that aher. the inhibitory activity of the tissue inhibitor of metaUopro- 
i • teinases-1: importance of the ^terminal region between cysteine 3 
| and cysteine 13. Biochemistry 1992*31:10146^52, 
3a. Engelhohn LH, Nielsen BS, Netzel-Aroett S, Solberg H, Chen XD, 
| Lopez Garcia JM, Lopez-Otin C,'Yonng MF, Birkedal-Hansen H, 
j Dano K, Lnnd XR» Behrendt N, et al. The urokinase plasminogen 
activator receptor-associated protein/endol80 is coexpressed with its 
interaction partners urokinase plasminogen activator receptor and 
matrix metalloprotease-13 during osteogenesis.- Lab Invest 2001 ;81: 
1403^14. 

Pyke C, RalOiaer E, Huhtala P, Hwsxainen T, Dano 1^ Tiyggvason 
| K. Localization of messenger RNA for Mr 72,000 and 92,000 type IV 
collagenases in human skin cancers by in situ hybridization. Cancer 
| Res 1992-52:1336-41. 

■32. Nielsen BS, Rank F, Lopez JM, Balbin M, Vizoso F, Lund LR, Dano 
i K, Lopez-Otin C Collagenase- 3 expression in breast myofibroblasts 
I as a molecular marker of transition of .ductal carcinoma in situ lesions 
to invasive ductal carcinomas. Cancer Res 2001;61:7091-100. 



35. 

J 

36.' 



33. Holteh-Andersen MN, Brunner N, Maimonis P, Jensen V, Murphy G, 
| Piironen T. Characterization of monoclonal antibodies to tissue inhib> 
i itor of metal loproteinases- 1 . Journal of clinical ligand assav 2002-^25: 
' 87-90. 

34. Sbuhnan M, Wilde CD, Kohler G. A better cell line for making 
| bybridornas secreting specific antibodies. Nature 1978;276:269-70. 

Hembry RM, Murphy G, Reynolds JJ. Immuholocalization of tissue 
inhibitor of metaUoproteinases (TTMP) in human cells. Characteriza- 
tion and use of a specific antiserum, J Cell Sci 1985;73:105-19. 
Holten-Andersen MN, Murphy G, Nielsen HJ, Pedersen AN, Chris- 
lensen V t Hoyer-Hansen G f Brunner N, Stephens RW. Quantitation of 
TTMP-1 in plasma of healthy. blood donors and patients with advanced 
cancer. Br J Cancer 1999;80:495-503. . 
37. Sappino AP, .Dietrich FT, SkalB O, Widgren S, Gabbiani G. Colonic 
' pericrypta) fibroblasts. Differentiation pattern in embryogenesis and 
phenotypic modulation in epithelial proliferative lesions. Virchows 
Arch A Pathol Anat Hislopatbol 1989;415^51-7. 
J38. Nielsen BS, Timsbel S. Kjeldsen L, Sehested M, Pyke C, Bonegaard 
\ N, Dano K. 92 kDa type IV collagenase (MMP-9) is expressed in 
I neutrophils and macrophages but not in malignant epithelial cells in 
I human colon cancer. Int J Cancer 1996;65:571-62. 

39. Holten-Andersen MN, Christensen 1J, Nielsen HJ. Stephens RW, 
I Jensen V, Nielsen OH, Sorensen S, Overgaaid J, Lilja H r Hairis A r 
j Murphy G. Brunner N. Total levels of tissue inhibitor of melallopro- 
. temases 1 in plasma yield high diagnostic sensitivity and specificity in 
| patients with colon cancer. Clin Cancer Res 2002; 8: 156464. 

40. Kaye Gl r Lane N, Pascal RR. Colonic pericryptal fibroblast sheath: 
replication, migration, and cyiodifferentiaubn of a mesenchymal cell 
system in adult tissue. II. Fine structural aspects of normal rabbit and 
human colon. Gastroenterology 1968;54:852-65. 
Adegboyega PA, 'Mifflin RC. DiMari. JF, Saada Jl, Powell DW. 
Immunohistochemical study of myofibroblasts in normal colonic mu- 
cosa, hyperplastic polyps, and adenomatous colorectal polyps. Arch 
Pathol Lab Med 2002:126:829-36. 

Poulsom R, Pignatelli M. Stetler-Slevenson WG, Liotta LA, Wright 
PA, Jeffery RE, Lonecioft JM. Rogcis L, Stamp GW. Stromal ex- 
pression of 72 kda type IV collagenase (MMP-2) and TIMP-2 niR- 
NAs in coloreciaf neoplasia. Am J Pathol 1992;141:389-96. 
Rouyer N, WoJf C, Chenard MP, Rio MC, Qiambou P, Bellocq JK 
Basset P. Slromelysin-3 gene expression m human cancer: an over- 
view. Invasion Metastasis 1994;14:269-75. 



206 



HOLTEN-AKDERSEN ETAL 



*££ JP * N * Cheparf MP, Rio MQ Chambon P. 

i, ^frvr^"^^ ^ f 3 *"* ractaJlopmtdnasc (MT-MMP) gene 
« «F«sed m stroraaf cells of hnman colon, breast, and head and 
neck carcmoraas. Proc Nail Acad Sd U S A 1995:92:2730-4 " 

45. Nielsen BS, Sehested M , Dunn S, Rank F, Tn^bel 5 Rygaarf J 
SSSTtt. ^ * Ur0jdMSC P^-^n activator is S m' 

46 Srv ^r^c^ ~ Ub Ilivcst 200I;81:148M01 
£f^J£ J*?? ^^ HansCD G. Rank F. Hanrilton-Dotoit 
n^^Sf ' ^ dlt ?*» PA - ^ myofibroblast is the predominant 
fir^ SCT A aCbV I a i 0r,nbibitor - 1 ^pressing cell type in C brelS 
caronomas. Aro J Paihol 2003;163:1887-99 luauujcasi 

K' S * Q N, Fennelly JJ, An- 

™^Lv ^^fasmmoscn activator, a new and independent 

HT, Dano K, Bhchert TM High levels of uroKnase-type plasminogen 



activator and its inhibitor PAI r l in cytosolic extracts of breast carri 
nomas axe assoaattd with poor prognosis. Cancer Res 1993^3:25^ 

49. DlemannM^Iansen U, Nielsen Hi, Andreasen PA, Hjjyer-Hansen G 
Land Dan* K, Nielsen BS. Leading edgTmyofi^rSS 
human colon cancer express PAH. Am J Clin Patbol 2004; 1^56- 

50. Saaria^Kere UK, Vaalamo M, PnoJaklcainen P, Airola X, Parks 
WC KarjaJam^dsberg ML. Enhanced expression tfma^ 

51 * ^^^^^ °' Watson ^Mafcidi YR. Expression and 
regulation of *sue jnhibitor of metalloprotemase-l andST^t 
aJ)oproi c; nases> intesnnal myofibroblasts in ir^a^torTwi 
disease. Am XPaibol 2003;! 62:1 355-1 36U . . 



Best Available Copy 




INTERNATIONAL 



Journal 



ancer 




Best Available Copy 



International Union Against Cancer 



^UiCG 

global cancer control 



J. Seffrw,* President (USA) 
LJ; Denis, Treasurer (Belgium) 
• J. Batty,. Chair, Finance Committee (USA) 

G. Brier (Australia) 
R.C Burton (Australia) 
M-Daube (Australia) 
KA Dinshaw .(India) 
L. Elovainio (Finland) * 
MX Gospo^arowjcz (Canada) 
R,Tl Huwow (Ireland) 

T. Kiiauawa (Jap aar) ^ ■ ' . '" 

A. Pundaux Kurkure (India) 
R£i Lenbard (USA) . 
A, Llombart-Bosch (Italy) 



COUNCIL 



M Luwia (Indonesia) 
C Maujnsqn (UK) . 
LH. Marches] (Brazil) 
HJvMickelson(USA) 
K. Nilsspw (Sweden) I 
SL Omar (Egypt) I 
T. Phiup (France) . [ 
R RDKHsoW, Past President (Israel) 



Y. Saloojeb (South Africa) 
H. Sancho-Garnier (France) 



RJ. SorwETTzat (USA) 
O. Sorhde (Norway) 
JC Tajima (Japan) . 
W. Weber (Switzerland) 
S. Wilkinson (UK) 
D.Zacx5(USA) 
D. Zaridze (Russia) 
Y. Hui Zhang (China) 
M. Zrv (Israel) 
H. zuk Hausen (Germany) 



UICC, URL: http^/www.nicaorg 
UICC, E-mail: info@uicc.org 



I 



- The International Union Against Cancer (UICC) is devoted exclusively to all aspects of the worid-wide fight against cancer. Its objective* arc to advance scientific 
and medical knowledge in research, diagnosis, treatment and prevention of cancer, and to promote all other aspects of the campaign against cancer throughout the world. 
Partial lax emphasis is placed on professional and public education. ■ j 

Founded in 1933, the UICC is a BoiHgovenuDCDtal, independent associ ation of mare than 290 member organizations in over 80 countries. Members are vohratary 
cancer leagues and societies, cancer research and/or treatment centers, and in some countries ministries of health. 

The UICC is non-profit, non-political, and Don-sectarian. Its headquarters are in Geneva, Switzerland; It creates and carries pot programs around the world in 
collaboration with hundreds of volunteer experts. Supported by member ship dues, national subscriptions, grants and donations, its annual budget is about US $4 million. 

The UICC is governed by its members which meet in General Assembly every 4 years; Its elected Council and Executive Committee are responsible for Program 
structure and implementation. | 

The UICC organizes an Imemational Cancer Congress every 4 years as weJJ as annual symposia* workshops, and training courses. It publishes the lnicmanonn} . 
Journal of Cancer (30 issues per year), UICC News (quarterly), \hcjni emotional Calendar of Meetings on Cancer (bi-annuaJlyX and a number of technical reports, 
textbooks and manuals. . 



SUBSCRIPTION INFORMATION 



© 20Q5 TVaey-Iiss, Inc., a Wiley Company. AD rights reserved. No part of this publication may 
be reproduced in any form or by any means, except as permitted under section 107 or 108 of 
the 1976 United States Copyright Act, -without either the prior written permission of the 
publisher, or authorization through the Copyright Clearance Center, 222 Rosewood Drive, 
Danvers, MA 01923, telephone: (978) 750-S400, fair (978)750-4470. Requests to the publisher 
for permission should be addressed to the Permissions Department, c/o John Wiley & Sons, 
Inc., Ill River St„ Hoboken, NJ 07030. Fax: (201) 748-6008; Tel.: (201) 748-6011; hltpJ/ 
r.wfley.conygWpermissions. j 



International Journal of Cancer (Print ISSN 0020-7136; Online ISSN 1097-0215) is 
published 30 tiroes a year, semi-monthly with extra issues in January, March, May, July, 
September, and November, by Wiley-liss, Inc., through Wiley Subscription Services, Inc., a 
Wiley Company. Send subscription inquiries in care of John Wiley & Sons, Inc., Attn: 
Journals Aibniu Dept UK, 111 River St, Hoboken, NJ 07030, (201) 748-6645. | 
Advertising inquiries should be addressed to Advertising Department, c/o John Wiley & 
.Sons, Inc. 11 1 River SL, Hoboken, NJ 07030. Telephone: (201) 748-6921. | 
Offprint sales and inquiries should be directed to the Customer Service Department, in 
care of John Wiley & Sons, 111 River St., Hoboken, NJ 07030. Telephone: (201) 748-8776. 
Subscription price: Volumes 113-337, 2005 t30 issuesX Print only. $2 ; 905 worldwide. 
Electronic only: $2,905 worldwide. A combination price of $3,195 worldwide includes the 
subscription in both electronic and print formats. A special personal- rate is available to 
individuals foT $295 worldwide. All subscriptions containing a print element, shipped outside 
the U.S., will be sent by air. Payment must be made in U.S. dollars drawn on U.S. bank. 
Periodicals postage paid at Hoboken, NJ and at additional mailing offices. Postmaster, send 
address, changes to INTERN ATION AL JOURNAL OF CANCER* Subscription DistribytioTj, 
c/o John Wiley & Sons, Inc., Ill River St, Hoboken, NJ 07030. Change of address: Please 
forward to the subscriptions address listed above 6 weeks prior to move; enclose present 
rnsDing label with change of address. Claims for missing, issues: Claims for undelivered 
copies will be accepted only after the following issue has been received. Please enclose a 
mailing label cr cite your subscriber reference number. Missing copies "will be supplied when 
losses have been sustained in transit and where reserve stock permits. Send claims in care of 
John Wiley fic Sons,. Inc., Attn: Journals Admin Dept UK. Ill Riyer St., HobokenJ NJ 07030. 
Indexed by: EMBASE/Excerpta Medica • Current Contents/Life Sciences * Science Citation 
Index • Scisearch • BlOSlSData Base ■ Index Medicus * Cambridge Scientific Abstracts • 
Chemical Abstracts -* Reference Updste • Smoking and Health Database. 



This journal is printed on acid-free paper. 



f 



INTERNATIONAL JOURNAL 
OF CANCER 
2005 Wiley-Liss, Inc. . 

The International Journal of Cancer is 
published for the International Union 
Against Cancer by Wiley-Liss, Inc., a di- 
vision of John Wiley & Sons, Inc. Five 
volumes are issued annually, each consist- 
ing of six numbers. 

Abstracting, and other journals may re- 
print the summaries of articles without re- 
questing authorization. Authors alone are- 
responsible for views expressed in signed 
articles. The mention of specific compa- 
nies or of certain manufacturers' products 
does not imply that they are endorsed or 
recommended by the Intern ation a) Union 
Against Cancer. 



References: When quoting from the 
International Journal of Cancer, 
please use the official abbreviation: 
}nt. J. Cancer 



I 



10 3: J Med Viro l. 2001 Aii r 7nf4V(C|.n'fi n 

( ait frsr aviu^s - g dn £> /UW -° 1 U 6 v Related Articles. Links 



SSr^ indUCCd by dCUgUe ™ of hum aa 

induced by dengue viSs Son nf? the P 4 * 0 **"** of hemorrhage 

produchon of tissue plasminogen activatarVtPA^ i • Ul ^ cMon 0n the 

(PAI-l)in vitro ^boA^^^^^^fT^ activator inhibi ^ 1 
cells, and a human microv asXendo ^^^^^V 0 ^ °°« Veins 
the secretion of tPA but not PA tl of J Z.n ? V , nfechofl significantly induced 

endothelial cells was^l^ 

but not control antibody inhibited DV-indu^K J J ^ Antlbod y a g*nst IL-6 
PMID: 12794725 (PubMed - indexed for MEDLINE] 



488: Mod Pathol. 1992 May-,5(3):250^6. ^ated Arto.es. Links 

Neu oncogene expression in ovarian tumors: a quantitative study. 
HuMfaerPC, CameyJWP, NabcrSP, DeLelHsRA, Membrino W. Wolfem 
DepartmeatofPathology.TufeUniversitySchoolofMedicine.Massachusete 

St^lSr a nr A eX T S :°1 by S,0t W0t P"*™ Product expression by 

capture ELISA and immunohistochemtstry in 57 primary and metastatic ovarian 
neoplasms, two paraovarian leiomyosarcomas, and eight normal ovaries. Some 61% of 
ovarian tumors but none of the paraovarian neoplasms or normal ovaries overexposed 

T,T A Ql96% 0f 016 ovarian «««« ^t overexposed neu were of . 
epithelial type Epithelial ovarian tumors had significantly higher amounts of the neu 
oncogene product as determined by capture ELISA than either germ cell and stromal 

'S^^^^-^^ 0253 - Differe ^types of ovarian carcinomas . 
had significantly different amounts of neu oncogene product as measured by capture 
ELISA; endomeHioid rumors had the highest, and poorly differentiated carcinomas not 
otherwise specified had the lowest (p less than 0.025). ELISA values mRNA 
overexpression, and immunohistochemical staining intensity did not correlate with stage : 
at r diagnosis or architect** or nuclear grade in ovarian rumors. We conclude that capmre 
fcLISA is a simple, effective way to measure the neu oncogene protein product and that 
there is a good correlation between ELISA levels and immunohistochemical staining 
intensity. However, ELISA values did not correlate with stage or histologic prognostic 
factors in ovarian neoplasms. . 
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Real-time quantitative RT-PCR of cyclia Dl niRNA in mantle cell 
lymphoma: comparison with FISH and immunohistochemistry; 

HuiP, HoweJG, Crouch J, Nimmakavatu M . Qumsiveh MB. Tallini a Flvnn SD 
Smith BR . ' ' 

Department of Laboratory Medicine, Yale University School of Medicine 333 Cedar 
Street, P.O. Box 208035, New Haven, CT 06520-8035, USA.' ' 

Presence of the balanced translocation t(l 1 ; 14)(q 1 3;q32) and the consequent 
overexpression of cyclin Dl found in mantle cell lymphoma (MCL) has been shown to be 
of important diagnostic value. Although many molecular and imrmmohistochemical 
approaches have been applied to analyze cyclin Dl status, correlative studies to compare 
different methods for the diagnosis of MCL are lacking. In this study, we examined 39 
archived paraffin specimens from patients diagnosed with a variety of 
lymphoproliferative diseases including nine cases meeting morphologic and 
immunophenotypic criteria for MCL by: (1) real-time quantitative RT-PCR to evaluate 
cychn Dl mRNA expression; (2) dual fluorescence in situ hybridization (FISH) to 
evaluate the t(l 1 ; 1 4) translocation in interphase nuclei; and (3) tissue array 
immunohistochemistry to evaluate the cyclin Dl protein level. Among the nine cases of 
possible MCL, seven cases showed overexpression of cyclin Dl mRNA (cyclin Dl 
positive MCL) and two cases showed no cyclin Dl mRNA increase (cyclin D 1 negative 
"MCL-like"). In six of seven cyclin Dl positive cases; the t(l 1;14) translocation was 
demonstrated by FISH analysis; in one case FISH was unsuccessful. Six of the seven 
cyclin Dl mRNA overexpressing cases showed increased cyclin Dl protein on tissue 
array unmunohistochemistry; one was technically suboptimal. Among the two cyclin Dl 
negative. MCL-like cases, FISH confirmed the absence of the t(l i; 14) translocation in 
both cases. All other lymphoproliferative diseases studied were found to have low or no 
cyclin Dl mRNA expression and were easily distinguishable from the cyclin Dl 
overexpressing MCLs by all three techniques. In addition, to confirming the need to 
assess cychn Dl status, as well as, morphology and immunophenotyping to establish the 
diagnosis of MCL, this study demonstrates good correlation and comparability between 
measure of cyclin Dl mRNA, the 1 1 ;14 translocation and cyclin Dl protein. 
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Modulation of glucagon receptor expression and response in transfected 
human embryonic kidney cells. 

tkegami T , Cypess AM , Boiiscarcl B . 

Department of Medicine, George Washington University Medical Center, Washington 
District of Columbia 20037, USA. 

The modulation of glucagon receptor (GR) expression and biological response was 
investigated inhuman embryonic kidney cell (HEK-293) clones permanently expressing 
the GR with different densities. The GR mRNA expression level in these clones was 
unregulated by cellular cAMP accumulation and presented a good correlation with both 
the protein expression level and the maximum number of glucagon binding sites 
However, the determination of glucagon-induced cAMP accumulation in these cell lines 
revealed that the enhancement of receptor expression did not lead to a proportional 
increase in cAMP formation. Under these conditions,.the maximum cAMP production 
induced by NaF and forskolin was not significantly different among selected clones 
regardless of the receptor expression level. High receptor-expressing clones showed' the 
greatest susceptibility for agonist-induced desensitization compared with clones with 
lower GR expression levels. The results of the present study suggest that the GR can 
recruit non-GR-specific desensitization mechanism(s). Furthermore, the partial inhibition 
or alteration of the overall cAMP synthesis pathway at the receptor level may be a 
necessary adaptive step for a cell in response to a massive increase in membrane receptor 
expression level. H 
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Developmental regulation of acidic fibroblast growth factor (aFGF) 
expression in bovine retina. 

Jacquemin E , JgnetL, OMyerL, BugraK, WentM, CourtoisY, JeannxJC. 

Unite de Recherches Gerontologiques, U: 118 INSERM, Paris, France. 

Acidic fibroblast growth factor (aFGF) is a signalling molecule implicated in a wide 
variety of biological processes such as ceH growm/ differentiation and survival It has 
been purified from bovine retina: The present study was carried out to detect which cells 
ui the bovme retina expressed aFGF at the different stages of embryonic and post-natal 
development. The specific aFGF mRNA and protein were detected by in situ 
hybridization employing riboprobes and immunocytochemistry using affinity purified 
polyclonal human recombinant aFGF antibodies respectively. No signal was detected by 
either technique until 4-5 months and then there was progressive expression of aFGF 
with terminal morphogenesis of the retina. By 8-9 months of embryonic development 
nuclei of the 3 neuronal layers (ganglion cell layer, inner and outer nuclear layers) were 
all uniformly and intensely labeled. A slight labeling of the pigmented epithelium of the 
retina was ako visible throughout development and maturation. These results showed a 
good correlation between message and protein expression in these cell types. In contrast 
glial cells m the nerve fiber layer and vascular endothelial cells displayed a nuclear 
ltnmunostaining for the protein in the absence of message. These data suggest that aFGF 
plays a role m the late steps of retinal differentiation by autocrine and paracrine 
mechanisms. 
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The p21(Cipl) protein, a cyclin inhibitor, regulates the levels and the 
intracellular localization of CDC25A in mice regenerating livers, 

Ja i me Pujol MJ , Serratosa J; Pantoja C Canela N. Casanovas <X Serrano M. 
Agell N, Bachs O . 

. Department of Cell Biology and Pathology, Faculty of Medicine, Institut d'Investigacipns 
Biomediques August Pi Sunyer (IDIBAPS), University of Barcelona,' Barcelona, Spain. 

Liver cells from p21(Cipl-/-) mice v subjected to partial hepatectomy (PH) progress into 
DNA synthesis faster than those from wild-type mice/These cells also show a premature 
induction of cyclin E/cyclin-dependent kinase (CDK) 2 activity. We studied the 
mechanisms whereby cells lacking p21 (Cip 1) showed a premature induction of this 
activity. Whereas the levels of CDK2, cyclin E, and p27(Kipi) were similar in both wild- 
type and p2 1 (Cip 1-/-) mice, those of the activator CDC25A were much higher in 
p21(Cipl-/-) quiescent and regenerating livers than in wild-type animals. Moreover, 
p21(Cipl-/-) cells also showed a premature translocation of CDC25 A from cytoplasm 
into the nucleus. The ectopic expression of p21(Cipl) into mice embryo fibroblasts from 
. p21 (Gipl-/-) mice decreased the levels of CI)C25A and delayed its nuclear translocation. 
The levels of CDC25A messenger RNA in p21(Cipl-/-) cells were higher than in wild- 
type cells^ suggesting that this- increase might be responsible, at least in part, for the high 
levels of CDC25A protein in these cells. Thus, the results reported here indicate that 
p21(Cipl) regulates the levels and the intracellular localization of CDC25A! We also 
found a good correlation between CDC25A nuclear translocation and cyclin E/CDK2 
activation. In conclusion, premature translocation of CDC25 A to the nucleus might be 
involved in the advanced induction of cyclin E/CDK2 activity and DNA replication in 
cells from animals lacking p21(Cipl). 
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Alteration of frizzled expression in renal cell carcinoma. 
Jansscns N, Andries L . Janicot M Pftg^raTr r^vir^ 

Department of -Biochemistry, University of Antwerp, Wilrijk, Belgium 
njansse9@prdbe.jnj.com e 

To evaluate the involvement of frizzled receptors (Fzds) in oncogenesis, we investigated 
mRNA expression levels of several human Fzds in more than 30 different humantoT 

ZT"*!!?^ n0rmal tis —P^, usingrea^e " 
8 oTn uZJ W f ** mRNA ,eveI of Fzd5 was ^edly increased in 

8 of 11 renal carcinoma samples whilst Fzd8 mRNA was increased in 7 of 1 1 renal 
^norna samples. Western blot analysis of crude membrane fra^l rtvliSlt 

fce1™^ A WjMi-atenin signaling pathway acUvation was confirmed by 
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protem expression was investigated in a broader panel of kidney tumor samplSTzdS 
membrane string was detected in 30% of clear cell carcinomas, and mere wata stone 
correlation with nuclear cyclin D I staining in the samples. Our data su^estedTal Sd 
expression of certain members of the Fzd family, andLr downsSSS coufd 
prov.de alternative mechanisms leading to activation of the Wnt signaHnSwT n 
renal carcmogenes.s. Fzd family members may have a role as a biomarker 
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Abstract 

To evaluate the involvement of frizzled receptors (Fzds) 
in oncogenesis, we investigated mRNA expression lev^ 
. els of several human Fzds in more than 30 differ- 
ent human tumor samples and their corresponding 
(matched) normal tissue samples,. using real-time quan- 
titative PCR. We observed that the mRNA level of Fzd5 
was markedly increased in 8 of 11 renal carcinoma sam- 
ples whilst Fzd8 mRNA was increased in 7 of 11 renal 
carcinoma samples. Western blot analysis of crude 
membrane fractions revealed that Fzd5 protein expres- 
sion in the matched turrior/nprmal kidney samples corre- 
lated with the observed mRNA level. Wnt/p-catenin sig- 
naling pathway activation was confirmed by the in- 
creased expression of a set of target genes. Using a kid- 
ney tumor tissue array, Fzd5 protein expression was 
investigated in a broader panel of kidney tumor samples. 
Fzd5 membrane staining. was detected in 30% of clear 
cell carcinomas, and there was a strong correlation with 
nuclear cyclin Dl staining in the samples. Our data sug- 
gested that altered expression of certain members of the 



Fzd family, and their downstream targets, could provide 
alternative mechanisms leading to activation, of the Wnt 
signaling pathway in renal carcinogenesis. Fzd family 
members may have a role as a biomarker. 

: ~~ Copyright O 2004 5. Kir ©ef AG, B»sH 



Introduction 

The Wnt signaling pathway is evolutionary conserved 
and controls many events during embryonic develop-, 
ment. Members of the Wnt gene family of secreted glyco- 
proteins are involved in embryonic induction, generation 
of cell polarity, cell proliferation and the determination of 
cell fate [J, 2 J. Recently, it has become evident that the 
Wnt pathway is also deregulated in a range of tumors [3]. . 

The Wnt signaling pathway is activated when Wnt pro- 
teins bind to a cell surface receptor complex consisting of 
a member of the frizzled receptor (Fzd) family and either, 
low-density-lipoprolein receptor-related protein (LRP)5 
or LRP6 [4, 5]. A detailed characterization of the Fzds. 
and the immediate downstream events after Wni binding 
has been hampered by the lack of pure biologically active 
Wnts. 

Downstream of the receptor complex, three pathways 
may be initiated, depending on the composition of the 
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ligand and receplor complex The 'Wut/p-catenin path- 
way*, the *Wnt/Ca 2+ pathway' or the VVnt poiariiy path- 
way' [6J. The Writ/0-catenin pathway has been iinked to 
carcinogenesis. Gtnetic alterations in components of this 
pathway (adenomatous polyposis coli, APC, axin and 0- 
catenin)can result in the accumulation ofnon-phosphory- 
lated 0-catenin [3, 7] and this can promote carcinogenesis. 
Conversely, neither the Wnt/Ca 3+ pathway nor the Wm 
polarity pathway involves the activation of P-catenin [for 
review, see ref. 1,6). 

Mutations in one pf the three regulatory genes (APC, 
p-catenin and axih), overexpression of Wnts and Fzds or 
the expression of a constitutively active Fid have been 
. linked to Wnl/P-catenin pathway activation in various 
tumors [8, 9], 

To evaluate the involvement of Fzds in oncogenesis, 
we investigated mRNA expression levels of several hu- 
man F2ds,(Fzd2, 3, 5, 6, 7, 8 and 9) in more than 30 differ 
ent human tumor samples using real-time quantitative 
PCR. Each sample was compared with its corresponding 
(matched) normal tissue sample. The most striking obser- 
vation was the dramatically increased Fzd5 and FzdS 
mRNA expression seen in the renal carcinoma samples. 
This was confirmed at the protein level using Western 
blotting. Kidney tumor tissue arrays confirmed Fzd5 
membrane staining in 30% of clear cell carcinomas, with 
nuclear cyclin Dl showing a strong correlation with the 
Fzd5 membrane labeling. Fzd8 protein expression analy- 
sis was not performed due to the lack of suitable reagents. 
These data suggest that Fzd5 may have a role in renal cell 
carcinogenesis due to its frequent overexpression obr 
served in these tumor samples. Potential future applica- 
tions could include uses in tumor targeting or as a poten- 
tial biomarkerl 



Materials and Methods 

Ttssite Samples 

Frozen tumor tissue samples wiih corresponding normal tissue 
from the same patient were derived either from human biopsy. or 
autopsy material (Depart men I or Pathology, University or Antwerp, 
kindly provided by Pror. E. Van Marck). Tissue specimens were 
snap-frozen in liquid nitrogen and kept at -S0°C until use. Frozen 
sections of kidney tumor and normal tissue samples were stained 
with hcmaioxyfin-eosin io support the pathologist's observations and 
to confirm the typeofkidney tumor. Pa rafTin-em bedded tissue slides 
of renal carcinoma, lung carcinoma, breast and colon carcinoma 
wcic obtained: after encryption, from the Department of Pathology 
(Middelbeim Hospital. Antwerp. Belgium). The CU human kidney 
cancer (SupeiBioChips Laboratories) tissue array used in this study 
contained 59 tissue samples consisting of 9 normal kidney tissues. 



30 clear cell ren3l carcinoma samples and another 20 renal cell tumor 
types (chromophil, chromophobe, papillary type, collecting duct car- 
cinoma and samples with mixed types)L 

RNA Isolation and Reverse Transcription 
Total RNA was extracted from tissue specimens using Ultraspec 
Reagent (Biotccx. USA) according to the manufacturers instruc- 
tions. All total RNA was routinely treated with DNase (DNA-free kit, 
Ambion, USA). I ug of total RNA was used to synthesize cDNA 
using oiigo-dT primers (Superscript; Jovitrogcn, Merelbeke. Bel- 
gium). Reverse transcription was performed at 42 ' C for 60 min, fol- 
lowed by 70* C for 1 0 min. 

Real-Time PCR 

Real-time PCR was performed on either an ABI Prism 7700 or 
7900 Sequence detection system (perkin-Bmer Applied Biosystenw, 
Foster City, Califs USA) using ihc 5* nuclease assay (Taqman™). 
Primer and probe sequences were designed using Primer Express (PE 
Applied Biosyslems) and are shown in table I. Quantitative values, 
were obtained from the threshold cycle number (Ct) at which the 
increase in the signal associated with exponential growth of PCR 
products is detected using PE Biosyst ems analysis software, accord- 
ing to the manufacturer's instructions. . • V 

We have used Ihc 2" A AO method to analyze the relative changes 
in gene expression of the different genes between tumor and corre- 
sponding normal tissue samples. We used the mitochondrial ATP 
synthase 6 (ATPsy6) as the endogenous RNA control p 0; Janssens et 
ah, in prep.}, and each sample was normalized to its ATPsy6 content. 
The relative expression of the target gene was also normalized to the 
corresponding normal tissue sample (Calibrator). Results, expressed . 
as the amount of target sample relative to the ATPsy6 gene and the 
calibrator, were determined as" follows, N = 2- ,6Cmmpk ~ ACmJibrater^ 
where the ACl values of the sample and calibrator were determined 
by subtracting the average Ct value of the sample and the calibrator 
from the average Ct value of the ATPsy6 genel Amplification \vas 
done essentially as described previously JI0J. Briefly, 50 pi of reac- 
tion mixture containing I pi of c£)NA template were amplified as 
follows; incubation at 50 TC for 2 rain, denaturalion at 95 'C for 
10 min, and 50 cycles at 95 *C for 15 s and 60 *C for r min. 

Membrane Preparat ion, Gel Electrophoresis and Jrnmvnobtotting 
Tissue samples were weighed, suspended at a 40 times dilution 
1= 40 volumes/original wet weight of tissue (v/w)) in 50 mAI Tris- 
HCI buffer, pH 7.4, and homogenized with an UJlra-Turrax homoge- 
nizes After centriruga lion for l0.min,24,000£ai4*C, the pellet was 
washed three times by resuspension in the Tris-HO buffer followed 
by centrifugal ion. The final membrane pellets were stored at -80 P C 
in the Tris-HCI buffer at a concentration of 0.5-1 rog/ml. The Bra oV 
ford protein assay (Pierce, Aalst, Belgium) was used Tor protein deter- 
mination. Proteins (50 \xg) were separated. by 8% SDS-PAGE and 
transferred to nitrocellulose membranes. After primary and second- 
ary antibody incubation, the aniigen-aritibody-peroxidase complex 
was detected by cherriiluminescence (Pierce, Aalst, Belgium) accord- 
ing to the manufacturer's instruct ions. 

Immimohistochemxstry 

Immunohistochemislry was performed on 10-pAAihick cryoscc- 
tions pf unfixed tumor tissue and on 6-u.V-ihick paraffin "sections 
from renal tumor tissue fwed by formalin or by an alcohol-based fixa- 
tive. Adjacem tissue blocks from renal tumors were processed with 
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Table 1. Real-time PCR primer ond probe sequences 



Target cDNA ; Primer/probe sequences* 


Fragment 
position* 


Accession 
No* 


FZD2 


(a) 5*-aicccgtgcccggc-3* 

(b) y-giauigatcaiglagaccglgaagic-3' 

(c) 5'-FAM-tacacgccgcgcalg!cgc-TAMRA-3' 


1,345-1,013 


AB017364 

♦ 


FZD3 


(a) 5*-!c3cgccagtgcatggg-3' * 

(b) 5'-ugicacctlcaatt»altcatcg-3* 

(c) 3'-FAM<aiccccggaac(ct3accaicatcciitt.TAMRA-3* 


1,473-1547 


- AB039723 


. F2D5 


(a) S'-lcccaacciracttcrMit-V 

(b) 5'-tciccaag(cgccgcg-3' 

(c) 5'-FAM<cttcatggigctgagccccc-TAMRA-3(' 


2, 143-2,204 


HSU433I8 


FZD6 


•* ^*"5Lutcccvaggiiaagagaa-j 

(b) S'^ccagagagiclggagaiggai^ 

(c) 5'-FAM-t^ggtj^acc»gcclcgccag-TAMRA-3' 


2,094-2,J70 


AF072873 - 


FZD7 


(a) 5*-cctglggaaaggcataacigtg-3* 

(b) J*-aaccaacgggaaaccicaga-3' 

(c) 5'-FAM-aa^caacnitataggcaaagcagCgcaa-TAMRA-3' 


2,687-2,762 


AB017365 


vryr\o 
r LaJq ■ 


(a) 5 -igiBglcggtgcictgcll-3' 

(b) 5*-cgctccatgtcgataaggaag-3' 

(c) 5'-FAM-ccacc»cgccaccglctcca-TAMRA-3* 


853-919 


. AB043703 


TZD9 


(a) 5'-ccccgggagclacggacO' r 

(b) 5'-iaglcalgtgcaagacc3cgg-3' 

(c) 5'-FAM-tggcacgcactgccactat3aggct-TAMRA-3* - . 


1,696-1,763 


HSU82I69 


ATPsy6 


137 j ~ggigia£gtglBccii£tggi-3 
(°) 5 -gggcgcaglgaUataggcil-3' 

(c) 5 -FAM-aagigggctagggcaitiliaaiciiagagcg-TAMRAO' 


. 58O-503 


AF36827I 


c-myc 


(a) 5'-3ccaccagc3gcgacieiga-3' 

(b) 5'-tceagcagaaegtgatecagact-3' 

(c) 3 ; -FAM^acctHigccaggagccigcctci-TAMRA-3' 


U97 T I,413 


HSMYCI 


Cyclin Dl 


(a) 5'-gaacctggccgcaatgac-3*- 

(b) 5'-cgcciciggcailiigga-3' 

(c) 5'-FAM-ccgcacgailicaltgaacacit-TAMRA-3' 


4,148-4,211. 


AF5J 1593 


PPAR5 


. (a) 5'-agcatcctcaccggcaaa-3' 

(b) 5'-gtctcgatgicgtggatcaca-3' 

(c) 5'-FAM-ccagccocacggcgccct-TAMRA-3' 


932-990 


NM-006238 . 


3 (a) =.Scnse primer, (b) = anlisense primer, (c) = probe. 

! l?JS!^! W givCn ^ cordin « »° lhc HMBUGenBank accession No. of cloned sequence. 
EMBUGenBank accession No. of cloned sequence. * 



formalin and with the alcohol-based fixative. Paraffin and cryosec- 
lions were mounted on poly-Z-lysine or 3-aminopropy||riethoxys>- 
lane-gelaiin-coated slides. The 59 tissue samples on the CLI human 
renal cancer tissue array slides were, all fixed with formalin and 
embedded in paraffin, and ihe seciions were mounted on silanc- 
eoated slides (SuperBioChips Laboratories). In addition to renal car- 
cinoma tissue, sections from 10 formalin-fixed paraffin-embedded 
lung carcinomas were stained for F*d5. Colon and breast tumors 



were used as positive controls for 0-eatenin and cyclin Dl immuno- 
staining. 

The following primary antibodies were used: F2d5 (Upstate Bio- 
technology), P-caienin (Zymed), cyclin Dl (Zymed). E^cadherin 
(Novaccstra) and cytokeratin 8 (Biogenex). Cryosections were Fixed 
in 4% paraformaldehyde for 5 min; acetone for 5 min at -20*C and 
70% ethanol for 5 min. Endogenous peroxidase activity was 
quenched using3% H:<X Paraffin sections of formalin- and alcohol- 
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Table 2. Fzd mRNA expression in lumor samples 



Sample* 


Tissue 


Tumor lype 


.vfold expression increase** 
















FZD2 


F2D3 


FZD5 


FZD6 


- FZD7 


F7D8 


F7DQ 


133702 


kidney " 


* adenocarcinoma 


0.17 


0.5 


332 


2.13 


0.06 


1.32 




137770 


kidney 


renal eel) carcinoma 


1.23 


3J6 


836 


2.61 


1.3 


831 




138844 


kidney 


renal cell carcinoma 


0.31 


0.11 


6.84 


1.18 


0>23 


338 


2.8 


137146 


kidney 


renal cell carcinoma 


0.47' 


n as 


J.JU 


2.23 


0.73 


4.42 


1.37 


137564 


kidney 


. fciidi ten carcinoma 




J OS 
£.7 J 


O A 

y.o. 


•1.57 


7.64 


3.52 




..133408 


kidney 


renal cell carcinoma 




n os 


a to 


X..HQ 


*/>e 
J. is J 


JO. /J 




139)88- 


kidney 


renal celt carcinoma 




U.JO 


ft £A 


1 .J J * 




A At 




13569? - 


kidney 


renal cell carcinoma 


2.6 • 


0.36 


4.83 


ft 0 






1.5,1 


139064 


kidney 


renal ceJt carcinoma 


536 


1.82 


U5 


7 *A 


1 ft 

1.9 


2.19 


2.85 


. 134585 


kidney 


renal cell carcinoma ' 


1.47 


0.72 


1.38 


0.47 


0.09. 


0^6 




140279 


kidney 


jenal cell carcinoma 


733 


18.17 


3.93 


6.05 


6.41 


4.65 


7.91 : 


137252 


ovary 


t2rr»nr>mrmm*j 

Vat L (IJlwal LUI|)d 


0.4 


339 


0.49 


A A A 

0.44 


0.92 


0M - 


0.53 


138256 




yttytitdry carcinoma 


0.7 


537 


1.22 


2.19 


0:4 


0.1 1 


- 


146472 


ovarv 


serous pa pi nary carcinoma 


• 039 


339 


2.56 


1.34 




.0.67 


OJ? 


145845 


colon 


adenocarcinoma 


3.45. 


• 336 


0.58 


U2 


1.67 


1J24 


1.99 


146145 


colon 


adenocarcinoma . 


5.46 


634 


4.42 


637- 


.7. 3d* 


• 6.08 


0.15 


146630 


colon 


adenocarcinoma 


4.01 


433 


0.3 


1.16 


0.54 


035 


0_I2 - 


146633 


colon 


adenocarcinoma 


J. 87 


1.07 


1.62 


1.47 


. 0.51 


2.07. 




147055 


colon 


adenocarcinoma 


0.66 


1.41 


1.01 


1.33 


024 


.0.79 . 


4.87 


142253 


lung 


adenocarcinoma. 


a *7 
u.o / 


0.1/ 


U.43 


* 1.87 


O XX 




' ft A 1 


143036 


lung 


adenocarcinoma 


0-67 


1.24 


2.63 


1 11 


ill 




y.Jy 


.138938 


. lung 


adenocarcinoma 


0.93 


0.88 


1.07 * 


i -i 
i 


l./J 


ft in 


U./D 


133563 


Jung 


adenocarcinoma 


2.76 


1.23 


0.31 


1.34 


0.99 


0.78 


4.97- 


144387 


lung 


■*. adenocarcinoma 


12.85 


0.43 


0.49 


0.25 


2.73 


0.54 




137304 


lung 


acinary adenocarcinoma 


0.54 . 


935 


1.65 


1.25 


0.63 


341 


2.64 


M4546 


lung 


epithelial carcinoma 


O.) 


0.67 


0.2) 


83 J 


0.27 . 


0.97 


1.32 


13762) 


lung 


epithelial carcinoma 


1.52 


2.19 


.0.37 


236 


0.52 


0.45 


1.06 


145552 


lung 


epithelial carcinoma 


1.09 




0.16 


0.91 


1.47 


0.13 


03 


143987 


testis 


embryonal carcinoma 


0.24 


0.6 


19.43 


1.33 


2332 


2.43 


033 


137332 


stomach 


leiomyoma 


1932 


3.91 


0.45 


1.53 


18.98 


32.82 


163 


139026 


stroma 


gastrointestinal carcinoma 


663 


331 


0.03 


6.67 


8.44 


2.28 . 


20.82 


136049 


rectum 


adenocarcinoma 


J .71 


0J2 


0.4 


0.54 


0.29 


0.88 


028 


140794 


gall bladder 


adenosquamous carcinoma 


0.19 




0.93 


3.02 


0.2 







* Sample identification numbers were given by the pathologist. 

b Results are expressed as x-fold increase of the gene in the tumor tissue sample compared to its matched normal tissue sample after 
normalising both samples on the basis of their. ATPsy6 content. A cutoff of 3-fold was used to de/ine differential expression. Significant 
(> 3-fold) increases in the expression level of the Fid recepior arc shown in italics. - = Expression of the target gene undetectable in one or both 
. samples (rumor and/or normal). 



fixed tissue were processed with a irypsm-riiratermicrowave pre- 
treatment or with an EDTA- microwave pre treatment to unmask epi- 
topes, respectively. Sections were then sequentially processed with 
primary antibodies, broiiny toted secondary antibodies and streptav- 
idin-biotin-peroxidase <F2d5. E-cadherin and cyiokeratin 8). For 0- 
caienin. polyclonal rabbit antibody with the En Vision detection sys- 
tem (DAKO) was used. The slides were further developed using 3- 
aminr>9-eihylcarba20Je. count erstaine'd wiih hemalaun and mount- 



ed with glycerin, gelatin. Stained sections were observed with an 
Axioplan 2 microscope equipped with an Axiocaro digital camera. 
Staining intensity for 0-caicnin was scored as no staining (value 0), 
weak and fragmentary staining of cell membranes (valve 1), moder- 
ate membrane staining of less than 50% of the tumor cells (value-2). 
moderate membrane staining of more than 50% of tumor cells (value 
3) and strong membrane staining of more than 75% of tumor cells 
(value 4). The cvdin Dl staining was quantified as a percentage of 
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Ivmot Biot 200-1:25:161-1 7| 



JanssensMndries/Jankroi/Perera/Bakker 



f*9* 1. Fzd5 protein expression in matched 
tumor/normal kidney samples. T - Tumor 
sample; N = maiched normal sample. Sam- 
ple identification numbers are given by ibfc 
pathologist. 



139188 135699 139064 134585 140273 

T T N T M T N T N 




FzdS 



Actin 



cyclin Dl-immunoreactjve nuclei m tumor cells in three fields (area: 
1 8,64 1 pro 7 ) or each tumor sample; The tojal number of tumor nuclei 
ranged from. 5 J lo 164. The correlation between Fzd5 and 0-caienin 
staining and between Fzd5 and cyclin Dl staining was evolved by 
ihe Mann-Whitney U test. 



sion level was observed to be relatively tow in these lung 
tissues compared to the other tissues investigated 



ResuHs 

Fzd mRNA Expression in Maiched Human 

Tumor/Normal Tissue Samples 

Fzd expression in tumor tissue was compared with Fzd 
expression in matched normal tissue samples and normal- 
ized to the expression of the housekeeping gene mitochon- 
drial ATPsy6 (tabJe2). A 3-fold increase was considered 
significant. . 

In the kidney tumor samples, in which 10 ofl 1 sam- 
ples were dear cell carcinomas, Fzd5 was upregulated in 8 
of the J 1 samples. A similar observation was made for 
Fzd8 and Fzd2, which were upregulated in 7 and 5 renal 
tumor samples, respectively. None of the -other Fzds 
showed consistent upregulation. 

Both Fzd2 and Fzd3 were upregulated in 3 of 5 colon 
adenocarcinoma samples. No other Fzd expression was 
significantly different compared to the normal colon tis- 
sue sample. Fzd3 showed an increased expression in all 3 
ovarian carcinoma samples. Fzd expression was not al- 
tered in any of ihe lung tumor samples. The Fzd expres- 



Western Blot and ImMunohistochemistry Analysis on 
Renal Carcinomas 

Western blotting was used to evaluate Fzd5 protein 
expression in the renal tissue samples used for mRNA 
expression analysis. Membrane fractions of the renal car- 
cinoma and corresponding normal tissue samples were 
prepared. As previously shown (table 2), Fzd5 mRNA 
upregulation was detected in 8 of the* I J matched tumor/ 
normal samples. Increased expression of Fzd5 protein 
was seen in membrane fractions from 9 of I I samples 
(fig. I). In most cases, concomitant increases in Fzd5 
mRNA and protein levels were observed. 

Hematoxylin-eosin Staining of the cryosectioned' tu- 
mors confirmed the presence of clear cell carcinoma. 
Fzd5 immunostaining in clear cell carcinoma (fig. 2a, b) 
was observed to be localized to cell membranes and to 
nuclei. Cytokeratin 8 (fig. 2c) and E-cadherin (fig. 2d) 
were also detected. E-cadherin labeling of ceil membranes 
in clear cell carcinoma was Jess intense and patchy com- 
pared to epithelial cells of normal renal tissue. p-Catenin 
staining was confined to the cell membrane. 0-Caienin 
levels in the clear cell carcinoma membranes were highly 
variable. Nuclear 0-catenin staining was not observed in 
any of the samples. Epithelial cells in nprmal renal tissue 
showed intense membrane staining and some cytoplasmic 



Staining. In addition, weak p-catenin staining of endothe- 
lial cells was observed. A high number of cyclin Dl- 
immunoreactive nuclei was observed in cleared) carcino- 
ma (fig. 20- 

On the CL1 human kidney cancer tissue array, 30% 
(n = 9) of the clear eel) carcinoma tumor samples (n = 30) 
showed Fzd5 immunoreactivity (fig. 3a). Membrane-as- 
sociated >catenin staining was observed in 33% of the 
FzdS-positi.ve tumor samples and 57% of Fzd5-negative 
clear renal cell carcinoma samples (table 3; fig. 3c, dX 
Again, nuclear P-catenin staining was never observed. 
Statistical analysis did not reveal a difference in the 
expression of 0-catenin between FzdSrposiiive and F2d5- 
negative tumor samples (fig. 4a). 

Nuclear cyclin Dl was observed in ; 89% of the Fzd5- 
positive clear ceil carcinoma samples (table 3; fig. Je). 
Only 38% of the Fzd5-negative clear cell carcinoma sam- 
ples; contained nuclear cyclin Dl. Statistical analysis' 
showed a significantly higher cyclin Dl expression in 
Fzd5-positive compared to Fzd5-negative tumor samples 
(fig. 4b). 



Expression of PPAR5 was investigated because it repre- 
sents a direct target of the P-catenin pathway with T cell 
factor binding sites in its promoter. Expression of c-myc - 
was found to be upregulated in 7 of J 1, whilst cyclin Dl 
was upregulated in 10 of 1 1 kidney tumor samples (ta- 
ble 4). PPAR5 was upregulated in 9 cases. AH three select- 
ed target genes showed a marked upregulaijon in the 
• majority of renal tumors, which suggested that the Wnt/ 
p-catenin pathway was activated in these samples; 



Discussion 

F2d family member overexpression has been postu- 
lated to play key roles in different tumor types such as 
esophageal carcinoma [11], gastric cancer [12] and head 
and neck squamous, cell carcinoma [13], The current 
study evaluated the potential implication of F2ds as 
tumor-associated antigens in different tumor types. We 
screened a number of matched normal/tumor tissue sam- 
ples for the expression of a variety of Fzds using real-time 



c-myc { Cyclin Dl and Peroxisome 
Proliferator-Actiyated Receptor 8 Expression in Renal 
Carcinomas 

WnVp-catenin.pathway activation in the kidney tissue 
samples was investigated looking at the expression of a 
number of target genes, which have previously been 
shown to be upregulated when the pathway is active. 
Gene expression of c-myc, cyclin Dl and peroxisome 
proliferator-activated receptor 5 (PPAR5) was analyzed. 
Increased expression of both c-myc and cyclin Dl genes 
have been implicated in cell proliferation, and carcino- 
genesis, and they represent two of the more important and 
closely studied target genes of the Wnt signaling pathway. 



quantitative PCR. 



Rg. 2. Distribution of FzdS (a, b), cyioktnt 'm S (c), E-cadhenn (d) 
and ^catcnin (©) immunoreactiviiy i n paraffin sections from a renal 
tumor processed by an alcohol fixative. From the same tumor/a for- 
malin-fixed block was used for cyclin D\ immuoosiaming (f). Fzd5 
immunostaming shows distinct immunoreaciiviry in cell membranes 
and in nuclei of clear cell renal carcinoma. Clear cells are immunore- 
active for cylokeratin 8. A-Caienin and E-cadherin staining of mem- 
brane? is rather weak, and. not uniform, in clear cell renal carcinoma. 
Nuclear 0-catenin immunoreaciiviiy was not observed. In clear cell 
renal carcinoma, many nuclei showed cyclin 0| immunoreactivity. 
The inset in f shows a detailed view of the cyclin Dl labeling of nuclei 
' in clear cell renal carcinoma. 



Resujts obtained revealed that both Fzd5 and Fzd8 
mRNA were overexpressed in the majority of renal carci- 
noma samples when compared to the matched normaJ 
kidney samples. Fzd2 and Fzd3 were upregulated in 3 of 5. 
colon adenocarcinoma samples. Fzd3 was also upregu- 
lated in the ovarian tumor tissue samples compared to the 
matched normal tissue samples. None of the other F2ds 
evaluated showed a specific differential expression pat- 
tern in any of the samples studied. Fzd5 and Fzd8 show 
69. 1 % similarity and belong to the same subgroup of Fzds 
(14], The significantly higher expression of Fzd5 and. 
Fzd8 in the renal tumor samples, as compared to the nor- 
mal renal samples, suggests a higher probability that this - 
subgroup may be implicated in the progression of renal 
cancer. Therefore, we decided to further examine the pos- * 
sible role of Fzd5 in renal carcinoma. 

We observed, using Western Wotting, that protein lev- 
els were mostly consistent with mRNA levels in the tumor 
samples. Jn order to be able to determine the F2d5 expres- 
sion in a broader range of kidney tissues, we utilized a 
tissue array. Fzd5 membrane staining was detected in 9 of 
30 (30%) clear cell carcinomas, and importantly, mem- 
brane staining was not detected in the matched 9 normal 
kidney tissue samples. 

Since the Wnt signaling pathway appears to play an 
important role in embryonic development, in particular 
embryonic kidney induction [J 5, 16], activation of this 
pathway in the adult kidney due to mutation or overex- 
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Table 3. Correlation between Fzd5 and 
P-catenin or cyclin D J expression 

F*d5,% 



ft-Caienin + 
^Catchin- 
CycJin Dl + 
Cyclin/ Dl - 



33 
67 
89 
II 



57 
43 
38 
62 



pression or one of the components of (he pathway could 
te a germmingfactorm the development of renal can- 
cm Therefore, several studies have Joolced into the possi, 
ble function the Wnt/^catenin pathway plays in ra>al 
carcinogenesis APC gene mutations have been demon 
strated not to be involved in renal carcinorria [17,181. In 
addition p-catenin mutations are rare events in renal car. 
cmoma f 19, 20]. Nevertheless, cytoplasmic accumulation 
of 0-catenm has.been reported in a number of renal cell 
carcinomas [19], and thus the Wnt signaling pathway 



T,ble4,Wnt/p<atenin target ^nernR^^ ?on in lumor5ampics 
Sample Tissue Tumor type 



133702 
137770 
138844 
137146 
137564 
133408 
139M 
135699 
139064 
134385 
-14027? 



kidney 
kidney 
kidney 
. kidney 
kidney 
kidney 
kidney 
kidney 
kidney 
kidney 
kidney 



adenocarcinoma 
renal ceU carcinoma 
renal eel) carcinoma 
renaJ cell carcinoma 
renal cell carcinoma 
renal cell carcinoma 
renal cell carcinoma 
renal cell carcinoma 
renal cell carcinoma 
renal cell carcinoma 
renaJ cell carcinoma 

> identification numbers were fiiven by the pathologist 

fTPsyS content. A cutoff of £35 ^ ^ ^ 22 ^ ^ '* ° f 
t> W*Q increases in the cession X^,^ 



0.54 
J 3.9 
2.39 
7.62 
33.82 
7.8 
2.22 
12.18 
22.11 
1.79 
61.68 



4.52 


0,52 


28.91 


J.53 


31.49 


7.48 


1S.38 


3.15 ) 


1965 


8.65 


8.92 


4.86 


9.92 


11.67 


22.73 


5.53 . 


5.04 


. 6.33 


1.14 


1.67 


54.95 


14.62 
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H9. 4. Box plot charts (thick black line = 
median) illustrating the relationship be- 
tween FtdS irnrouiKwlaining and p^catenin 
f»and cyclin Dl (») expression in clear cell 
renal carcinoma. No significant correlation 
was observed between ibe p< alt „i„ scores of 
FzdS-posmve and -negative clebr cell renal 
caronorna. Nuclear cyclin. Dl $ , a i„j„j in 
clear cell renal carcinoma showed a stenifi- 
canl.d.iTerence between ftdj-positive and 
"dS-negative tumor samples. 
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might act as an inducer of m'morigenesis In the kidney. 
This view is supported by the observation that aberrant 
activity of the Wnt signaling pathway has been reported in 
renaJ-cancer-derived ceJ) lines. Zang et a). [2 J ] observed a 
higher expression level of Wnt5a and Fzd5 roRNA in ihe 
renal cancer cell line GRC-1 than in the normal renal cell 
line HK-2. Expression of 0-cateniri was aJso higher in 
GRC-1 than in HK^2. 

To determine the status of the canonical Wnt signaling 
. pathway in our renal carcinoma samples, we have quanti- 
tated the mRNA levels of three important target genes of 
T cell factor/lymphoid enhancer factor activation by 
potenin. The mRNA levels of these three target genes 
■ (c-jnyc, cyclin Dl and PPAR 5) correlated largely with the 
* expression, of F2d5 in these samples, suggesting that the * 
canonical pathway Is activated- On the kidney tissue 
array, cyclin Dl protein expression showed a highly sig- 
nificant correlation with the Fzd5 expression in the tumor 
samples (table 3). Cyclin Dl prolein is frequently. overex- 
pressed in various tumors, but in only aproportion of the 
cases is it due to amplification of the cyclin Dl gene [22J. 
Therefore, other mechanisms such as upregulatiori of gene 
. transcription may play a substantial role in the overex- 
pression of cyclin Dl [23-26]. Our data, showing in- 
creased cyclin D 1 expression in renal carcinoma samples, 
are consistent with the results of Stassar et ai. [27], They 
studied genes that are associated with human renal carci- 
noma by suppression subtractive hybridization and re- 
ported 14 differentially expressed genes, including cyclin 
Dl. Although we would have expected an increased nu- 
clear 0-catenin staining, nuclear accumulation of p-cate- 



* nin was not observed in any of the tumors or on the tissue 
array. This result is consistent with the data presented for 
renal cell carcinomas by Kim et aL U9J. They did not 
detect nuclear p-catenin staining in the 52 renal cell carci- 
nomas examined. The lack of nuclear 0-catenin staining 
has also been reported by others in tumors that might 
have arisen from Wnt/p-catenin pathway activation f 28- 
31]. 

While expression of both Wnt5a and Fzd5 does induce 
duplication of the Xenopus head, exogenous expression of 
Fzd5 in a Xenopus model does not induce duplication of 
the head |32]. Fzd5 does not activate the p-catenin signal- 
mg pathway on its own, as the presence of its endogenous 
ligand is also required. Our results suggest that Fzd5 may 
have a role in renal cell carcinogenesis due to its frequent 
overexpression observed in these tumor samples, and we 
hypothesize that if Fzd5 is overexpressed, it has a rather 
limited effect on 0-catenin signaling. However, in the 
presence of its endogenous still unknown ligand, it acti- 
vates the canonical Wnt signaling pathway. The elucida- 
tion of this ligand and its binding characteristics is still 
under investigation. Ultimately, knowledge of the specific 
expression patterns of both Wnt and Fzd members could 
lead to directed tumor targeting or could be used as a 
tumor marker. 
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Erratum in: 

• Int J Cancer 2002 Feb 20;97(6):878. 

+55? (nicf Science' 

Immunohistochemical analysis of NY-ESO-1 antigen expression in normal 
and malignant human tissues, 

Jungbluth AA , Chen YT. Stockert E , Busam KJ , Kolb D, Iversen K , Coplan K. 
Williamson B , Altorki N, Old I LJ, 

Ludwig Institute for Cancer Research, Memorial SIoan-Kettering Cancer Center, New 
York, NY, USA.jungbIua@mskcc.org 

NY-ESO-1, a member of the CT (cancer/testis) family of antigens, is expressed in normal 
" testis and in a range of human tumor types. Knowledge of NY-ESO-1 expression has 
depended on RT-PCR detection of mRNA-and there is a need for detecting N Y-ESO-1 at 
the protein level. In the present study, a method for the immunochemical detection of 
NY-ESO-1 in paraffin-embedded tissues has been developed and used to define the 
expression pattern of NY-ESO-1 in normal tissues and in a panel of human tumors. No 
normal tissue other than testis showed NY-ESO-l reactivity, and expression in testis was^ 
restricted to germ cells particularly spermatogonia. In human tumors, the frequency 6f 
NY-ESO-l antigen expression corresponds with past analysis of NY-ESO-I mRNA 
expression e.g., 20-30% of lung cancers, bladder cancers and melanoma, and no 
expression in colon and renal cancer. Co-typing of NY-ESO-1 antigen and mRNA 
expression in a large panel of lung cancers showed a good correlation. There is great 
variability in NY-ESO-1 expression in individual tumors, ranging from an infrequent 
. homogeneous pattern of staining to highly heterogeneous antigen expression. Copyright 
2001 Wiley-Liss, Inc. 
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Multidrug resistance phosphoglycoprotein (ABCB1) in the mouse 
placenta: fetal protection. 

Kalabis.GM Kostaki A, Andrews MH. Petropoulos S. Gibb W. Matthews SG . 

Department of Physiology, University of Toronto, Ontario, Canada. 

The multidrug resistance phosphoglycoprotein ATP-binding cassette subfamily B 
(ABCB1) actively extrudes a range of structurally and functionally diverse xenobiotics as 
well as glucocorticoids. ABCB I is present in many cancer cell types as well as in normal 
tissues. Although it has been localized within the mouse placenta, virtually nothing is 
known about its regulation. In the mouse, two genes, Abcbla and Abcblb, encode 
ABCB I. We hypothesized that there are changes in placental Abcbla and Abcblb gene 
expression and ABCB1 protein Levels during pregnancy. Using in situ hybridization, we 
demonstrated that Abcblb mRNA is the predominant placental isoform and that there are 
profound gestational changes in the expression of both Abcbla and Abcblb mRNA. 
Placentas from pregnant mice were analyzed between Embryonic Days (E) 9.5 and 19 
(term approximately 19.5d). Abcblb mRNA was detected in invading trophoblast cells 
by E9.5, peaked within the placental labyrinth at El 2.5, and tiien progressively decreased 
toward term (P < 0.0001). Abcbla mRNA, although lower than that of Abcblb at 
midgestation, paralleled changes in Abcblb mRNA. Changes in Abcbl mRNA were 
reflected by a significant decrease in ABCB 1 protein (P < 0.05). A strong correlation 
existed between placental Abcblb mRNA and maternal progesterone concentrations, 
indicating a potential role of progesterone in regulation of placental Abcblb mRNA.' In 
conclusion, there are dramatic decreases in Abcbla and Abcblb mRNA and in ABCB1 at 
the maternal-fetal interface over the second half of gestation, suggesting that the fetus 
may become increasingly susceptible to the influences of xenobiotics and natural steroids 
in the maternal circulation! 
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Expression of human telomerase reverse transcriptase gene and protein 
and of estrogen and progesterone receptors, in breast tumors: Preliminary 
data from neo-adjuvant chemotherapy. 

j Kanunori M . Izumiyama N, Hashimoto M Nakamur^ Okano T Kurabavashi R 
NaokiH, HomnaN, OgawaT, IGjounjsluM, TakuboK ' Kurabayashi R , 

Division oflJreast and Endocrine Surgery, Department of Surgery, Graduate School of 
Medicine, The University of Tokyo, Bunkyo-ku, Tokyo 1 13^ Japal kamnor^ 

ais(guniin.acjp. 

Human telomerase reverse transcriptase (hTERT), the catalytic subunit of telomerase is 
very closely associated with telomerase activity. Telomerase has been implicated in ' 
cellular immortalization and carcinogenesis, in situ detection of hTERT will aid in 
deterrninmg the localization of telomerase-positive cells. The aim of this study was to 
detect expression of hTERT mRNA, hTERT protein, estrogen receptor (ER) and 
progesterone receptor (PR) in paraffin-embedded breast tissue samples and to investigate 
me relanonship between hTERT expression and various clinicopathological Zme £s 
m breast tumongenesis. We used in situ hybridization (ISH) to examine^hTE^e 
pp^^^j'pp' ^^'^j'^^'^^^^tocheinistry (IHC) to examine expression of hTERT protein 
r m ^ SSUeS mClUding 64 aden ocarcinomas, 2 phyllode rumors and their 
-adjacent normal breast tissues. hTERT gene expression was detected by ISH in 56 (88%) 
carcnomas but in neither of the 2 phyllode tumors. hTERT protein expression was } 
detected by IHC in 52 (81%) carcinomas, but in neither of the 2 phyllode tumorT 
Moreover ER and PR were expressed in 42 (66%) and 42 (66%) carcinomas, ' 
respectively, and in neither of the 2 phyllode tumors. In 4 cases of breast carrinoma that 
strongly expressed hTERT gene and protein before treatment, neo-adjuvant 
chemotherapy led to disappearance of gene and protein expression in all cases. There was 
a strong gelation between detection of hTERT gene expression by ISH and of hTERT 
protein by ICH in tissue specimens from breast tumors. These results suggest that 
deechon of hTERT protein by ICH can be used to distinguish breast cancers as a 
potential diagnostic and therapeutic marker. 
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Abstract. Human telomerase reverse transcriptase (hTERT), 
the catalytic subunit of telomerase. is very closely associated 
with .elomerase activity. Telomerase has been implicated u» 
cellular imnwrtalization and carcinogenesis. In situ detection of 
hTERT will aid in determining the localization of telomerase- 
posilive cells. The aim of this study was to detect expression of 
hTERT mRNA. bTERT protein, esuogen receptor (ER) and 
progesterone receptor (PR) » paraffin-embedded breast tissue 
samples and to investigate the relationship between hTEKI 
expression and various clinicopathologica) parameters in breast 
tumorigenesis. We used in situ hybridization (1SH) to examine 
hTERT °ene expression, and immunohisiochemisiry (IHQ to 
examine°expression of hTERT protein, ER and PR. in breast 
tissues including 64 adenocarcinomas. 2 phyllode tumors and 
their adjacent normal breast tissues. hTERT gene expression 
was detected by ISH in 56 (88%) carcinomas, but in neither 
or the 2 phyllode tumors. hTERT protein expression was 
detected by 1HC in 52 (81%) carcinomas, but in neither of 
the 2 phyllode rumors. Moreover. ER and PR were expressed 
in 42 (66%) and 42 (66%) carcinomas, respectively, and in 
neither or the 2 phyllode rumors. In 4 cases of breast carcinoma 
that strongly expressed hTERT gene and protein before 
treatment neo-adjuvant chemotherapy led to disappearance of 
gene and protein expression in all cases. There was a strong 
correlation between detection of hTERT gene expression by 
ISH and of hTERT protein by 1CH in tissue specimens from 
bieast tumors. These results suggest that detection of hTERT 
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protein by 1CH can be used to distinguish breast cancers as » 
potential diagnostic and therapeutic marker. 

Introduction 

Breast cancer is the most frequent malignancy in w °men. 
affecting up to one in every eight females worldwide. The 
most important climcopathological prognostic parameter so 
fa, identified is the absence or presence of lymph node 
metastasis, but the identification of further parameters for 
both lymph node-positive and -negative patients would 
facilitate an individually based risk-directed therapy (1). A 
promising emerging molecular marker is telomerase a 
ribonucleoprotem enzyme complex, which when activated 
or upregulated allows tumor cells to escape from cellular 
senescence and to proliferate indefinitely (2). The human 
telomere is a simple repeat sequence of six bases^ fTTA GGO) 
that is located at the ends of each chromosome (3). Telomeres 
are believed to protect against degeneration, reconstruction, 
fusion, and loss (4) and to promote the homologous pairing 
of chromosomes (5). The end-to-end chromosome fusions 
observed in some tumors may result from the loss of telomeres 
and may be partly responsible for the genetic instability 
associated with tumorigenesis. Telomerase catalyzes the 
synthesis of telomere DNA and facilitates cell immortalization 
through the stabilization of chromosomal structure (6-8). 
Although the expression of the human RNA component of 
telomerase (hTERQ is widespread, die restricted expression 
pattern of the mRNA of hTERT. the human telomerase 
catalytic subunit gene, is correlated with telomerase activity 
(8-13). As telomerase activity seems to be the key. player in 
tumor cell immortality, it has importance as a target molecule 
for anti-cancer therapy. Telomerase activity has been shown 
to correlate with poor clinical outcome in neuroblastomas 
and other rumors (14). Fo» breast cancer, however, telomerase 
activity is a controversial prognostic marker: some studies 
sugoest that telomerase activity, clinicopathological parameters 
and disease outcome are linked, whereas oihers do no. find 
this association (14-23). 
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We have succeeded in very clearly and sensitive )y demon- 
strating hTERT mRNA in thyroid, colorectal, parathyroid 
and lung tissues by use of an oligonucleotide probe (13,24-26). 
Strong correlation has been observed between hTERT mRNA 
and/or protein expression and telomerase in a variety of 
malignant tumors (13.14,24.25,27,28). In the present study, 
we used ISH to examine expression of the hTERT gene, and 
1HC to examine expression of hTERT protein, ER and PR, in 
64 carcinomas and 2 phyDodes rumors of breast to determine 
whether hTERT protein can be used to differentiate breast 
cancers. We also analyzed hTERT mRNA and protein 
expression with special reference to clinical features and 
histological findings to investigate the potential role of 
hTERT mRNA expression analysis in predicting the biological 
characteristics of breast cancers. Since hTERT expression in 
breast tumors has not previously been analysed by ISH or IHC t 
our investigation also examined various clinicopathoJogica! 
parameters; including age, histopathplogical type, tumor size, 
lymph node status, relapses, and the expression of ER and PR. 

' Materials and methods 

Tissue collection. Sixty-six samples were obtained during 66 
mastectomies: 64 breast carcinomas, 2 phyllodes tumors and 
66 specimens of the adjacent normal breast gland. ]n 4 cases, 
samples were obtained during core needle biopsies (CNB) 
before neo-adjuvant therapy, and again during mastectomies 
after neo-adjuvanl therapy; for these cases, all measurements 
and examinations were performed both before and after the 
neo-adjuvani therapy: The patients ranged in age from 32 lb 
90 years. Trie patients wiih carcinomas ranged in age from 37 
to 90, mean 56, years and were all women. The women with 
phyllode tumors were aged 32 and 38 years, respectively. 
The surgical and CNB tissue samples were frozen rapidly 
with liquid nitrogen and stored at -80"C until fixation. Then, 
they were fixed in 10% buffer formalin solution and 
embedded in paraffin. Surgical and CNB samples were 
collected from the patients after obtaining their informed 
consent, and the study protocol was approved by the Medical 
Department of the University of Tokyo Ethics Committee. 
The pathologic diagnoses were made by the surgical 
pathological specialists at our institute on the basis of 
examination of hematoxylin-eosin. stained slides. A 
pathological review was performed for all breast tumors 
according to the BRE score. pT and pN staging were assigned 
according to the 1997 WHO classification (7lh edition). 

MCF-7 human breast cancer cells, kindly provided by the. 
Cell Resource Center for Biomedical Research, Institute of 
Development, Aging and Cancer, Tohoku University, were 
used as positive controls. The cells were incubated in RPMl- 
1640 medium with 25 mM HEPES buffer. L-gluiamine. and 
J0% fetal bovine serum (Gibco, Grand Island, NY) on a 
chamber-attached slideglass (Lab-Tele* Chamber Slide™: 
Nalge Nunc Imemational, NapervjJIc. 1L) in a humidified 5% 
CCX atmosphere at 37"C. The cells were ihen fixed with 10% 
buffered neutral formalin (Sigma Chemical Co., Si- Louis. 
MO). The cultured MCF-7 cell line thai was used as a positive 
control *V35 tested for telomerase whh a PCR-based standard 
TRAP assay (6.! 3). These cells were also used to prepare cell 
blocks. Briefly, the celts were fixed in 10% buffered neutral 



formalin, resuspended in molten agarose and then embedded in 
paraffin. Sections from these cell blocks were used as positive 
procedural controls in ISH and IHC. The negative control in 
JSH was obtained by replacing the oligonucleotide probe with 
RNase. The negative control in IHC was obtained by replacing 
the primary antibody -with Tris-buffered saline (TBS). 

* 

Oligonucleotide probe for ISH. The specificity of the oligo- 
nucleotide sequence was initially determined by a GenEMBL 
database search using the Genetics Computer Group Sequence 
Analysis Program (GCG, Madison, WI) based on the fastA 
algorithm (29); the sequence exhibited 100% homology 
with the hTERT gene sequence. A d(T)^ oligonucleotide 
was used to verify the integrity and lack of degradation of 
the mRNA in each sample. All oligonucleotide probes 
were synthesized with a bapien-jabcled nucleotide, such as 
digoxigeninrdUTP (Boehringer-Mannheim), at the 3 - end via 
direct coupling by using standard phospboramidite chemistry 
(Research Genetics, Huntsville, AL) (30). The probe used for 
detection of hTERT by ISH was generated from the original 
sequence for Homo sapiens telomerase reverse transcriptase 
(AF0I5950), 2766-2800: 5'-GCCTCGTCTTCTACAGGGA 
AGTTCACCACTGTCTT-3* (13,24-26). 

ISH. JSH was r*rforroed wiih the GenPoint nucleic acid hyper- 
detection system (Dako; Carpenteria, CA) (31). Formalin- 
fixed, paraffin-embedded tissue sections (5 wm thick) were 
deparaffinued in xylene and a graded alcohol scries. Tissue 
sections and CNB samples were then pretreated with target 
retrieval solution (Dako, Si 700) at 95*C and proteinase K 
(Dako, S3004) at room temperature. Next; the tissues and 
CNB samples were fixed in 03% hydrogen peroxide followed 
by a methyl alcohol series at room temperature. Digoxigenin- 
labeled anti-sense oligonucleotide in mRNA in situ hybri- 
dization solution (Dako, S3304) was placed over the tissues 
and CNB samples. After hybridization at 37*C overnight, 
the slides were washed in stringent wash solution (Dako. 
GenPoint System Kit) at 45'C. The tissues and CNB samples 
were exposed to avidin blocking solution (Dako, X0590) 
at room temperature followed by biotin blocking solution 
(Dako, X0590) at room temperature. The tissues and samples 
were then incubated at room temperature with a sheep 
monoclonal hapten- labeled anti-digoxigenin antibody (Dako, 
p5104), and the slides were then fixed with biotinyl tyramine 
(Dako, GenPoint System Kit) at room temperature. Finally, 
the slides were incubated with HRP-conjugated stieptavidin 
(Dako, GenPoint System Kit) at room temperature. Since 
33' diaminobenzidine letrabydrochloride (DAB) was used as 
the substrate, a positive reaction was visible as a brown color 
under a Hgbi microscope. The sections were weakly counter- 
stained with 0.1% hematoxylin. 

IHC. IHC was performed by the avidin-biotin complex/ 
horseradish peroxidase method. Tissue sections were stained 
for hTERT wiih a commercially available monoclonal antibody 
(NCL-L-hTERT; Novocastra, Newcastle upon Tyne. UK). 
Sections were dewaxed in xylene. Aniigen retrieval was done 
by incubating sections immersed in 0.01 M citrate buffer at 
pH 6.0 in a mlciowave oven ol 99*C. The sections were 
allowed to cool down at room temperature. The sections 
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Figure I. Reprrscniative results of ihc TRAP assay. H functional telomerase is 
present, the enzyme adds DNA to the substrate in 6-base-pair (bp) increments, 
resuhing in a ladder-like distribution of products. The 6-bp ladder signals art 
apparent for MCF-7 and brtssl cancers (ease nos. 6. 14, 23. 44 and 59) and are 
no* apparent for lysis buffer as the negative control. An extract of MCF-7 was 
used as a positive control for the TRAP assay and as an Internal Teloroeraat 
Assay Standard 0TAS) positive control for PCR amplification, with lysis 
buffer as the negative control (Lysis). 

were iben immersed in 1% hydrogen peroxide (H 3 0 2 ) in 
methanol to block endogenous peroxidase activity. Following 
that, the sections were washed in TBS (pH 7.6) before being 
incubated in normal rabbit serum for 20 min io block non- 
specific binding. After draining off the excess serum, the 
sections were incubated with the primary antibody at room 
temperature. The sections were washed in TBS before being 
incubated with the secondary antibody (biotinylaled rabbit 
ami-mouse, Dako). The sections were washed again with 
TBS and incubated with avidin biotin complex/horseradish 
peroxidase. After washing the sections with TBS, peroxidase 
activity was visualized under Jight microscopy by applying 
DAB chromogen (Dako). The sections were counterstained 
with hematoxylin, dehydrated in increasing grades of alcohol 
and finally mounted in diburyl phihalate (DPX) mountant. 

Homogeneous staining or a speckled/dotted pattern in the 
nucleus was considered positive staining, and absence of 
distinct nuclear staining was taken as negative staining. 
Grading of the percentage of stained cells (hTERT labeling 
index) was performed by previously published criteria (1) as 
follows: Grade 1, negative staining; Grade 2, 1-10% positive 
siaining nuclei; Grade 3, 1 1-50% positive nuclei; and Grade 4, 
>50% positive nuclei. Jmmunbstained slides for ER and 
PR were scored as previously described (32,33). In brief, 
each entire slide was evaluated by light microscopy. First, a 
proportion score was assigned, which represents the estimated 
proportion of positive -staining tumor cells (0, none; 1 „ < 1/100; 
2. 1/100 to 1/10; 3, 1/10 to 1/3; 4, 1/3 io 2/3; and 5, >2/3). 
Next, an" intensity score was assigned, which represents the 
average intensity of positive tumor cells (0. none; 1, weak. 
2, intermediate; and 3, strong). The proportion and intensity 
scores were then added to obtain a total score, which ranged 
from 0 to 8. Slides were scored by pathologists who did not 
have knowledge of ligand- binding results or patient outcome. 



Tabic I. Relationships between mRNA status (negative/ 
positive) by ISH and standard clinical, pathological, and 
biological factors in the 66 tumors. 

No. of patients (%) 

Total '■ 

population hTERT hTERT * 

(%) negative positive P-value* 



Total 66 10(15.2) 56(84.8) 

Age NS 

<50 26 6 (23 J) 20(76.9) 

>50 40 6(15.0) 34(85.0) 

Hisiopatho- 

logical type ^ . NS 

Scirrhous 32 2 (6.4) 30(93.6) 

Papillorubular 20 2(10.0) 18(90.0) 

Solid tubular 6 1(16.7) 5(833) 

Mucinous 2 1 (50.0) . 1 (50.0) 

Non-invasive 4 2(50.0) 2(50.0) 

Phyllodes 2 2 (100) 0. (0) 

Tumor size I NS 
(cm? 

Tl (<2-0) 20 0 (0) 20 (100) 

77(2.0-5.0) 34 . 6(17.6) 28(82.4) 

T3(>5.0) 10 2(20.0) 8(80.0) 

Lymph node 

starus* NS 

pNO 34 6(17.6) 28(82.4) 

pNI 26 2 (7.7) 24(923) 

pN2+pNM 4 0 <0) 4 (J00) 

Relapse NS 

+ 10 0 (0) .10 (100) 
56 10(11.6) 46(82.4) 

ER expression 

+ (^2) 42 6(143) 36(85.7) 

- (<2) 24 4(16.7) 20(833) 

PR expression 

+ <*2) 42 4 (93) 38(903) 

. <<2) 24 6(25.0) 18(75X1) 



V test. NS. not significant. ^Information available for 64 patients. 



Statistical analysis. Differences in p-values were analyzed 
with the x 7 »esi for independence, and Fisher's test was used 
for conflations. In all comparisons, p<0.05 was considered 
significant. 

Results 

Representative results of the TRAP assay art shown in Fig. I. 
The cultured cells, which were tested for lelomerase activity 



Figure 2. Contlaiion bf iwetn bisiologic diagnosis, human iclomtrase jcv 
rtccpior (ER) by IHC and progesterone lecrpior iPR) by IHC in brcaM cantci 

with the TRAP assay, gave positive results with all procedural 
controls (MCF-7 and 5 breast cancer samples) (Fig. I). 

1SH revealed that hTERT mRNA was strongly expressed 
in the nuclei and cytoplasm of almost all of the MCF-7 human 
cancer cells (data not shown). Expression of hTERT mRNA 
was detected inr56 (88%) of the 64 breasi cancers and in none 
of the pbyllodes tumors of the breast (Table I) wijh the anti- 
sense probe, whereas no expression was detected with the anti- 
sense probe treated with RNasc (data not shown). The levels of 
expression were heieiogeneous within the carcinomatous 
regions. Strong expression of hTERT mRNA was not confined 
to the carcinomatous regions but was also detected in 
infiltrating lymphocytes (Fie. 2a and b). Higher expression 
levels of boih signals of hTERT mRNA were detected in 
some sections containing both carcinomas and lymphocytes, 
compajed with the adjacent non* cancerous mammary gland, 
but no clear differences in signal intensity were observed 
between caicinomas and lymphocytes. The signals in both 
the normal and cancer tissues were mainly present in the 
lymphocytes, and the signal intensity was similar in both, 
although a piecise quantitative comparison of the in siw signals 
was impossible. 



rx iranscripio* (bTERT) mRNA by 15H. hTERT protein by IHC. estrogen 
. (o. H&E): (b. bTERT mRNA): (c. hTERT protein); (d.ER) and (e. PR). 

JHC revealed that bTERT protein was strongly expressed 
in the nuclei, nuclear membrane, and cytoplasm of almost all 
of the MCF-7 human cancer cells (data not shown). Expression 
of hTERT protein was delected in 52 (81%) of the 64 breast 
cancers and in none of the phyllode lumors of the breast 
(Table Jfl). The levels of expression were heterogeneous within 
the carcinomatous regions. As shown in Fig. 2c; strong 
expression of hTERT protein was observed in nuclei, nuclear 
membrane and cytoplasm, similar lo the pattern in MCF-7 
human cancer cells. Normal mammary gland, and stromal 
cells generally showed negative imrounoreactivily against 
hTERT protein antibody. 

A nuclear signal for the ER (Fig. 2d), as assessed by IHC. 
was observed in 36 (56%) of the 64 breasi cancers and in 
none of the phyllode tumors of the breasi. with positive 
scores ranging from 2 to 8 (Tables I and ))). A nuclear signal 
for the PR (Fig. 2e). as assessed by IHC. was observed in 38 
(59%) of the 64 breast cancers and in none of ihe phyllode 
tumors of the breasi. wiih positive scores ranging from 2 to 8 
(Tables I and II). 

We used 1511 and JHC lo examine hTERT expression 
in 4 cases of breast cancer before and after neo-adjuvanl 



Table H. Relationships between mRNA status (negative/ 
positive) by IHC and standard clinical, pathological, and 
biological factors in the 66 tumors. 



Total 



No- of patients (%) 

bTERT 
positive 



hTERT 
negative 



Total 

Age 
<50 
>50 

Histopathologic^! 

Scirrhous 

PapiJlorubuJar 

Solid tubular 

Mucinous 

Non-invasive 

Phyllodes 

Tnrnor siie (cm)> 
Tl (<2.0> 
T2 (2.0-5D) 
T3 (>5.0) 

Lymph node status* 
pNO 
pN» 

pN2+pNM 
Relapse 



66 14 (212) 52(78:8) 



26 
40 



32 
20 
6 
2 
4 
2 



20 
34 
10 



34 
26 
4 



10 
56 



10(385) 
4(10.0) 



2 (6.4) 
4 (20.0) 
4(66.7) 
0 (0) 
2 (50.0) 
2 (100) 



4 (20.0) 
6(17.6) 
2 (20.0) 



6(17.6) 
6(23.1) 
0 (0) 



16(615) 
36(90.0) 



30 (93,6) 
16(80.0) 
2 (33 J) 
2 (100) 
2(50.0) 
0 (0) 



16(80.0) 
28(82.4) 
8 (&K0) 



28 (82!4) 
20(76.9) 
■4 (100) 



42 
24 



ER expression 
+ (*2) 

- (<2) 

PR expression 
+ (£2) 42 

- (<2) 24 



1 (10.0) 
13(23.2) 



6(14J) 
8 (333) 



6(143) 
8 (333) 



9 (90.0) 
43 (79.6) 



36(85.7) 
16 (66 J) 



36(85.7) 
16(66.7) 



P-value 



NS 



NS 



NS 



NS 



NS 



NS 



NS 



a / ? lest. NS, not significant. ^Information available for 64 patients. 



chemotherapy. Before chemotherapy, all 4 of the breast 
carcinomas strongly expressed hTERT by both 1SH and 
IHC. After chemotherapy. hTERT expression completely 
disappeared in all 4 cases (Table 111). hTERT expression by 
lymphocytes was detectable by ISM and IHC both before and 
after chemotherapy in all 4 cases, and the level of expression 
did not appear to be altered by treatment. 

No correlation was observed between hTERT mRNA 
expression and any of the clinicopatholosical parameters 
age, hisiopaiholosicai type, tumor <i2e, lymph node status. 



Table HI. Relationship of hTERT mRNA ahd prolein 
expression before and after neoadjuvant chemotherapy- 

hTERT mRNA hTERT protein 



Case 


Age 


Neoadjuvant. 


Before After 


Before After 


1 


80 


Anastrozole 




+ 


2 


78 


Anastrozole 


+ 


+ 


3 


35 


FEC* 


+ 


+ 


4 


37 


AC • 







"FEC.5FU (500 mg/nr), Epirubicin (70 mg/m 2 ). Cyclophosphamide 
(500 mg/m J ). *AC, Doxorubicin (60 mg/rrr 7 ). Cyclophosphamide 
(500 mg/m 1 ). Before; before neoadjuvant cbc mother »py. After, 
after neoadjuvant chemotherapy. •*» positive; -, negative. 



relapses, and the expression of ER and PR. Similarly, there 
was no correlation between hTERT prolein expression and 
any of these clinicopathological parameters. There was a 
correlation between hTERT mRNA expression and hTERT 
protein expression in breast cancers (p<0.005). 

. Discussion 

This study reports a comparison of hTERT mRNA expression 
by 1SH and hTERT protein expression by IHC in tissue 
sections from breast tumors. hTERT mRNA was detected 
by 1SH in 56 of the 64 breast cancers and in MCF-7 human 
breast cancer cells. Breast cancer eel) nuclei stained strongly 
positive with the specific anti-sense probe but not with the 
anti-sense probe treated with RNase (data not shown). Tissue 
lymphocytes also stained positively with the ami- sense probe, 
but the stromal cells did not. Expression of hTERT protein 
was observed by IHC in 52 of the 64 breast cancers. hTERT 
mRNA and prolein expressions were highly correlated in 
bieast cancers (p<0.005). Detection of Ore hTERT protein by 
IHC has permitted further analysis of carcinogenesis and 
cancer diagnosis (34). 

In recent years, there has been disagreement over the 
suitability of telomerase activity as a prognostic biologic 
marker in breast cancer that may help to differentiate patients 
for individually based risk- related therapy. Hiyama el ot (15), 
in a study of 140 breast cancer specimens with the TRAP 
assay, found a strong association between telomerase activity 
and stage classification and observed telomerase activity 
in 68% of stage 1 tumors and 95% of stage IV tumors. 
Po/emba et «/(!), using tissue microarrays, found a statistically 
significant correlation berween tumor-specific survival (overall 
survival) and hTERT expression in breast cancer/However, 
some problems in interpretation may affect this apparent 
consensus. First T some samples of breast cancer tissue may 
be extensively contaminated by infiltrating lymphocytes 
during operative manipulations, especially in advanced disease, 
causing overestimarion cf telomerase activity and/or hTERT 
expression. In our previous reports, higher expression leveb 



of signals for both hTERT mRNA and protein were detected 
in some sections containing both carcinoma and lymphocytes 
in thyroid and colorectal cancers (13,24). Secondly. Poremba 
et al{X) used polyclonal antibodies against hTERT protein as 
a signal for expression. In our bands, polyclonal antibodies 
against hTERT protein give rise to strong background signals 
and are not clearly specific for rneasuring expression in cancer 
tissues. We have carefully compared the reactivity against 
hTERT protein of the monoclonal antibody used in the 
present study with that of some polyclonal antibodies. Use of 
the monoclonal antibody in IHC allowed clear demonstration 
of hTERT protein expression, with results similar to those 
of 1SH for hTERT mRNA expression. Furthermore, IHC is 
technically much easier to perform than 1SH, since con- 
tamination of samples by RNase is not an issue irt IHC. 

To the best of our knowledge, this report is the first on the 
study of hTERT expression in. breast cancer as a function of 
neo-adjuvani treatment. We examined hTERT expression 
in 4 cases of breast cancer before and after chemotherapy. 
Before chemotherapy, hTERT was strongly expressed in 
all 4 carcinomas, but after chemotherapy hTERT expression 
had completely disappeared in all 4 cases, hTERT expression 
by. lymphocyte's was detectable by JSH and JHC both before 
and after chemotherapy in all 4 cases, and the level of 
expression did not appear lo be altered by treatment. 

In conclusion, determination of hTERT mRNA expression 
by 5SH and hTERT protein expression by JHC can be used to 
obtain information contributing to a histopaibological diagnosis 
during screening of breast cancers. By use of a monoclonal 
antibody, we could very clearly and sensitively demonstrate 
hTERT protein expression in breast cancer tissues but not in 
non-cancerous tissues. We also demonstrated that 4 carcinomas 
with originally positive immunoreactiviiy against hTERT 
pjoiein became negative after neo-adjuvaht chemotherapy. 
These results suggest thai determination of hTERT protein 
by JCH can be used as a potential diagnostic and therapeutic 
majker to distinguish breast cancers. 
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Expression of the ubiquitui-proteasorae pathway and muscle loss in 
experimental caucer cachexia. 

KbajJ, Wyke SM, Russell ST Hine AV Tisdale MJ . 

Pharmaceutical Sciences Research Institute, Aston University, Birmingham, UK. 

l U htvl P T tdn d T d f ti0 , n iS th ° Ught t0 Play 3 ma J'° r r0le in musc,e ^ophy in cancer 
cachexta. To myeshgate the importance of the ubiquitin-proteasome pathway, which has 
been sugg^ted to be the mam degradative pathway mediating progressive protein lossTn 
cachexia, the express.on of mRNA for proteasome subunits C2 and C5 as well as the 
ub,qu,tm-(X>nj U gating enzyme, E2(14k), has been determined in gastrocnemius and 
pectoral muscles of m.ce bearing the MAC 16 adenocarcinoma, using competitive 

sTlT^TZlT ?? CP ° ] ^ e ™ *** reaCti ° n - ?r0t ™ >«vels of proteasome 
w. r r fl t l f ?( u } ^ detemuned fa y immunoblotting, to ensure changes in mRNA 
wShMnf f m r gCS Pr °, tein ex P rcssion - Musc ' e eights correlated linearly with 
we lg ht bss dunng the course of the study. There was a good correlation between 
express™ of C2 and E2(14k) mRNA and protein levels in gastrocnernTurlscTe with 

foZS, 0 ' 6 t f ° ld 02 ^ tW °- foId f ° r E2(,4k > between 12 - d 20%TetntTo* 
Mowed by a decrease m expression at weight losses of 25-27%, although Iossff mtSle 

Tf • * m eXpreSSio " ° f C5 mRNA on «y '"ncreasol two-fold and wS 
"™" a t 3t 311 between 7.5 and 27%: Both proteasome function^ 

actmty and proteasome-spec.fic tyrosine release as a measure of total protein 
degradauon was also maximal at 18-20% weight loss and decreased at higher weight loss 

cS^TflT*: * IZT 1 ,T de f0 " OWed 3 diffCrent Pattem wlincrease n 
C2 and C5 and E2(14k) mRNA only being seen at weight losses above 1 7% althoueh 

muscle loss mcreased progressively with increasing weight loss. These results suggS 

that act,vat,on of the ub.quUm-proteasome pathway plays a major role in proteinL in 

' ? t0 20% Wd8ht l0SS ' bUt ** 0ther such * depression in 

protein synthesis may play a more important role at higher weight loss. 

PMID: 16160695 [PubMed - in process] 
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Increased expressiou of proteasome subunits in skeletal muscle of cancer 
patients with weight loss. 

KhajJ, HineA V, Fearon KC , Dgjon^CH, Tisdale MJ 

Pharmaceutical Sciences Research Institute; Aston University, Birmingham B4 7ET, UK. 

Atrophy of skeletal muscle is common in patients with cancer and results in increased 
mortidity and mortality In order to design effective therapy the mechanism by which 
this occurs needs to be elucidated. Most studies suggest that the ubiquiUn-proLome 
proteolync pathway is most important in intracellular proteolysis, although there have 

E« . n ° t K CPOrtS T ? C aCUvity ° f thiS pathway in P atients with differe "t stents of weight 
loss. Indus report the expression of theubiquitin-proteasome pathway in rectus . ■ 
abdominis muscle has been determined in cancer patients with weight loss of 0-34% 

using a competitive reverse transcriptase polymerase chain reaction to measure 
express.cn of rnRNA for proteasome subunits C2 and C5, while protein expression has 
been de termmed by western blotting. Overall, both C2 and C5 gene expression Z 
.ncreased by about three-fold in skeletal muscle of cachectic cancer patients fa^rSe 
weight loss 14.5 + /-2, 5 %), compared with that in patients without we!gh "os Sor 
without cancer. The level of gene expression was dependent on the amount of weight 
niwTS? maximaI| y for bot » Proteasome subunits in patients with weight loS of 
Further increases m weight loss reduced expression of rnRNA for both 

wdSfZ S UnUS ' alth ° Ugh J it W3S Sti " deVated in C ™P™ with patients with no 

' So/ Th? ere ^ n ° T'^f f ° r m inCreaSC in 6X P reSsi0n at 1«« l^s 

■2L-n ^ 7k S 3 g °° d COITeIahon ^tween expression of proteasome 20SaI P ha 

gene expression resulted in increased protein synthesis. Expression of me ubiquitin 
conjugating enzyme, E2(l4k), with weight loss followed a similarpattem to mat of 
proteasome subunits. These results suggest variations in the expression of key 

sZSS l?h ^ iqU i tin r° te f 0me P athwa V with weight loss of cancer patients, and 
suggest that another mechanism of prote.n degradation must be operative for patients 
with weight loss less than 10%. • pauems 

PMID: 16125116 [PubMed - in process] 
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Cell type-specific occurrence of caveolin-lalpha and -lbeta in the lung 
caused by expression of distinct inRNAs. 

Kogg_H, Aiba T. Fujimoto T . 

Department of Anatomy and Molecular Cell Biology, Nagoya University Graduate 
School of Medicine, Showa-ku, Nagoya 466-8550, Japan, hkogo@fujita-hu.ac.jp 

Two isoforms of caveolin-1, alpha and beta, had been thought to be generated by 
alternate translation initiation of an mRNA (FL mRNA), but we showed previously that 
a vanant mRNA (5'V mRNA) encodes the beta isoform specifically. In the present srudy 
we demonstrated strong correlation between the expression of the caveolin- 1 protein 
isoforms and mRNA variants in culture cells and the developing mouse lung. The alpha 
isoform prptem and FL mRNA were expressed constantly during the lung development 
whereas^ expression of the beta isoform protein and 5'V mRNA was negligible in the fetal 
lung before 17.5 days post coitum, and markedly increased simultaneously at 18 5 days 
post coitum, whence alveolar type I cells started to differentiate. Inununohistochemical 
analysis revealed the cell type-specific expression of the two isoforms; the alveolar type I 
cell expresses the beta isoform predominantly, while the endothelium harbors the alpha " 
isoform chiefly. The mutually exclusive expression of caveolin-1 isoforms was verified 

VT ?? ?', ng ° f thC SdeCtive plasma memb rane preparation obtained from the 
endothelial and alveolar epithelial cells. The present result indicates that the two 
caveolin- 1 isoforms are generated from distinct mRNAs in vivo and toat their production 
is regulated independently at the transcriptional level. The result also suggests that the 
alpha and beta isoforms of caveolin-1 may have unique physiological functions. 

PMID: 15067006 [PubMed - indexed for MEDLINE] 
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Oncogene and growth factor expression in ovarian cancer. 

Kommoss F, Baukn echt T, Birmclin G , KohjerM, Tgsckjg, Pflcidcrer A . 

Department of Gynaecology, Albert^Ludwig University, Freiburg, Germany. 

The varying tumor-biological behavior of ovarian carcinomas probably influences both 
Uieir operabihty and response to chemotherapy, which are the most relevant prognostic 
tactors. The phenotype of different ovarian carcinomas is obviously associated with an 
acfvanon f *e EGF^GF-alpha signal pathway, including c-myc and c-jun expression. 
Analysis ot EGF-R, TGF-alpha, c-myc. and c-jun expression in 33 stage HI/IV and 2 
stage I/II.ovanan carcinomas with biochemical, molecular-chemical and 
immunohistochemical methods showed a correlation between the mRNA and protein 
levels of EGF-R and TGF-alpha for tumors with low or high expressing rates. However 
the concentration of measurable free EGF-Rs seems to depend on the amount of TGF- ' 
alpha expression by the tumors. The EGF-R binding ligand TGF-alpha is produced by 
epithelial tumor cells; stromal cells are usually TGF-alpha-negative, as shown by 
lmmunohistochemistry. High expression rates of EGF-R. TGF-alpha and c-myc were 
detected in 6, 7, and 10 out of 35 ovarian carcinomas, respectively. C-jun mRNA was 
detected in 18/19 cases studied. Non-malignant tissues originating from myometrium or 
ovary expressed no (or only small amounts of) EGF-R or TGF-alpha mRNA, whereas a 
high c-myc expression was found in 1/7 normal myometria, and in 2/5 normal ovaries. 
There was no strong correlation between EGF-R/TGF-alpha and c-myc/c-jun 
expression^ ABSTRACT TRUNCATED AT 250 WORDS) 

PMED: 1502888 [PubMed - indexed for MEDLINE] 
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Somatostatin receptors in primary human breast cancer: quantitative 
analysis of mRNA for subtypes 1--5 and correlation with receptor protein 
expression and tumor pathology. 

KuinarU, Grigorakis SI, WattHL, SasjR, SnellL, Watson P, Chaudhari S . 

Fraser Laboratories For Diabetes Research, Department of Medicine, McGill University 
Royal Victorxa Hospital, 687 Pine Avenue West, H3A I Al Montreal, Quebec, Canada. ' 
ujendra.kximar@muhc.mcgill.ca 
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1 Introduction 

Mammals protect thernsefves against pathogen infection pri- 
marily via innate and adaptive immunity The innate 
immune system relies on a set of pattern recognition receptors 
{e.g., TolMike receptors) to recognize foreign molecular struc- 
tures such as lipopolysaccharide (LPS) and bacteria) DNA \2. 3). 
Innate immune ceils recognize these molecular structures and 
initiate not only innate but also adaptive immunity by produc- 
ing jmmunomodulatory cytokines and activating T and B 
immune cells (J). Bacteria) DNA can directly activate B cells to 
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proliferate and secrete immunoglobulins in a TcelJ-independ- 
ent manner [4-6), It also induces B cells and monocytes to 
activate transcription factor NF-kB and secrete cytokines, 
including interleukin (1L) 12, tumor necrosis factor a (TNF-a), 
and interferon ot/p p-10]. The immunosumulatory activity of 
bacterial DNA has been assigned to unmethylated CpG motifs 
(GACGTTfor murine, GTCGTTfor human) [il-J. Recent evi- 
dence shows that synthetic oligodeoxynudeotides containing a 
CpG motif (CpG-ODN), like bacterial DNA with the CpG moi- 
ety (CpG-DNA), induce potent Thl -like immune responses 
that are protective against several infectious agents and 
immune disorders in animal models [1 2, 1 3). Biologically active 
CpG-ODN, like bacteria DNA, activates 'macrophages and 
immature dendritic cells to increase expression of MHC class 
11 and costknulatory molecules, thereby transcribing cytokine 
mRNAs, and producing pro-intf arnma tory cytokines including 
TNFa, IL-1, 1L6 and 1L-12 19, 14-16J. CpG-ODN can therefore 
serve as an adjuvant and immunomodulatbr in vaccines 
against a wide variety of targets, including infectious agents, 
cancer antigens and allergens [17], 

It has been suggested that unmethylated CpG-DNA-medi- 
ated immune activation functions through a toll-like receptor 9 
(TLR9) signaling pathway |18]. Endocytosis and sequentially 
endosomal maturation as well as bin ding of heat shock protein 
(HSP) 90 to CpG-DNA are essential for induction of TLR9 sig- 
nal transduction |19, 20). It has also been shown that recogni- 
tion of CpG-DNA causes TLR to form a dirrier, which recruits 
the adaptor molecule, myeloid dWerentiation factor 88 
(MyD88), through interaction between their Gterminal ToD/R- 
1R domains. This recruitment of MyD88 to the ToD/IL-lR do- 
main of TLR 9 initiates a signaling pathway that sequentially 
involves lL-lR-associated kinase 1 and TNF-a receptor-asso- 
ciated factor 6 [18, 21, 22J. Studies using gene-deficient mice 
and RAW2647 cells transiently transfected with dominant- 
negative forms of these molecules have indicated that the 
MyD88-mediated signaling pathway is essential for CpG-DNA 
induced activation of NF-xB and c-jun NH r terminal kinase 
(JNK), as well as subsequent production of cytokines in mono- 
cytic cells [18, 21 , 22). The precise mechanism of action of CpG- 
DNA and CpG-ODN, nonetheless, is still not thoroughly 
understood. To further elucidate the molecular events after 
binding of CpG-ODN to TLR9, in this study, we treated CpG- 
ODN responsive THP-1 cells with CpG-ODN and evaluated 
changes by using DNA microarray and proteomic approaches. 
We have discovered up-regulabon of more than 50 disting- 
uished genes/proteins and identified induction of several anti- 
apoptotic and neur oprotecting genes by CpG-ODN treatment. 

2 Materials and methods 
2.1 Reagents 

Phosphorothioa re-modified CpG-ODN and GpCODN were 
synthesized by MDBio (Taipei, Taiwan). Human specific 
ODN sequences are: CpG-ODN, S'-TCG TCG 7TTTGT CGT 
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TtT GTC GTT-3 ; GpC ODN, 5' TGC TGC TIT TGTGCT 
TIT GTG CTT-r. The mouse specific CpG-ODN sequence is 
5' TCC ATG ACG TTC CTG ATG CT-3\ CHCA was from 
Sigma (St Louis, MO, USA). 

2.2 Cell culture 

Cell lines were obtained from the American Type Culture 
Collection (Rockville, MD). Mouse RAW264.7 macrophage 
and human embryonic kidney 293 cells (HEK293) were cul- 
tured in DM EM- supplemented with 10% heat inactivated 
fetal bovine serum, 100 U/mL penicillin, 100 ug/inL strep- 
tomycin sulfate, 200mmol/L L-ghitamine, and 50 um P-mer- 
captpethanol in a humidified atmosphere of 5% C0 2 at 37°G 
The medium was changed every 2 days for all experiments. 
Human THP-1 monocytic leukemia cells, which have been 
shown, to express TLR9 and respond to CpG^DNA- stimula- 
tion [23, 24}, were cultured in RPM11640 with the same sup- 
plements as for RAW264.7 cell cultures. 

2.3 Human cDNA microarray 

Total RNAs extracted from cultured THP-1 cells were isolated 
with TRIzol (lnvitrogen, Leek, The Netherlands) and submitted 
to Genasia Biotechnology (Taipei, Taiwan) for further proces- 
sing. In brief, 4 ug of total RNA from CpG-ODN stimulated, or 
normal THP-1 ceDs was labeled with a fluorescence marker 
(U-vision, (Taipei, Taiwan)). Different colored fluorescence dyes 
(Cy5 and Cy3) were used to distinguish total RNA from normal 
and ODN stimulated cells. The labeled RNA was used for 
hybridization with the Human 1 cDNA microchip from Agilent 
Technologies (Palo Alto, CA, V SA).The chips were scanned and 
the expression pattern was analyzed using genechip software. 
Genes showing up-regulation or down-regulation of RNAlevels 
were analyzed and identified on a genomic database as sug- 
gested by the manufacturer of the microchip. 

2.4 Protein preparation 

THP-1 cells were seeded in a 175 cm 2 tissue culture flask at a 
density of 10 6 cells per milliliter in cidnrre medium. The cells 
were stimulated with or without 1.5 pM CpG-ODN at defined 
times and harvested by centrifugation at 4°C, 1000 x g foT 
I5min- Cell pellets were washed twice with ice-cold PBS, 
resuspended and sonicated in extraction buffer containing 
25 mM Tris-HQ {pH 7.5), 2mM p-mercaptoethanol and pro- 
lease inhibitor cocktail. After centrifugation at 10000 x gfor 
20 mm, ammonium sulfate was added to the supernatant until 
the final concentration reached 50% saturation w/v. The solu- 
tion was stirred at 4°C for 30 min and centrifiiged at 10 000 x g 
for 30 min at 4°C The supernatant fraction was then trans- 
fen ed into a fresh rube, and the precipitated protein pellet 
solubilized in extraction buffer. To remove salts and other con- 
taminants, the extracts were treated with a pre-cooled (-2CPC) 
solution of 10% TCA in acetone with 0.07% p-mercaptoetha- 
nol. Proteins were allowed to precipitate overnight at -20°C 
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After cenmfugation, the pellet was washed with ice-cold ace* 
tone, containing 0.07% fJ-merxaptoethanoI. The supernatant 
was discarded and thepeEet dried m a SpeedVac system (Model 
AES1010; Savant, Holbrook, NY, USA). 

Z5 2-DE 

2-DE Was performed using an IPGphore 1EF and a Hofer 
DA1T vertical unit (Amersham Biosciences, Piscataway* NJ, 
USA). One milligram of dried protein sample was dissolved 
in 350 uL of rehydration buffer solution, containing 7 m urea, 
2 m thiourea, 4% w/v CHAPS, 5 rnw tributyl phosphine, and 
2%, IPG and loaded onto an immobilized pH 3-10 linear 
gradient strip (38 cm), followed by rehydration for 16h. IEF 
was then performed in the following manner 100 V for 
30min, 250 V for 30 min, 500 V for 30min, 1000 V for ~ 
30 min, 4000 V for 30min, 6000 V for 55 000 Vh. At the end 
of IEF, the IPG strips were equilibrated for 15 min in buffer 
containing 6 m urea, 2% w/v SPS, 30%v/v glycerol, and 
50 mM Tris, pH 6.8, then reduced with 65 mM dithiorery- 
thritol (DTE) and subsequently alkylated with 135mM 
iodoacetamide for another 15 min. After equilibration, the 
IPG strips were immediately placed on top of a 12% SDS- 
PAGE (1.5 mm,. 20x24 cm). The second dimension gels 
were then overlaid with molten 0.8% agarose solution in 
SDS electrophoresis buffer. Electrophoresis was performed 
at 16°C, starting at 10mA per gel for 1 b, followed by 45 mA 
per gel until the dye front reached the bottom of the gels. 

2.6 Staining and image acquisition 

Immediately after electrophoresis, gels were stained with 
SYPRO Ruby (Molecular Probes. Eugene, OR, USA). In 
brief, gels were fixed for 30min in 10% methanol, 7% acetic 
acid, and then stained overnight in SYPRO Ruby stain. The 
staining solution was removed and gels were washed in 10% 
methanol and 7% acetic acid for 3 h. After staining, image 
acquisition was carried out on a Typhoon 9200 (Amersham 
Biosciences). To identify a protein, spot detection, quantifi- 
cation and matching of 2-D results were analyzed using 
ImageMasler software (Amersham Biosciences). The M t of 
the proteins were calibrated according to the LMW-SDS 
Marker Kit (Amersham Biosciences), and their pJ values 
were estimated from the position of the protein spots on the 
2-D gel and confirmed with the information supplied by the 
manufacturer. Since most of the p/ values for the truncated 
proteins had not been reported previously, the p/ values of 
the truncated proteins were estimated from the position of 
the observed spots. To omit the variation due to the use of 
separate gels, after backgrou nd subtraction, the intensity 
levels of protein spots on each gel were normalized as a pro- 
portion of one reference spot, and protein quantities were 
calculated by integrating the density over the spot area. Pro- 
tein spots that showed reproducible modulation exceeding 
-80% after CpG ODN treatment in three experiments were 
further analyzed by MS. 



2.7 ln-gei digestion with trypsin arid extraction of 
peptides 

Protein spots were excised from stained gels and cut into 
pieces. In brief, gej spots were dehydrated with ACN fox 
10 min and dried in a vacuum centrifuge. Gel pieces were 
resweDed with 55 mM DTE in 25 mM arrimonium bicarbon- 
ate (pH 8.5) at 37°C for 1 h. The solution was then exchanged 
with alleviation solution, which contained lOOrriM iodoaceta- 
mide in 25 mM ammonium bicarbonate (pH 8.5), at room 
temperature for lh. After- aUcylation, the gel pieces were 
washed twice with 50% ACN in 25 m M ammonium bicarbo- 
nate (pH 8.5) for 15 min. The wash solution was discarded 
and the pieces of gel were dehydrated with ACN for lOmin 
and dried in a vacuum centrifuge. Tryptic digestion was 
initiated by resweUing the gel in 25 mM ammonium bicar- 
bonate solution with 25 ng of trypsin (Promega, Madison, 
WI, USA). After incubation at 3TC for 16h, tryptic peptides 
were extracted twice.with 50% ACN containing 5% formic 
acid for 15 min with moderate sonicabon. The extracted 
solutions were pooled and evaporated to dryness in a vacuum 
centrifuge. The dried peptide mixture was dissolved in 0.1% 
formic acid and used for MS. 

2.8* rVIALDJ-G-TOF MS and protein identification 

Tryptic peptides analyses were performed using a Micromass 
Q-TOF Ultima MALDI (Micromass, . Wythenshawe, U.K.) 
equipped with a 337 nm nitrogen laser and operated in 
reflectron positive ion mode. Peptide mixtures (1 uL) were 
premixed with 1 uL of the matrix (5mg CHCA in 50% ACN 
with 0.1% TFA) then spotted onto the MALDI target plate. 
Mass spectra were acquired for the mass range of 900- 
3500 Da and the individual spectra from MALDI MS or MS/ 
MS were processed using the Micromass MassJLynx 4.0 soft- 
ware. The generated peak list files were used to query the 
SwissProt database using the MASCOT program (http:// 
ww.matrixscience.com) with the following parameters: 
peptide mass tolerance, 50ppm; MS/MS ion mass tolerance, 
0.25 Da; allowance of missed cleavage, 1; and consideration 
for variable modifications such as oxidation of methionine 
and carboxyamidomethylation of cysteines. Only significant 
hits as defined by MASCOT probability analysis were con- 
sidered initially. In addition, when the PMF matches were 
between 5 and 9, at least one peptide sequence was manually 
checked by MALDI MS/MS analysis. 

2J9 RT-PCB analysis 

cDNA from THP-1 cells was produced with Superscript I J 
reverse transcriptase (Invitrogen) using a ohgo(dT) 15 primeT 
for 1 h at 42°C PCR of cDNA was performed using specific 
primers for the gene of interest and control P-actm. All PCR 
products were electrophoresed on a 1.5% agarose gel, and 
DNA bands wete visualized by staining the gel with ethi- 
dium bromide. 
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2.10 Immunoblotting 



3 Results 



Human THP-1 or mouse macrophage RAW264J cells 
(5 x 3 x loVwell) were cultured in a six-well culture 
plate and treated with ox without IS um CpG-ODN for 
the . designated times. After stimulation, cells were harvest- 
ed by centrifu^tion at 1000 x'g for ISrniri in a refri-. 
gerated centrifuge and washed twice with cold PBS buffer. 
The cells were Jysed on ice for 15 mm with. 300 uL lysis 
buffer (Pierce, Rockford, USA), supplemented with pro- 
tease | inhibitor cocktail (Sigma). The lysates were cen- 
trifuged at 12000 x g for 15min at 4 0 C„ and protein 
concentrations of supernatant were determined using the 
Bio-Rad Protein Assay (Hercules, CA; USA). The lysates 
(50 ug of protein/lane) were subjected to 12% SDS-PAGE 
and transferred to NC membranes (Amersham Bio- 
sciences). The membranes were blocked in PBS-0.1% 
Tween 20 (PBST) containing 5% non-fat skim milk at 
room temperature for 1 h, followed by staining with anti- 
ADP-ribosylation factor 3 (ARF-3) monoclonal antibody. 
(0.1 ug/mL; Sigma). The membranes were then incubated 
with ! horseradish peroxidase-conjugated secondary anti- 
body i(dihition, 1:3000) for Ih. After washing three times 
with PBST, specific bands were detected by chemilumi- 
nescehce according to the manufacturer's protocol 
(Amersham Biosciences). . . ., - ; 

2.11 Cell transffection and lucHerase assay 

HEK293 cells (5 x 3 x lO^/well) were transfected using 
JH1GENE6 (Roche Molecular Biocnemicals, Indianapolis, 
IN, USA) plus 0.1 ug pSxNF-KB-Iuc (Stratagene, La Jolla, CA, 
USA)L 0.05 ug pCDNA3.1-P-galactosidase, and pCDNA3.1- 
hTLR9 overnight The cells were incubated with or without 
15 uM CpG-ODN for Sh and then lysed. NF-kB luciferase 
activity assays were performed as recommended by the 
manufacturer (Promega). p-galactosidase activity was used to 
normalize the data. 

2.12 Enzyme activity assay 

Pyruvate kinase activity was assayed in a solution (lmL) 
containing lOOmM Tris-HCl (pH 8.0), 100mM KG, 10mM 
MgCi 2 , 0.2 mM NADH, lOmM PEP, 1.5 mM ADP, limit of 
lactate dehydrogenase, and an appropriate amount of cell 
lysate from CpG-ODN untreated or treated THP-1 cells. The 
reaction was monitored at 30°C for a period of time by 
measuring the decrease in absorbance at 340 nm. PGK ac- 
tivity was assayed in a coupJed reaction with gJyceraltlehyde 
3-phosphate dehydrogenase (GAPDH) as described by Lee 
|25], In brief, the assay was performed at 30°C in a total vol- 
ume of 1 mL containing 100mM Tris-HCl (pH 7.9), 10mM 
MgCli, 0.15 mM NADH, 2mM ATP, 6mM 3-phosph ogive e- 
rate, 0.1 mg/mL BSA, 50 mg of GAPDH, and an appropriate 
amount of cell lysate. NADH consumption was monitored at 
340 nm. 



3.1 Effect of CpG-ODN on gene expression profiles of 
human THP-1 cells 

To elucidate the effect of CpG-ODN on gene expression, 
THP-1, a cell line known to express TLR9 and respond to 
CpG-DNA (23, 24) was cultured with or without CpG- 
ODN. Since preliminary experiments showed that 1.5 um 
CpG-ODN caused more contrasting results between nor- 
mal and CpG-ODN. treated ceDs, 1.5 um CpG-ODN was 
used throughout the experiments. To distinguish CpG- 
ODN treated samples from the control, total cellular RNA 
of normal and CpG-ODN treated cells was isolated and 
labeled with the fluorescence dyes, Cy5 and Cy3» respec- 
tively. The labeled RNA was then used for hybridization 
with a Human 1 cDNA microchip from Agilent Technol- 
ogies. Of the 13 000 human , genes represented on the 
gene array, a total of .55 genes changed expression sign- 
ificantly after 2h CpG-ODN treatment Among these, 
50 genes were up-regulated while five genes were down- 
regulated by a factor > 2. These genes were sorted by 
functions and are listed in Tables 1 and 2. They included 
notably, 1L-18 receptor accessory protein, MSG A beta 
gene, thioredoxin, pro-pol-dUTPase polyprotein, Spl40, 
connexin 59 gene, Grb2-like 2, en oyl- coenzyme A hydra- 
tase, propionyl coenzyme A carboxylase, cytochrome P450, 
and WNT1 inducible signaling pathway protein 2 (W1SP- 
2) etc. The function of these genes are known to be relat- 
ed to inflammatory responses, antimicrobial defense, 
transcriptional regulation, intracellular signal transduc- 
tion, tumor progression, cell differentiation, proteolysis 
etc. 

Table 1. Genes up-regulated (> 2 fold) in human THP-1 cells after 
2 b of CpG-ODN treatment 



Gene narrie and description 
(changed fold > 2) 



Genebank 
number 



Inflammation and receptor 

IL-18 receptor accessory protein 
T cell receptor V beta gene 
MGSA, beta gene 
Antigen gene (PA) 
TIED 

Platelet activating receptor 

Antimicrobial defence 
Thioredoxin 

Pro-Pol-dUTPase polyprotein 
Nuclear body protein Sp140 

Transcriptional regulation 

Putative transcription factor LUZP 

General transcription factor II, I, pseudogene 1 

Connexin 59 gene 

Basic helix-loop-helix protein class B 1 
(BHLHB1) 



AF077346 

X58806 

U03019 

M21B96 

NM_004791 

AF 002986 

NM_003329 

AC004748 

U63420 

AI986271 
AI700706 
L29277 
AF221520 
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Table 1. Continued 



Gene name and description 
(changed fold> 2) 



Table 2. Genes clown. regulated (> 2fold) in human THP-1 cells 
after 2 h of CpG-ODN treatment . 



Genebank 
number 



, Regulatory protein 

Advillin AF041449 

i 

Channel and transport . 

Small GTP binding protein Rab9 U44103 

FXYD domain-containing ion transport All 25364 
regulator 1 

Signal transduction 

SH3-domain Grb2-like 2 AF036268 

Titin X90568 

Titin associated protein |165kD protein) X69089 

KIAA1451 protein A B 04 0884 

Vasoactive intestinal peptide receptor U1 1087 

Enzyme and protease 

En oy l-Coenzyme A, hydratase AI800553 

Nephrin, B-type metallocarboxypeptidase U65090 

Propionyl Coenzyme A carboxylase AB01 1 1 45 

Acyloxyacyl hydrolase (neutrophil) M62840 

Cytochrome P450 U79716 

Intestinal alkaline phosphatase M31008 

Tumor progression and cell differentiation 

. Retinoblastoma 1 L1 1910 
Human genomic DNA of 9q32 anti-oncogene AB036268 
of flat Epitherium cancer, 
segment 6/10' 

WNT1 inducible signaling pathway protein 2 AF 100780 
(WISP-2) 

Structure protein 

Collagen, type IV, alpha 6 D21337 

Beta Myosin heavy chain M58018 

Other 

Homo sapiens Cri-du-chat region mRNA, U52827 
clone NIBB11 

Human mRNA for laminin alpha 5 chain, AB010099 
partial cds, 

NIK like and Thyroxin-binding Z83S50 

globulin precursor 

Hypothetical protein DKF2p434M0331 AL137720 
Hypothetical protein FLJ1 1021 similar j AK023985 

to splicing factor 

Hypothetical protein AL049851 

Chromosome 18 open reading frame 1 NM_004338 

Arfaptin 1 AW408785 

Zinc finger protein 8 (ZFP8) M29581 

Zinc finger protein 137 {clone pHZ-30) U09414 

Olfactomedin related ER localized protein AI738468 

Cyclin-de pendent kinase 8 BE467537 

Integrin, alpha 1 D87462 

KIAA0421 protein AB007881 

KIAA1233 protein AB033059 

Unnamed protein product AK02636? 

N BL4 X75535 

BC331191_1 AAD39268 



Gene name and description 


' Genebank 


(changed f old > 2) 


number 


Ztnc-finger horneo domain protein 4 . 


BAB03600 


Human protein kinase MEKK2b mRNA, 


AF239798 


complete cds. . 




Glypican 5 


U66033 


Human genomic DNA, chromosome 22qli.2,. 


AP000362 


clone N75A12. 




Collagen, type 1, alpha 1 


274615 



Longer stimulation of THP^l ceils with CpQrODN (8h) 
resulted in the up-regulation of 58 genes. These genes in- 
cluded notably IL-10 receptor oeta, formyi peptide receptor- 
like 1 (FPRL1), vitamin D receptor, nuclear receptor sub- 
family 1 (IjcR), early B-ceil factor; protein kinase.C gamma 
(PKCJ, Nek, Ash, phospholipase C binding protein (NAP4), 
phosphoriboxyl pyrophosphate amidotrarisf erase, dishev- 
eled 3, ' WISP-2 etc. Analysis of the functions of the 58 up- 
regulated genes showed that they are associated with anti- 
inflammation, transcriptional regulation, intracellular signal 
transduction, tumor progression, ceil differentiation, pro- 
teolysis, neurodegeneration, neuroprotection etc. (Table 3).. 
We also found that the stimulation of THP-1 cells with CpG- 
ODN for different periods of times resulted in different pro- 
files. Several defense related genes such as 1H8 receptor 
accessory protein, Pio-Pol-dUTPase polyprotein, Spl40 and 
connexin 59 were transiently up-regulated at 2h short stim- 

Table 3. Genes op-regulated (> 2 fold) in human THP-t cells after 
8 h of CpG-ODN treatment 



Gene name and description 
(changed fold > 2} 



Genebank 
number 



Inflammation and receptor 

IL-10 receptor (beta) - U 08988 

Formyi peptide receptor-like 1 (FPRL1) AF081535 

Vitamin D receptor J03258 

NMDAR1 Z32774 

CD44 antigen AW028346 

Nuclear receptor subfamily 1 (LxR) NM_005693 

Neuromedin B receptor M73482 

Transcriptional regulation 

Early B-cell factor febf) AF208502 

Neurogenic differentiation 1 (Neuro D) AB018693 

MAX dimerization protein (NESH protein) AB037886 

Ribosomal protein S6 kinase AF090421 

ASH2L AB022785 

Regulatory protein 

Hypothetical protein DKFZp434H0820 AL137555 

Peroxisomal far nesy fated protein X75535 

LTBP4 AF051344 

Neuronal pentraxin II U29195 
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Table 3. Continued 



Gene name and description 
(changed fold > 2) 



Table 4. List of antimicrobial and anti-inflammatory genes 
modulated by CpG-ODN treatment of THP-1 cells 



Structure protein 
. Trichohyalin 
(Certain 

Other 

Human' transferrin pseudogene 
TIMP-2| 

Collagen-like protein 

Human genomic DNA, chromosome 21q, 

section 60/1 OS' 
Human genomic DMA, chromosome 21 q, 

section 64/105 
KIAA0136 
KIAA0379 
K1AA0489 

KIAA1114 ' 

KIAA1451 

KIAA0756 

Zinc finger protein 267 

Hypothetical protein FLJ10633 

Hypothetical protein EUROIMAGE 1955967 

Mybl homolog like 1 

Antizyme inhibitor 

Disintegrin-like and metalloprotease 

treprplysin type) with Thrombospondin 

type il motif, 3 
ADP-ribosylation factor 3 (ARF-3J 
Testis specific protein, Y-linked 
Unnamed protein 



Genebank 
number 



Channel and transport 
Gamma-aminobutyric !61] A receptor 
ATP synthase subunit F6 * ' 

Transient receptor potential channel 1 
Choroideremia (Rab escort protein 1) 

Signal transduction 
Protein kinase C r gamma 
Regulator of G-protein signaling 5 
Nek, Ash and phospholipase C binding . 
protein (NAP4) 

Highly similar to adeylate kinase gene 

I 

Enzyme and protease 
Phenylalanine hydroxylase 
Carboxypeptidase AT 
Xylulokinase 
Pancreatic lipase 
Ubiquitin specific protease 12 
Transmembrane protease, serine2 
Aspartate bata-hydroxylase 
Phosphoribosyl pyrophosphate 
amidotransferase 

Tumor progression and cell differentiation 

CDC23 1 
WISP-2 

Microseminoprotein, beta 
Dishevelled 3 



NM_004961 
M37104 
Z73903 
X57637 

Z15114 

AI674877 
AB005216 

AB016886 

AA203389 

X67318 

AK001205 

J05125 

AF022789 

U75329 

U03109 

D13757 



AF053977 
AF100780 
M34376 
NM_004423 

L09190 
AF061809 

M22376 
U44383 
U67921 
AP001716 

AP001720 

D50926 

AB002377 

AB007958 

AL049732 

AB040884 

AB018299 

AF220492 

AK001495 

AK026108 

AK001893 

D88674 

AB002364 



M74491 
M98525 
AK026042 



Gene name 


Genebank 


Expression fold 




number 


2h 








8h 


i^onnexin yerre 




2.12 ±0.05 


1.74 ±0.08 


IL-18 receptor 


X58806 


2.32 ± 0.21 


1.19 ±0.13 


accessory protein 








Integrrn, alpha 1 


X68742 


2.01 ± 0.03 


1.05 ±0.10 


Nuclear body protein 


U63420 


2.22 + 0.11 


139 ±0.04 


Sp140 








Pro-Pol-dUTPase 


AC004748 


2.33 ± 0.18 


0.95 ±0.03 


polyprotein 








Thioredoxin 


NM_003329 


2.20 ±0.08 


1.07 ± 0.01 


FPRL1 


AF081535 


0.95 ± 0.06 


2.13 ±0.11 


IL-10 receptor 


U08988 


1^2 ± 0.18 


2.21 ± 0.07 


LxR 


NM_005693 


0.90 ± 0.03 


2.37 ±0.31 


Vitamin D receptor 


J03258 


1.37 ±0.23 


2.39 ±0.11 



Expression fold is designated as the ratio of CpG-ODN treated 
over control 



illation but were down-iegulated thereafter, while anti- 
. inflammatory associated genes such as FPRL1, IL10 recep- 
tor, vitamin D receptor and LxR were up-regulated after 8h 
stimulation (Tables 3, 3 and 4). 

3.2 Verification of the microarray results with RT-PCR 
or Western blotting 

To verify the results from the microarray analysis,, we also 
performed RT-PCR on the up-regulated genes (Fig. 3). Con- 
sistent with results obtained in the miaoarray gene expres- 
sion analysis, RT-PCR studies showed that the mRNA levels 
of some selected genes, including ubiquitin specific pro- 
tease 12, regulator of G-protein signaling 5, NAP4 and 
ASHZL, were increased in response to CpG-ODN (Fig. 1, 
Table 3). In addition, the protein expression level of ARF-3 

CpG-ODN 



GpC-ODN 



Untreated 



8 



Ubtquibn specific 
protease 12 ■ I 
Regulator of G- j 
protein signaling 5 j 

NAP4 



Figure 1. Induction of various genes by CpG-ODN. THP1 cells 
were stimulated with medium alone, 1.5 pM GpC-ODN (as the 
negative control) or CpG-ODN for the indicated times.- RT-PCR 
was then performed to analyze gene expression levels, p-actin 
was used 3s an internal control. The experiment was repeated 
three times with similar results. 
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Figure Z. CpG-ODN induced ARF-3 protein expression in human 
THP-1 or mouse RAW264.7 cells. THP-1 (A> or mouse RAW264.7 
IB) cells were incubated with 1.5 pM CpG-ODN for the indicated 
time points. The protein expression level of ARF-3 was deter- 
mined by Western blotting of cell extracts using anti-ARF-3 anti- 
body. The experiment was repeated three times with similar 
results. 

was shown to increase in Western blotting analysis in cell 
lysates from THP-1 cells treated with CpG-ODN for 8-24 h 
(Fig. 2A). Similar studies showed that the ARF-3 pr6tein was 
also induced by mouse specific CpG-ODN in other TLR9 
expression cell lines such as the mouse macrophage 
RAW264.7 cell line (Fig 2B), 

3.3 Proteins regulated in CpG-ODN stimulated THP-1 
cells 

To further assess whether there was any correlation between 
regulation of gene expression and expression of cellular pro- 
teins, a proteomic approach was adopted to identify protein 
expression profiles. THP-1 cells were treated with CpG-ODN 
for defined times (from 8 to 40h), and their cytoplasmic 
proteins were extracted for 2-DE analysis. Although the use 
of high concentrations of urea might give us a broader view 
of all the proteins affected by CpG-ODN, preliminary results 
from 2-D gels showed that the resolution of the protein mix- 
tures were not satisfactory. To improve and get the best reso- 
lution from 2-DE, total proteins were roughly separated into 
supernatant and precipitated fractions using 50% saturated 
ammonium sulfate solution. To remove salts and other con- 
taminants, both protein fractions were precipitated with TCA 
solution and then subjected to 2-DE. By protein spot deter- 
mination analysis, about 500 and 450 well-resolved spots 
were observed on each pH 3.0-30.0 gel for precipitated or 
supernatant fractions, respectively. Comparative analysis of 
2-DE between treatments and control showed that the inten- 
sities of the protein spots from the ammonium sulfate pre- 
cipitated fraction did not change, while several protein spots 
were up-regulated by at least -80% in the supernatant frac- 
tion of 8h CpG-ODN stimulated THP-1 cells (Fig. 3). 

The protein spots were individually excised from gels for 
further identification. After trypsin digestion, several protein 
spots were identified without ambiguity by MS MALD1- 
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Figure 3. The* effect of CpG-ODN on the 2-DE profile of THP-1 
cells. THP-1 cells were treated with or without 1.5 mm. CpG-ODN 
for 8h. Total proteins were extracted and roughly separated into 
two fractions by 50% saturation with ammonium sulfate. The 
supernatant fraction was then separated by 2-DE. Protein spots 
were visualized by SYPRO Ruby staining. Comparison of CpG- 
ODN treated THP-1 cells to untreated cells showed that these 
proteins changed in intensity by over 80%. Protein spots were 
identified by trypsin digestion and MS. Localization of protein 
spots 1 ienoly-coehzyme A hydratase), 2 Iproteasome a) and 3 
tcyclophifin A; two isoforms) are shown. The experiment was 
repeated three times with similar results. 

TOF. These proteins included HSP60, HSP90, cyclophi- 
lin A, enoyl-coenzyme A hydratase, eukaryotic translation 
elongation factor, proteasome a and p chain and ATP syn- 
thase beta chain (Table 5). Similar experiments on cells 
treated for a longer period of time with CpG-ODN stimu- 
lation (25 h) revealed that 27 protein spots were changed in 
intensity by at least -80%. These protein spots contained 
members of HSPs (HSP27, hsc70, grp78 and grp94), 
metabolic enzymes (pbosphoglycerate kinase (PGK) and 
pyruvate kinase (PYK)), macrophage capping protein and 
cyclophilin A (Table 6). Among these proteins, macrophage 
capping protein, PGK, PYK, cyclophilin A and HSP27 
(Figs. 4 and 5A) were found to be up-regulated. Interest- 
ingly, we found that a truncated form of grp78 with an 
expected mass of 25 kDa and pJ of 5.3 was up-regulated 
while grp78 itself was down- regulated. A similar situation 
was also found for grp94 and hsc70 and their truncated 
derivatives (Table 7 and Fig. 5). Jn addition, we also ob- 
served six down-regul3ted pTOtein spots on 2-D gels in 
samples after 25 h CpG-ODN treatment. Among these six 
proteins, we have successfully identified three as 40s ribo- 
somal protein SA, grp78 and hsc70, respectively (Table 6), 
while the other three, due to their relative low abundance, 
have not been identified yet. 
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Table 5. List of proteins modulated by 8 h CpG-ODN treatment 



Proteinname 


Accession 






Matched 


Cover- 


Score 


Expression 




no. 


(theor.) 


(theor.) 


no J 


so^ 




fold 


ATP synthase beta-chain 


gi1 14549 


56 525 


5.26 


16 


58 


171 


2.33 ± 0.06 


Cyclophilin A 


P05092 


17 870 


7.82 


5 


35 , 


62 


2.85 ± 0.13 


Enoyl-Goenzyme A 


^4503447^ 35 971 


6.61 


11 


44 


62 


2.52±Q.05 


hydrstase 
















Eukaryotic translation 


gi4503481 


50 087 


6.25 . 


6 


25 


68 


3.41 ± 0.21: 


elongation factor 
















HSP60 


P10809 6 * 


57 963 


5.24 


13 


27 


76 


2.78 ± 0.03 


HSP90^beta 


P08238 


83133 


4.97 


1° 


18 


65 ' 


2.36 ±0.10 


Proteasome a chain 


g>4506181 


25 882 


6.92 


11 


59 


80 


Z52 ±,0.11./- 


Proteasome 0 chain 


gi4506193 


26 472 


8,27 


9 


46 


84 


3.85 ± 0.17 



Expression fold is designated as the ratio of CpG-ODN treated over control 

a) , NCBI accession number 

b) Swiss-Prot accession number 



Table 6. List of proteins modulated by 16 and 25 h CpG-ODN treatment 



Protein name 



Swiss-Prot H P' Match- Cover- Score 
no: (theor.) (theor.) ed no. age% . 



Expression fold 



25 h 



16h 



Cyclophilin A 


P05092 


17 


870 


7.82 


5 


35 


62 


2.53 


± 


0.02 


2.48 


± 


0.15 


78 kDa glucose regulat- 


P11021 


72 


288 


5.07 


13 


30 


148 


0.61 


± 


0.04 


0.29 


+ 


0.06 


ed protein (grp78) 












' 
















HSP27 


P04792 


22 


768 


5.98 


12 


39 


124 


1.00 




0.01 


2.61 




0.12 


Heat shock cognate 


P11142 


70 


854. 


5.37 


16 


34 


114 


0.64 


± 


0.01 


0.31 




0.03 


70kDa protein 




























thsc70) 




























Macrophage capping 


P40121 


38 


494 


5.88 


9 30 


58 


1.31 




0.01 


2.58 




0.03 


protein 




























Phosphoglycerate 


P00558 


44 


284 


7052 


11 33 


7i 


2.32 




0.11 


4.23 




0.19 


kinase 




























Pyruvate kinase 


P14618 


57 


710 


7.95 


17 32 


114 


1.65 


± 


0.07 


2.70 


+ 


0.12 


40s ribosomal protein 


P08865 


32 


833 


4.79 


5 23 


61 


0.33 




0.02 


0.35 




0.05 



SA {RSP40) 



Table 7- List of truncated proteins detected in THP-1 cells after 25 h CpG-ODN treatment 



Protein name 



94 kDa glucose-regulated P14625 

protein (grp94) 
Truncated form of grp78 Pi 1021 



Truncated form of hsc70 
Truncated form of hsc70 



P11142 
P11142 



(obs.) 


P/ 

(obs.) 


Matched 
no. 


Cover- 
age% 


Score 


Expression 
fold 


j -59 700 


-5.00 


i 


14 


104 


New 3 ' 


-25 000 
-22000 
-19000 


-5.30 
-5.80 
-6.10 


12 

i 

12 

i 

11 


22 
20 
18 


75 
96 
113 


New 
New 
New 



d) New designated proteins detected in the CpG-ODN treated gel but not in Ihe corresponding control gel 



901 



3.4 Comparison of microarray and proteomic results 

Table 8 shows the expression of six genes and their corre- 
sponding proteins that were modulated by 8h treatment of 
THPT cells with CpG-ODN, Besides enoyl- coenzyme A 
hydratase, there was poor correlation between the expression 



of genes and their corresponding proteins (Table 8), sug- 
gesting that more in-depth studies were needed. To further 
evaluate whether changes observed in protein expression 
correlated with changes in mRNA levels, we randomly chose 
two proteins (PGK and PYK) that were induced after I6h 
CpG-ODN treatment and determined their mRNA levels by 
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Figure 4. 2-D gel elect roph or etic analysis of CpG-ODN-treated 
THP-1 cells. IA) Total eel! protein from unstimulated THP-1 cells 
was subjected to 2-DE. (B) THP-1 cells were treated with or with^ 
out 1.5 p M CpG-ODN for defined times. Cellular proteins were 
extracted and separated by 2-DE. Several up-regulated proteins 
are shown in the SYPRO Ruby staining gel. Comparison of CpG- 
ODN treated. THP-1 cells to untreated cells showed that these 
proteins changed in intensity by over 80%. The experiment was 
repeated three times with similar results. 



'27 KDa 




66kDa 



Figure 5. ?-D gel electrophoretic analysis of CpG ODN-treated 
THP-1 cells. THP-1 cells were treated with or without 1.5 M m CpG- 
ODN for defined times. Cellular proteins were, extracted and 
separated by 2-DE. Protein spots were detected by SYPRO Ruby 
staining. |A| Expression of HSP27 was induced by increasing the 
period of CpG-ODN stimulation. A truncated form of hsc70 was 
detected on the gel. (B) The native form of grp78 was detected in 
untreated cells, while the truncated form of 9 rp78 was observed 
after 25 h stimulation. The experiment was repeated three times 
with similar results. 
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Table a Comparison of gene and protein expression levels in 
THP-1 cells after 8 h CpG-ODN treatment 



protein name 


Gene 


Protein 




expression 


expression 




fold from 


fold from 




microarray 


2-D gel 


Enoyl-Coenzyme A hydrolase 


1.72 ± 0.31 


2.52 ±0.05 


Eukaryotic translation 


0.95 ±0.01 


3;4i±0.21 


elongation factor 




BSP60 


0.94 ± 0.07 


?.78 ± 0.03 


HSP90-beta 


1^58 ±0.11 


2.38 ±0.10 


Proteasome a chain 


0.98 ± 0.03 


2.52 ±0.11 


Proteasome 0 chain 


1.09 ±0.11 


3.85 ± 0.17 



RT-PCR. Our results showed that mRNA levels of PYK 
increased after 16 h Cji^ODN treatment, while mRNA 
levels of PGK were dramatically increased after 24 h stimu- 
lation (Kg. 6). In addition, we also performed enzyme activ- 
ity analysis and found that the activity of PGK and PYK were 
3.5 




Time (h) 




n 2 4 8 1$ 24 

Time (h) 

Figure 6. Activities of PYK and PGK induced by CpG-ODN. THP1 
cells were stimulated with medium alone, 1.5 pM GpC-ODN (as 
the negative control) or CpG-ODN for the indicated times. Cell 
lysotes were extracted and assayed for (A) PYK and ID) PGK ac- 
tivities. Data represent mean it SEM. (n = 3). 



© 2005 WILEY- VCH Verlag GmbH & Co. KGaA, Weinheim 



www.proteomics-jomnDl.de 



Best Available Copy 



Proteomics 2005, 5, 894^906 

GpC-ODN 



CpG-ODN 



Pyruvate kmaso 



0 2 8 tt 24 2 4 8 1$ 24h 
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Figure 7. Induction of PYK and PGK transcripts by CpG-ODN.. 
THP1 cells were stimulated with medium alone, 1.5 jim GpC-ODN 
or CpG-ODN for the indicated. times. RT-PCR was then performed 
to anah/ze gene expression. f^actin was used as an internal con- 
trol. The experiment was repeated three times with similar 
results. 



indeed increased by a factor —2.5 after CpG-ODN stimula- 
tion (Kg. 7). To confirm that mRNA induced by CpG-ODN 
would also be accompanied by an increase in protein 
expression even though it was not detected in 2-D gel analy- 
sis, we used more sensitive and specific Western blotting 
analysis. As shown, in Table 3. and Fig. 2, ARF-3 was identi- 
fied in the microarray gene profile but not in the proteomic 
expression profile. Nevertheless, we observed enhanced pro- 
tein expression of ARF-3 after CpG-ODN stimulation by 
Western blotting. Moreover, to investigate whether the up- 
regulation of ARF-3 by CpG-ODN is mediated through the 
TLR9 pathway, TLR9-deficient HEK293 cells were transiently 
cotransfected with hTLR9 and luciferase-reporter gene driv- 
en by a NF-xB-dependent promoter. Our data showed that 
NF-xB activity of untransfected HEK293 cells were not 
responsive to CpG-ODN stimulation, while in transfected 
HEK293 cells expressing hTLR9, NF-xB luciferase activity 
was up-regulated 12-fold after 8h CpG-ODN stimulation. 
The activation of NF-xB induced by CpG-ODN was blocked 
by. pretreatment of the transfected cells with an ARF-3 in- 
hibitor, such as brefeldinA (Fig. 8), suggesting CpG ODN' 
induces ARF-3 and activates NF-kB after the interaction of 
CpG-ODN with TLR9. 



4 Discussion 

In this study both microarray and proteomic approaches 
were used to evaluate the effect of CpG-ODN on gene/pro- 
tein expression profiles of THP-1 cells at several time points. 
Comparison of the gene expression profiles showed that 
stimulation of the cells with CpG-ODN for different periods 
of time resulted in different profiles (Tables M). The differ- 
ences in mRNA expression between the cells with short and 
long stimulation could be attributable to the low reproduci- 
bility. However, to avoid experimental variations, we not only 
used the same batch of microarrays from the same manu- 
facturer but also applied the samples of short and long term 
stimulation at the same time. In this way, we found that the 
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changes in expression fold of mRNA after CpG-ODN treat- 
ment were quite reproducible as shown by their mean 
±. SEM (Table 4). A more likely explanation for the difference 
in the expression level of mRNA after different periods of 
stimulation with CpG-ODN is that the transient increase or 
decrease in these mRNA by CpG-ODN plays a significant 
role in ovulating biological functions. For example, we 
found that the IL-18 receptor accessoiy protein from THP-1 
cells was up-regulated after 2h of CpG-ODN stimulation. 
The lLrl8/lU8R system is known to activate Thl-mediated 
immune responses, that play a critical role in host defense 
against infection [26]. Together with IL-I8/IL18 R, several 
genes for antimicrobial defense were also increased, includ- 
ing thioredoxin, Pro-Pol-dUTPase poryproteih and Spl40. 
After 8h of CpG-ODN stimulation, however, none of these 
genes was activated any more (Table 4); Since sustained or 
excessive production of these antimicrobial molecules might 
lead to inflammation and cellular damage [27], a plausible 
explanation is that THP-1 cells fight against the invasion of 
pathogens by up-regulating antimicrobial defense-associated 
genes at an early stage of stimulation and then shut them 
down to avoid over-activation. Whether this explanation is . 
true remains to be verified- 

It is noteworthy that our data also identified the up-reg- 
ulation of several anti-inflammatory associated genes after 
8h of CpG-ODN stimulation. These genes included FPRL1, 
1L-10 receptor, vitamin D receptor, and LxR (Table 3). FPR 
and FPRL1 have been defined as chemotactic factors 
involved in host defense against bacterial infection and in the 
clearance of damaged cells. Additional studies have indicated 
that FRPL1 interacts with a menagerie of structurally diverse 
pro- and anti-inflammatory ligands associated with diseases, 
including amyloidosis, Alzheimer's diseases, prion disease 
and HIV |28, 29]. Therefore, FPRL1 may play an important 
rote in regulating and/or balancing the production of pro- 
and anti-inflammatory molecules in CpG stimulated THP-3 
• cells. Additionally a recent study has demonstrated that LxRs 
and their ligands act as negative regulators of macrophage 
inflammatory gene expression and inhibit the expression of 
inflammatory mediators such as inducible nitric oxide syn- 
thase, cyclooxygenase and 1L-6 in response to bacterial 
infection or LPS stimulation [30]. Of interest, we found that a 
transcription factor gene connexin 59, a regulator of JL-6 
expression, was up-regulated after 2h of CpG-ODN stimula- 
tion. It is thus likely that CpG-ODN stimulation of THP-1 
cells for 2h may induce the expression of the pro-inflamma- 
tory cytokine 1L-6 through the up-regulatipn of the connexin 
59 gene, while 8h of CpG-ODN treatment may counter- 
balance the initial inflammatory response by inducing LxR to 
inhibit 1L-6 production. 

Signal transduction molecules play an important role in 
cellular activation- Intracellular signal transduction systems 
employing various intermolecular interactions through 
docking elements, including SH2 and SH3 domains, have 
been reported |3l-33). Here we found that THP-1 cells 
treated with CpG-ODN for 2 h up-regulated gene expression 
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of Grb2-like protein (which contains an SH3 domain), while 
8h of stimulation induced Nek, Ash and phospholipase C 
binding protein (NAP4 which contains an SH2 domain). It is 
thus possible that Crb2-Iike protein and NAP4 may play 
important roles in CpG-ODN mediated signaling pathways. 
Furthermore, recent studies have also revealed that binding 
of CpG-DNA to TLR9 results in activation of JNK [34). Since 
JNK is activated by Nek adaptor protein and Nek interacting 
kinase [35, 36], it is possible that CpG-ODN may activate JN K 
via up-regulation of NAP4. Although a recent publication 
described the gene expression profiles of a cultured mouse 
macrophage cell line after CpG-DNA stimulation [34], their 
miaroarray results were only conducted at one time-point 
(6h stimulation). Moreover, they did not report the meas- 
urement of protein expression profiles in response to CpG 
ODN stimulation. 

Comparison of the gene and protein expression profiles 
showed that there was discordance between mRNA and pro- 
tein levels (Table 8). Similar discordance between the 
expression pattern of genes and proteins was also reported in 
other system using diiTerent stimuli [37-41]. The dis- 
cordance between mRNA and protein levels could be due to 
screening capability such as detection sensitivity, choice of 
cut-off point, quantitativity of microarray and 2-D. gels, as 
well as time discrepancy between gene and protein expres- 
sion [39, 40, 42, 43]. Alternatively, it could also be explained 
by post-trancriptional events, such as alternative splicing or 
PTM [39, 40, 42, 43]. Another possible explanation is that 
most of the spots observed in the 2-D gels are isoforms of 
some proteins. The intensity of each spots does not neces- 
sary represent total amount of a certain protein and thus 
does not correlate with its mRNA level. Our finding that 
microarray results correlated better with Western blotting 
results (eg., ARF-3 in Fig. 2), an approach more suitable than 
2-D gels for determining, the total amount rather than iso- 
forms of a given protein, seems to suggest that formation of 
isoforms should be carefully taken into consideration when 
one tries to correlate mRNA and protein expression data. 

Using a pioteomics approach, we found that CpG-ODN 
treatment up-regulated the expression of many proteins 
including HSPs, metabobc enzymes, structural proteins, as 
well as macrophage capping protein, cyclophibn and pTotea- 
some a arid P chain etc. HSPs are the most aWndant and 
ubiquitous soluble intracellular proteins. They are up-regu- 
lated by various stressors including temperature, glucose 
deprivation, microbial infection and. cancer [44]. They func- 
tion as molecular chaperones to prevent protein aggregation 
and contribute to the folding of nascent and altered proteins. 
In addition, they are able to regulate immune responses, 
including production of inflammatory cytokines and che- 
mokines and activation or maturation of immune cells (45, 
46). Beside HSPs, cydophilin as well as proteasome a and p 
chain have also been reported to be involved in the immune 
response |39, 47]; proteasome p chain is consistently up- 
Tcgulated in human neutrophils following LPS exposure (39). 
Our finding that the protein levels of HSPs, cydophilin, and 
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proteasome a and 0 chain were increased after CpG-ODN 
treatment suggests that these molecules might play a role in 
the immunostimulating effect of CpG-ODN. To what extent 
these proteins contribute to the immune responses of the 
cells to CpG-ODN is currently under study Proteomk anal- 
ysis also showed that truncated forms of grj>78, grp94 and 
hsc70 were induced, a phenomenon similar to calreticuHn. 
observed by Richards and his coworkers [48]. The expression 
of full length hsc70 and grp78 were decreased, while the 
levels of their truncated derivatives was increased after CpG- 
QDN treatment. These- results suggest that the degradation 
of these proteins has been enhanced We also found that 
proteosorrie a and p chains as well as ubiquitin specific pro- 
tease 32 were increased by CpG-ODN. Whether these 
enzymes or other enzymes were responsible for the genera- 
tion of truncated hsc70 and grp78 remains to be elucidatedL 
Cells rely on multiple signaling pathways to determine 
their fates of survival, proliferation or apoptosis J49J. In fact, 
apoptosis plays an important role in regulating pathogen 
infection. To be able to grow and replicate in the target cells, 
pathogens may have to block apoptosis. Results from several 
laboratories have made it clear that HSP70 and HSP27 pro- 
tect cells not only from heat, but also from most apoptoric 
stimuli |48, 50} by binding to Alct and subsequently mediat- 
ing anti-apoptoric activity through activation of Alct |51-53]. 
Since our data revealed that CpG-ODN induced the expres- 
sion of HSP90 and HSP27. it is possible that CpG-ODN 
might prevent apoptosis by up-regulation of HSPs. 

Interestingly, our microarray data also showed that CpG- 
ODN mediated the induction of a set of genes associated 
with tumor progression and cell proliferation. Among these, 
one gene, W1SP-2, was up-regulated by CpG-ODN after both 
2 and 8h stimulation. WISP genes were first identified as 
downstream targets of the WnMp-catemn signaling path- 
way. They belong to the CCN family of growth factors that 
have been receiving increasing attention lately due to some 
of the family members having been reported to be involved 
in angiogenesis and tumorigenesis (54J. It would be inter- 
esting to evaluate whether CpG-ODN plays a role in angio- 
genesis and tumorigenesis by regulating WJSP-2. In addi- 
tion, we found that some genes associated, with neurode- 
generation or neuroprotection, such as FPRLl, NMDAR 
(NMDA) receptor, PKC and dishevelled 3 were up-regulated. 
To our knowledge, this is the first report to suggest an asso-" 
ciation between these genes and CpG-ODN stimulation. As 
mentioned above, FPRLl plays a crucial role in proin- 
flammatory aspects of systemic amyloidsis and neurodegen- 
erative disease such as Alzheimer's disease and prion disease 
(28). NMDAR, PKC and disheveled are involved in modulat- 
ing amyloid precursor protein metabolism, which is central 
to the pathogenesis of Alzheimer's disease [55-57]. Most 
notably, recent studies have shown that theTLR4-dependent 
pathway is involved in netuodegeneration of the central 
nervous system (58). Whether CpG-ODN moieties of patho- 
gens play any role in neurodegenerative diseases such as 
Alzheimer's remains to be elucidated. • 
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Figure 8. ARF-3 participates in the CpG-ODN-TLR9-NF-x:B path- 
way. HEK293 cells were cotransfected with p5xNF-xB and human 
TLR9. After overnight transection, the cells were incubated with 
or without 1.5 mm CpG-ODN for 8 h in the presence or absence of 
increasing concentrations of the ARF-3 inhibitor brefeldiri A. 
After incubation, cells were lysed and IMF-xB lucif erase activity 
was measured. Data represent meari ± SEM. in = 3). 



Exposure of ceils to LPS or microbial infection has been 
known to induce several genes encoding metabolic enzyme 
1 34, 39]., Our microarray data also revealed that a large number 
of genes encoding proteins involved in energy synthesis and 
fatty acid oxidation, such as enoyl-coenzyme A hydratase, 
propionyl coenzyme A carboxylase and cytochrome p450 . 
were activated by CpG-ODN treatment. In addition, we found 
that other proteins such as ARF-3 were up-regulated (Table 3, 
Fig. 8). ARFs are 20kDa GTPases of the ras superfamily that 
are critical to vesicular trafficking, including exocytic protein 
transport and endocytosis (59, 60]. This study demonstrates 
for the first time that ARF-3 is involved in the activation of N F- 
kB induced by CpG-ODN (as shown in Fig. 8) 

CpG-DNA/ODN has been shown to elicit primarily 
responses via the 7TR9/MyD88 dependent pathway (18, 21, 
22]. Chromosome location analysis showed that instead of 
localizing on one or two chromosome, the genes/proteins 
modulated by CpG-ODN stimulation are scattered on all 
chromosomes except chromosomes 23 and 24. These results 
seem to suggest that CpG-ODN either affects multiple chro- 
mosomes simultaneously or subsequently via cascades of 
cellular messengers. More studies are needed to elucidate its 
mechanism of actions. 



5 Concluding remarks 

In summary, by using microarray and proteorruc approaches 
to evaluate the effect of CpG-ODN at different time points, 
we have found that genes/proteins regulated by CpG-ODN 



905 



are related to mflaramatory responses, antimicrobial defense, 
transcriptional regulation, intracellular signal transduction, 
tumor progression, cell differentiation, proteolysis, anti-apop- 
tosis as well as neurodegeneration and neuroprotection. Our 
results may help delineate the CpG-ODN mediated j^athway 
and contribute to further understanding of mechanisms that 
link innate immunity with acquired immune response(s). 
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Quantification of CK20 gene and protein expression in colorectal cancer 
by RT-PCR and immunoliistocuemistry reveals inter- aud intratumour 
heterogeneity. 

jLassmannS, BauerM, SoongR, Schreglmann J . Tahirijg; NahrigJ Ruger R. 
Hofler H T Werner M. b ^ 

Pathologisches Institut, Uriversitatsklinikum Freiburg, Albertstrasse 19, 79104 Freiburg 
; Germany, lassmaon@ukl.uni-freiburg.de 

Cytokeratin 20 (CK20) is an epithelial protein expressed almost exclusively in the 
gastrointestinal (GI) tract and is widely used as immunohistochemical marker for routine 
diagnosis. In contrast, CK20 gene expression is not an established marker for the 
classification of tumours and the detection of disseminated cancer cells in colorectal 
cancer. Recently, real-time reverse transcriptase polymerase chain reaction (RT-PCR) has 
provided the means for reproducible and quantitative investigation of molecular markers 
This report directly compares CK20 mRNA and protein expression in serial sections of ' 
archival, formalin-fixed, paraffin-embedded (FFPE) colorectal adenocarcinomas CK20 
expression was detected by immunohistochemistry (IHC) in 60/63 (95 2%) cases bv 
conventional RT-PCR in 58/60 (96.7%) and by quantitative RT-PCR using the ' 

JonfZn !ol LlghtC) ; Cler iS a trademark of a Member of the Roche Group) System in 
ZJ/52 (90.6/4) microdissected cases, one case yielding variable results. Despite the high 
detection rate of all three techniques, marked heterogeneity of CK20 expression was seen 
between different cases and also within individual cases. CK20 expression profiles were 
not related to particular histopathologic*! features of the tumours. A good correlation (r = 
0.8964) was found between CK20 mRNA and protein expression by comparing 
quantitative RT-PCR with IHC in 32 cases. This was also true for selected heterogeneous 
^™ L individual cases. Both RT-PCR and IHC are therefore valuable tools 

tor CK20 detection m colorectal adenocarcinoma, with real-time RT-PCR providing 
supplementary quantitative information. This suggests a promising supportive role for 
quantitative RT-PCR in molecular pathology. Copyright 2002 John Wiley & Sons, Ltd. 
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Expression of the multidrug resistance-associated protein (MRP) mRNA 
and protein in normal peripheral blood and bone marrow haemopoietic 
cells. 

Lcgrand (X Perrot JY , Tang R, Stmonm G; Gurbuxani S. Zittoun R , Marie JP . 

Laboratoire de Cinetique et de Cultures Cellulaires, Hotel Dieu, Paris, France. 

We studied the expression of multidrug resistance-associated protein (MRP) in normal 
haemopoietic cells from peripheral blood and bone marrow. The MRP mRNA levels 
were estimated by RT/PCR and in situ hybridization (ISH) assay, and the protein levels 
by flow cytometry. 21 samples of peripheral blood and 21 samples of bone marrow (1 1 
normal bone marrow donors, 10 patients in complete remission after chemotherapy for 
large cell lymphoma or acute myeloid leukaemia) were analysed. In peripheral blood the 
mean MRP mRNA level in CD3+ cells was statistically higher than in the other cells (3- 
fold by the methods used). The levels of MRP in CD3+ varied from one individual to 
another (4.5-34.8 units by RT/PCR and 5-23 grains/cell by ISH); however, this was 
proportional to the variation in all the cell lineages of same individual (r = 0.84). In bone 
marrow the mean MRP levels of the various cell lineages (including CD34+) were 
similar to the .basal -level in HL60 cells. Individual expression levels were again variable; 
however, there was no difference between untreated normal bone marrow and post 
chemotherapy normal bone marrow. MRP protein expression was determined by flow 
cytometry with the monoclonal antibody MRPm6. The CD4+ lymphocytes exhibited a 
higher MRP protein expression than the other cell lineages, including CD8+ cells. There 
was a good correlation between the three methods used (RT/PCR and ISH, P "= 0.000 1 , r 
= 0.87; RT/PCR and flow cytometry, P = 0.0001, r - 0^85; ISH and flow cytometry, P = 
0.002, r = 0.67). 

PMDO: 8757504 [PubMed - indexed for MEDLINE] 
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Vascular endothelial growth factor enhances cardiac allograft 
arteriosclerosis. 

Lemstrom KB, Krebs R , Nykancn Ah Tikkanen JM , Sihvola RK Aaltola EM. 
Hayry PJ , Wood J, Alitalo K. Yta-Herttuala S, Koskinen PK . 

Cardiopulmonary Research Group, Transplantation Laboratory, University of Helsinki . 
and Helsinki University Central Hospital, Helsinki, Finland, KarLLemstrom@helsinki.fi 

BACKGROUND: Cardiac allograft arteriosclerosis is a complex process of alloimmune 
response, chronic inflammation, and smooth muscle cell proliferation that includes cross 
talk between cytokines and growth factors. METHODS AND RESULTS: Our results in 
rat cardiac allografts established alloimmune response as an alternative stimulus capable 
of inducing vascular endothelial growth factor (VEGF) mRNA and protein expression in 
cardiomyocytes arid graft-infiltrating mononuclear inflammatory cells, which suggests 
that these cells may function as a source of VEGF to the cells of coronary arteries. Linear 
regression analysis of these allografts with different stages of arteriosclerotic lesions 
revealed a strong correlation between intragraft VEGF protein expression and the 
development of intimal thickening, whereas blockade of signaling downstream of VEGF 
receptor significantly reduced arteriosclerotic lesions. In addition, in cholesterol-fed 
rabbits, intracoronary perfusion of cardiac allografts with a clinical-grade adenoviral 
vector that encoded mouse VEGF(164) enhanced the formation of arteriosclerotic 
lesions, possibly secondary to increased intragraft influx of macrophages and 
neovascularization in the intimal lesions. CONCLUSIONS: Our findings suggest a 
positive regulatory role between VEGF and coronary arteriosclerotic lesion formation in 
the allograft cytokine microenvironment 
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Enhanced expressions of arachidonic acid-sensitive tandem-pore domain 
potassium chanuels in rat experimental acute cerebral ischemia. 

MZB, Zhang MX LULL, Wang XL . 

Instituteof Materia Medica, Chinese Academy of Medical Sciences and Peking Union 
Medical College, Beijing 100050, China. 6 

To forther explore the pathophysiological significance of arachidonic acid-sensitive 
potassium channels, RT-PCR and Western blot analysis were used to investigate the 
express^ changes of TREK channels in cortex and hippocampus in rat experimental 
acute cerebral ischemia in this study. Results showed that TREK-1 and TRAAK mRNA 

SSSSZ? T d TREK 1 ^ in Wppocampus showed i32^£2?h 

ih, a .nK I ! 0CC,USi ° n (MCAO) - * e expression levels of 

the all three channel subtypes increased significandy 24 h after MCAO in cortex and 

thC SamC time ' the P rote - of all the three chale pi* 

^ signifiaant increase 24 h after MCAO in cortex and hippocampus, but o£ly 

L^r H aa T aBd CXPreSSi0n 2 h affer MCAO in ™ d hippocampus 

tonohistoch em ,c a l experiments verified that all the three channel proteins had higher 
expres ,on levels mcortical and hippocampal neurons 24 h after McXo These resuS 

TSchann S eir 8 T elat ^ n TRBK ° hanJlelS md a ° Ute CerebraI ^m7a 

TREK channels ought provide a neuroprotective mechanism in the pathological process. 
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CHAPTER 29 
Regulation of transcription 
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The phehotypic differences thai distinguish the 
various kinds of cells in a higher euharyote are 
largely due lo differences in the expression of 
genes that code for proteins, that is, those tran- 
scribed by RNA polymerase II. In principle, the 
expression of these genes might be regulated at 
any one of several stages. The concept of the 
"level of control" implies that gene expression 
is not necessarily an automatic process once it 
has begun, h could be regulated in a gene- 
specific way at any. one of several sequential 
steps. We can distinguish (al least) five poten- 
tial control points, forming the series: 

Activation of gene structure 
I 

Initiation of transcription 
I 

Processing the transcript 
I 

Transport to cytoplasm 
I 

Translation of mKJNA 

The existence of the first step is implied by 
the discovery that genes may exist in either of 
two struciural conditions. Relative to the stale 
of most of the genome, genes are found in 
3n "active" state in the .cells hi . which they 
are expressed (see Chapter 27). The change of 
strncmre is distinct from the act of transcrip- 
h on, and indicates that the gene is "transcrib- 
able." This suggests that acquisition of the 
"active'* structure must be the fust step in gene 
expression. 

Tr;m S criptioji of a m ibr. ncrivc stale is 



controlled at the stage of initiation, Aat is, by 
the interaction of RNA polymerase with its pro- 
moter. This is now becoming susceptible to 
analysis in the in vitro systems (see Chapter 
28). For most genes, this is a major control 
point; probably it is the most common level of 
regulation. 

There is at present no evidence for control 
at subsequent stages of transcription in eukary- 
otic cells, for example, via. antitermination 
mechanisms. 

The primary transcript is modified by capping 
at the 5'end, and usually also by polyadenyla- 
tion at the 3' end: Introns must be spliced out 
from the transcripts of interrupted genes. The 
marure RNA must be exported from the nucleus 
to the cytoplasm. Regulation of gene expression 
by selection of sequences at die level of nude ar 
UNA might involve any or all of these stages, 
but the one for which wc have most evidence* 
concerns changes in splicing; some genes are 
expressed by means of alternative splicing pat- 
terns whose regulation controls the type of pro- 
tein product (see Chapter 30). 

Finally, die translation of an mKNA in the cyto- 
plasm can be specifically controlled. There is little 
evidence for the employment of this mechanism in 
aduh sojuatic cells, but it does occur in some 
embryonic situations, as described in Chapter 7. 
The mechanism is presumed to involve the block- 
ing of initiation of translation of some mRNAs by 
specific prolein factors. 

But having acknowledged that control of gene 
expression cnu occur at multiple stages, and 
that production of RNA cannot inevitably be 
equaled wjiii production of . protein, it h clear 
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that the overwhelming majority of regulatory 
events occur at the initiation of transcription. 
Regulation of tissue-specific gene transcription 
lies at the heart of eukaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves to provide 



Response elements 'identify 
regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they share a promoter element that is recognized 
by a regulatory transcription factor. An element 
thai causes a gene to respond to such a factor 
is called a response element; examples are the 
HSE (heat shock response element), GRE 
(glucocorticoid response element), SRE (serum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in Table 29. j. Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the faclor 
extends for a short distance on either side of 
t 



I ■ '. Table 29.1 inducible transcription factors, bind \o% 
' response elements that identify groups of promoters 
■ or enhancers subject to coordinate control..- : .. 



Regulatory Agenl Module Consensus Faclor 

Heat shock HSE CNNGAANM1CCNNG HSTF 

Glucocorticoid GRE 7GGTACAAA7GTTC7 neceptor 

Phorbo! ester TRE TGAC7CA API 

Serum ' SRE CCATATTAGG SRF 



common control of a large number of target 
genes, and we seek to answer two questions 
about this mode of regulation: what identifies 
the common target genes to the transcription 
factor; and how is the activity of the transcrip- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



genes under common 



the consensus sequence. In promoters, the ele- 
ments are not present at fixed distances from 
the startpoint, but are usually <200 bp upstream 
of it The presence of a single element usually 
is sufficient to confer, the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in pro- 
moters or in enhancers. Some types of elements 
are typically found in ope rather than the other: 
usually an USE is foimd in a promoter, while a 
GRE is found in an enhancer. We assume that 
all response elements function by the same 
general principle. A gene is regulated by a 
sequence at the promoter or enliancer that is 
recognized by a specific protein. 17ie protein 
functions as a transcription factor needed for 
RNA polymerase to initiate. Active protein is 
available only under conditions when the gene is 
to be expressed; its absence tneans that the pro- 
moter is not activated by this particular circuit 

An example of a situation in which many 
genes are controlled by a single faclor is pro- 
vided by the heat shock response. This is com- 
mon to a wide range of prokaryotes and 
enkaryotes and involves multiple controls of 
gene expression: an increase in temperature 
turns off transcription of some genes, turns on 
transcription of the heat shock genes, and 
causes changes in the translation of mRNAs. 
The control of the heat shock genes illustrates 
the differences between prokaryouc and 
eukaryotic modes of control. In bacteria, a ne»* 
sigma factor is synthesized thai directs RNA 
polymerase boloenzyme to recognize an alter- 
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Retina! preconditioning and the induction of heat-shock protein 27. 
LiY, RothS, LgscrM, MaJX, Crosson CE . 

^^ml5 0 SlA lm0l0Ky ' Medical Univmit ^ of South Caroiina ' Char,eston ' Sou * 



l^?n a • °5 ,SChemia have been shown 10 P rotect ^e retina from 

potential y damagmg penods of ischemia. This phenomenon has been teJed ischemic 

Z^TT S T SC ^ m V° leraaCe - Inthe P rese "^tudy the cellular changes Ss 
of heat shock protem (Hsp)27, -70, and -90 mRNA and expression of Hsp in th^eS na 
assorted w,th ischemic preconditioning were evaluated. METHODS: U^ShST 

mel T*P S < r ,ed "i™**™* - d Sprague-Dawley nits for 5 mihute^Sf 
;.^en left for 1 hour to 7 days, to allow the retina to reperfuse. Retinas were dissected [he 
mRNA and protem isolated, and Northern and Western blot analyses conduc ZTdt^t 
changes m expression of Hsp27, -70, and -90. Immunochemical studt were uSd'o 
identify retmal reg,ons where Hsp changes occurred. Selected animals were sub £ted to 
a second .schema event, 60 minutes in duration, to correlate the ch^eT^exprSon of 
Hsp wxth functional protection of the retina from ischemic injury. RESULTS nT^nrrol 
and sha^-treated animals retinal Hsp27, -70, and -90 mRNAs Zr^etectable Five 

ev2 at' S T P 7 C r dlt r ng> ,eVdS ° f Hsp2? ™™ A were e, ™ted above control 
levels, and 24 hours later, mRNA levels increased 200% over basal levels Hsp27 

e . lev f d for up to 72 hours and then be ^ 10 ^ ~ 

evels Hs P 27 protein levels were increased by 200% over basal levels 24 hours after 

£2? l^T ne ' remained ? th[S ' eVd f ° r 72 h0UfS ' - d returnedTo con rol 
S a a i "° COnS1Stent Change in Hs P 70 or " 90 mR NA or protein levels was 
observed dunng the course of the study. Immunohistochemical studies demons^ that 
he grease m expression of Hsp27 was localized to neurona. and non-neuronSs n 
cinX er TZ ° f *? ret,na : E'ectroretinography studies demonstrated a strong 
correl a t,on between the protection of retinal function from ischemic injury and the 

STa ^ C ° f NCLUSI0NS: prWide evidence thaurltduction 

of Hsp27 1S a gene-specfic event associated with ischemic preconditioning in the retina 
This increase m expression of Hs P 27 occurs in both neuronal and non-neufona ££T 
ceHs, and appears to be one component of the neuroprotective events induced by 
ischemic preconditioning iri the retina. . .' "«"«^u oy 

PMID: 12601062 [PubMed - indexed for MEDLINE] 
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Increasing expression of tissue plasminogen activator and plasminogen 
activator inhibitor type 2 in dog gingival tissues with progressive 
inflammation. 

Liadberg P, Kmnby B, Lecander I, Lang_NP, Matsson L 

Center -for Oral Health Sciences, Malmo University, S-2 14 21 Malmo, Sweden 
pia.Iindberg@od.rhali.se ■ ' 

fh^TT 4 ! tWt ^ e .P lasmi ™g<* activators.(u-PA and t-PA) are serine proteases 
that convert plasminogen mto plasmin, which degrades matrix proteins and activates 
meta loproteinases. The PAs are balanced by specific inhibitors (PAI-1 and PAI-2) 
I^cal production of t-PA and PAI-2 was recently demonstrated in human gingival 
tissues. The « now was to mvestigate the production and localization of t- PAand PAI- 
-7 ! u, . gingival tissues.from dogs in three well-defined periodontal conditions; clinically 
healthy gingiva, chronic gmgivitis and an initial stage of ligature-induced loss of 
attachment. At the start of the experiment the gingiva showed clear signs of 
inflammation Clinically healthy gingiva were obtained after 21 days period of intense 

some ^ f ^ by p,adn S H £ atures ~» the neck of 

some teeth. Busies were taken from areas representing the different conditions and 

S k 1" o ? hy , b " d,2at,on immunohistochemistry. In clinically healthy 
gingiva both t-PA mRNA and antigen were expressed in a thin outer layer of the sulcular 
ardjunc *iona ep.theha. No t-PA signals or staining were seen in connective tissue. Both 
mRNA signaling .and unmunostaining for t-PA were stronger in chronic gingivitis In 
areas w^ loss of attachment, t-PA mRNA as well as antigen were foundlnfhe s Icu.ar 
and unchonal epuheha to a similar degree as in gingivitis. Occasionally die connective 

Z ZTt f.L e,P f a f ly ^ COnnectio " with v **e!s. PAI-2 mRNA was seen in a - 
thin outer layer of the sulcular and junctional epithelia in clinically healthy gingiva but 

ouStr?^" 6 ■ in | COnn f. tive tissue " PAI -2 antigen was found primarily m th'e 
outer layer of the sulcular and juncUonal epithelia, Some cells in the connective tissue 
were stained. In gingivitis, PAI-2 signals were mainly found in the same locations but 
more intense and extending towards the connective tissue, unmunostaining was seen in 
the outer half of the sulcular and junctional epithelia as well as in the upper part of the 
coimecnve t.ssue, close to the sulcular epithelium. In sites with loss of attachment, PAI- 
2 mRNA was found U^roughout the sulcular and junctional epithelia, as was the antigen " 
which stained intensely. No PAI-2 mRNA was seen in connective tissue; the antigen was 

£ "and?! r re ! ■ eSPeC,a " y "r VCSSe,S - ThiS *«* sh ° WS ** the -P-^n of S t • 
PA and PAI-2 mcreases with experimental gingival inflammation in the dog and 
furthermore, the two techniques demonstrate a strong correlation between the 
topograph^ distribution of the site of protein synthesis and the tissue location of the 
antigens for both t-PA and PAI-2. The distribution correlates well with previous 
findings in humans. 1 



135: Mod Pathol. 2002 Sep;15(9):979-87. 



Related Articles, Links 



Effect of duration of fixation on quantitative reverse transcription 
polymerase chain reaction analyses. 

Macabeo-Ong M, Ginzinger DC Dekker N, McMillan A . Regezi JA Won* DT 
Jordan RC . ' ' 

Oral Pathology, Department of Stomatology, University of California San Francisco 
California 94143-0424, USA. 

Increasingly, there is the need to analyze gene expression in tumor tissues and correlate 
these findings with clinical outcome. Because there are few tissue banks containing 
enough frozen material suitable for large-scale genetic analyses, methods to isolate and 
quantify messenger RNA (mRNA) from formalin-fixed, paraffin-embedded tissue 
sections, are needed. Recovery of RNA from routinely processed biopsies and 
quantification by the polymerase chain reaction (PGR) has been reported; however, the 
effects of formalin fixation have not been well studied. We used a proteinase K-salt 
precipitation RNA isolation protocol followed by TaqMan quantitative PCR to compare 
the effect of formalin fixation for 24, 48, and 72 hours and for 1 week in normal (2), oral 
epithelial dysplasia (3), and oral squamous cell carcinoma (4) specimens yielding 9 fresh 
and 36 formalin-fixed samples. We also compared mRNA and protein expression levels 
using immunohistochemistry for epidermal growth factor receptor (EGFR), matrix 
metalloproteinase (MMP)-l, p21, and vascular endothelial growth factor (VEGF) in 15 
randomly selected and routinely processed oral carcinomas. We were able to extract 
RNA suitable for quantitative reverse transcription (RT) from all fresh (9/9) and 
formalin-fixed (36/36) specimens fixed for differing lengths of time and from all (15/15) 
randomly selected oral squamous cell carcinoma. We found that prolonged formalin 
fixation (>48 h) had a detrimental effect on quantitative RT polymerase chain reaction 
results that was most marked for MMP-1 and VEGF but less evident for p21 and EGFR. 
Comparisons of quantitative RT polymerase chain reaction and immunohistochemistry 
showed that for all markers, except p21, there was good correlation between mRNA and 
protein levels. p21 mRNA was overexpressed in only one case, but protein levels were 
elevated in all but one tumor, consistent with the established translational regulation of 
p21 . These results show that RNA can be reliably isolated from formalin-fixed, paraffin- 
embedded tissue sections and can produce reliable quantitative RT-PCR data. However, 
results for some markers are adversely affected by prolonged formalin fixation times. 

PMID: 12218216 [PubMed - indexed for MEDLINE] 
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Id-1 and Id-2 arc ovcrcxpresscd iu pancreatic cancer and in dysplastic 
lesions in chronic pancreatitis. 

Maruyama H , KlceffJ, WildiS , Fries* H . Buchler MW. [srael MA . Korc M 
Division of Endocrinology, Department of Medicine, University of California, Irvine, 

Id proteins antagonize basic helix-loop-helix proteins, inhibit differentiation, and enhance 
cell proliferation. In this study we compared the expression of Id-1, Id-2, and Id-3 in the 
normal pancreas, in pancreatic cancer, and in chronic pancreatitis (CP). Northern blot 
analysis demonstrated that all three Id mRNA species were expressed at high levels in 
pancreatic cancer samples by comparison with normal or CP samples. Pancreatic cancer 

WT S ^ equenfl y expressed all three Ids, exhibiting a good correlation between Id 
mRNA and protein levels, as determined by immunoblotting with highly specific anti-Id 
anybodies. Immunohistochemistry using these antibodies demonstrated the presence of 
taint Id-1 and Id-2 immunostaining in pancreatic ductal cells in the normal pancreas 
whereas Id-3 immunoreactivity ranged from weak to strong..In the cancer tissues m'any 
of the cancer cells exhibited abundant Id-1, Id-2, and Id-3 immunoreactivity. Scoring on 
the basis of percentage of positive cells and intensity of immunostaining indicated that 
Id-1 and Id-2 were increased significantly in the cancer cells by comparison with the 
respective controls. Mild to moderate Id immunoreactivity was also seen in the ductal 
cells in the CP-hke areas adjacent to these cells and in the ductal cells of small and 
interlobular ducts in CP. In contrast, in dysplastic and atypical papillary ducts in CP Id-1 
and Id-2 immunoreactivity was as significantly elevated as in the cancer cells These 
findings suggest that increased Id expression may be associated with enhanced 
proliferative potential of pancreatic cancer cells and of proliferating or dysplastic ductal 
ceils in v^r . 



PMID: 10487839 [PubMed - indexed for MEDLINE] 
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ld-1 and ld-2 Are Overexpressed in Pancreatic 
Cancer and in Dysplastic Lesions in 
Chronic Pancreatitis 



Haruhisa Maruyama/ Jorg Kleeff,* Stefan Wildi/ . 
Helmut Friess, f Markus Buchter,* 
Mark A. Israel* and Murray Korc* 

From the Division of Endocrinology, Diabetes, and Metabolism,* 
Departments of Medicine, Biological Chemistry and 
Pharmacology, University of California, Irvine, California; the ' 
Department of Visceral and Transplantation Surgery? University 
of Bern, Bern, Switzerland; and the Preuss laboratory? 
Department of Neurological Surgery, University of California, 
San Francisco, California . 



Id proteins antagonize basic helix-] oop-belix pro- 
teins , inhibit differentiation, and enhance cell prolif- 
eration. In this study we compared the expression of 
Jd-3, Jd-2, ajid ld-3 in the normal pancreas, in pan- 
creatic cancer, and in chronic pancreatitis (CP). 
Northern blot analysis demonstrated that all three Id 
mRNA species were expressed at high levels in pan- 
creatic cancer samples by comparison Tvith normal or 
CP samples. Pancreatic cancer cell lines frequently 
coexpressed all three Ids, exhibiting a good correla- 
tion between Id mRNA and protein levels, as deter- 
mined by immunoblotting with highly specific antMd 
antibodies. Immunohistochemistry using these anti- 
bodies demonstrated the presence of faint Id- J and 
ld-2 immunostaining in pancreatic ductal cells in the 
normal pancreas, whereas Jd-3 immunoreactivity 
ranged from weak to strong. In the cancer tissues, 
many of the cancer cells exhibited abundant ld-3, 
ld-2, and Id-3 immunoreactivity. Scoring on the basis 
of percentage of positive cells and intensity of immu- 
nostaining indicated that IdO and ld-2 ivere increased 
significantly in the cancer cells by comparison with 
the respective controls. Mild to moderate Id inimuno- 
reactivity was also seen in the ductal cells in the 
CP-lUie areas adjacent to these cells and in the ductal 
cells of small and interlobular ducts in CP. In con- 
trast, in dysplastic and atypical papillary ducts in CP, 
ld-1 and ld-2 immnnoreactiviry was as significantly 
elevated as in the cancer ceJJs. These findings suggest 
that increased Jd expression may be associated with 
enhanced proliferative potential of pancreatic cancer 
cells and of proliferating or dysplastic ductal cells in 
CP. (Am J Pathol 1^9, 155:815-822) 



Basic helix-loop-helix (bHLH) proteins play an important 
role as transcription factors in cellular development, pro- 
liferation, and differentiation; 1 - 2 The basic domain of the 
bHLHs is required for binding to an E-box DNA se- 
quence, thus promoting transcription of specific target 
genes. The HLH domain promotes dimer formation with 
various members of the bHLH protein family. 1 - 2 Ho- 
modimers of the class B family of bHLH proteins, includ- 
ing MyoD, NeuroD, and numerous other proteins, are 
known to activate tissue-specific genes. 3 ; 5 These tissue- 
specific bHLHs typically form heterodirners with widely 
expressed class A bHLHs, which include proteins en- 
coded by E2A,£2-2, HEB, arid other genes (also termed 
E-proteins). 6 " 9 These heterodirners activate transcription 
of genes that are associated with differentiation. 

Id genes encode a family of four HLH proteins that lack 
the basic DNA binding domain. 1 - 10 They act as dominant- 
negative HLH proteins by forming high affinity het- 
erodirners with other bHLH proteins, thereby preventing 
them from binding to DNA and inhibiting transcription of 
differentiation-associated genes. 1 °~ 12 Id gene expres- 
sion's down-regulated on differentiation in many cell 
types in vitro and in vivo. 13-18 In addition, Id proteins seem 
to be required for cell cycle progression through G,/S 
phase in certain cell types, and interaction between ld-2 
and pRB is associated with enhanced proliferation in 
some cell lines in vitro? 9 ' 23 

Pancreatic cancer is the fifth leading cause of cancer 
death in. the United States, with a mortality rate that vir- 
tually equals its incidence rate. 24 This malignancy is often 
associated with the overexpressiqn of a variety of myo- 
genic growth factors and their receptors, and by onco- 
genic mutations of K-ras and inactivation of the p53 tumor 
suppressor gene 25 We have recently reported that pan- 
creatic cancers overexpress the HLH protein ld-2, and 
that enhanced expression ol this protein is evident in the 
cytoplasm of the cancer cells within the pancreatic tumor 
mass. 26 It is not known, however, whether ihe expression 
of other Id proteins is altered in this malignancy, or 
whether their expression is altered in chronic pancreatitis 
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(CP)/ an inflammatory, disease that is characterized by 
dysplastic ducts, foci of proliferating ductal cells, acinar 
cell degeneration, and fibrosis. 27 We now report that 
there is a five- to sixfold increase in ld-1 and ld-2 mRNA 
levels and a twofold increase in ld-3 mRNA levels in 
■ pancreatic cancer by comparison with the normal pan- 
creas. In contrast, overall Id mRNA levels are not in- 
creased in CP. 

Patients and Methods 

Normal human pancreatic tissue samples from 7 male 
and 5 female donors (median age 41.8 years, range 
14-^68 years), CP tissues from 13 males and 1 female 
(median age 42.1 years; range 30-56 years), and pan- 
creatic cancer tissues from 10 male and 6 female donors 
(median age 62.6 years: range 53-83 years) were ob- 
tained through an organ donor program and from surgi- 
cal specimens from patients with severe symptomatic 
chronic pancreatitis or pancreatic cancer. A partial 
duodenopancreatectomy (Whipple/pylorus-preserving 
Whipple; n — 13), a left resection of the pancreas (n = 2), 
or a total pancreatectomy (n - 1) were carried out in the 
pancreatic cancer patients. According to the TNM clas- 
sification of the Union Internationale Contre le Cancer 
* (UICC) 6 tumors were stage 1 , 1 was stage 2, and 9 were 
stage 3 ductal cell" adenocarcinoma. Freshly removed 
tissue samples were fixed in 10% formaldehyde solution 
for 12 to 24 hours and paraffin-embedded for histological 
analysis. In addition, tissue samples were frozen in liquid 
nitrogen immediately on surgical removal and maintained in 
-80°C until use for RNA extraction. All studies were ap- 
proved by the Ethics Committee of the University of Bern, 
Bern, Switzerland, and by the Human Subjects Committee 
at the University of California, Irvine, California. 

Northern Blot Analysis 

Northern blot analysis was carried out as described pre- 
. viously. 26 - 28 Briefly, total RNA was extracted by the single 
step acid guanidinium thiocyanate phenol chloroform . 
method. RNA was size-fractionated on 1.2% agarose/1.8 
mol/L formaldehyde gels, electrotransferred onto nylon 
membranes, and cross-linked by UV irradiation. Blots 
were prehybridized and hybridized with cDNA probes 
and washed under high stringency conditions. The fol- 
lowing cDNA probes were used: a 979- bp human ld-1 
cDNA probe, a 440-bp human ld-2 cDNA probe, and a 
450-bp human ld-3 cDNA probe, covering the entire 
coding regions of ld-1, ld-2, and ld-3, respectively. A . 
BamHI 190-bp fragment of mouse 7S cDNA that, hybrid- 
izes with human cytoplasmic RNA was used to confirm 
equal RNA loading and transfer. Blots were then exposed 
at -80°C to Kodak BioMax-MS films and the resulting 
autoradiography were scanned to quantity the intensity of 
the radiographic bands. 26 20 For each sample the ratio of 
Id mRNA expression to 7S expression was calculated. To 
compare the relative increase in expression of the re- 
spective Id rnRNA species in the cancer and CP sam- 
ples, the same normal samples were used for normal/ 




Normal Cancer 



I II I . 




Figure 1. mRNA expression of Jd-l, )d-2, and ld-3-in panc/eaiic cancer and 
chronic pancreatitis. Total BNA (20 jafi/tene) from six nofmjil. eight cancer- 
ous, and seven chronic pancreatitis tissue samples were, subjected 10 North- 
ern blot analysis using 52 P-labeled cDNA probes (500,000 cpm/ml) specific 
for ld-1. Id-2 t and Jd-3 f respectively. A 7S cDNA probe (50,000 cpm/ml) was 
used as a loading and transfer control. Exposure limes of the normal/cancer 
blots were 1 day for all Id probes, and 2 days for the normal/CP blots. 
Exposure time was 4 hours for mouse 7S cDNA." By comparison with the 
normal samples, id-1 and ld-3 mRNA levels' were elevaied in 8 and 9 cancer 
samples, respectively, whereas ld-2 was elevated in 6 cancer samples. 

cancer and normal/CP membranes. The median score for 
ld-1, ld-2, and ld-3 mRNA levels in these normal samples 
was set to 100. Statistical analysis was performed with 
SigmaStat software (Jandel Scientific, San Raphael, CA). 
The rank sum test was used, and P < 0.05 was taken as 
the level of significance. 

Cell Culture and Western Blot Analysis 

PANC-1. MIA-PaCa-2, ASPC-1, and CAPAN-1 human 
pancreatic cell tines were obtained from ATCC (Manas- 
sas, VA). COLO-357 human pancreatic cells were a gift 
from Dr. R. S. Metzger (Durham, NC). Cells were routinely 
grown in DMEM (COLO-357, MIA-PaCa-2. PANC-1) or 
RPMI (ASPC-1. CAPAN-1) supplemented with 10% fetal 
bovine serum, 100 U/ml penicillin, and I00 /tg/ml strep- 
tomycin. For immunoblot analysis, exponentially growing 
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Figure 2. Demiiometric analysis of Northern blots. Autoradiographs of 
Northern blots from 12 normal. J 4 CP, and J 6 pancreatic cancers were 
analyzed by densitometry. mRNA levels were determined by calculating the 
ratio of ibe optical density for. the respective Id niRNA species in relation to 
. ihe optical density of mouse 75 cDNA. To compare the relative increase in 
expression of the respective Id mRNA species in die cancer and CP samples, 
ihe same normal samples were used foi normal/cancer and normal/CP 
membranes. Normal pancreatic tissues are indicated by circles, CP tissues by ' 
triangles, and cancer tissues by squares. I)aia are expressed as median 
scores ± SO. Dy comparison with the normal samples, onh/ the cancer 
samples exhibited significant increases: 6.? fold (P < 0.0 J) for Id-), fivefold 
<P < 0.01.) for ld-2, and twofold ( P = 0.027) for Id 3. 



cells (60-70% confluent) were solubilized in lysis buffer 
containing 50 mmol/I_ Tris-HCl. pH 7.4, 150 mmol/L NaCI; 
1 mmol/L EDTA. 1 ,tg/ml pepstalin A, 1 mmol/L phenyl- 
melhylsullonyl fluoride (PMSP). and 1% Triton X-100. Pro- 
teins were subjected to sodium dodecyl sulfate polyacryl- 




Figure 3. Id mRNA and protein expression in pancreatic cancer cell lines. 
Upper panels:. Total KNA (20 /ig/lanc) from 5 pancreatic cancer cell lines 
were subjected to Northern blot analysis using 3 ?P-labded cONA probes 
(500,000 cpm/ml) specific for ld-1, )d-2, and ld-3 r respectively. Exposure 
limes were J day for all Id probes. Lower panels; Immunoblotiing. Cell 
Jysates (30 pg/lane) were subjected to SDS-PACE. Membranes were probed 
with specific Jd-1, Jct-2, and ld-3 antibodies. Visualization was performed by 
enhanced cbemiluiwinescence. 



amide gel electrophoresis (SDS-PAGE). transferred to 
Immobilon P membranes, and incubated for 90 minutes 
with the indicated antibodies and for 60 minutes with 
secondary antibodies against rabbit IgG. Visualization 
was performed by enhanced chemiluminescence. 



immunohisfochemistry 

Specific rabbit anti-human ld-1 (C-20), Id-? {C-20), and 
ld-3 (C-20: all from Santa Cruz Biotechnology, Santa 



818 . Maruyama et al 

AJP September J999, Vol. 755, No. 3 





Figure 4. Norma) and cancel ous pancreatic tissues were subjected to immu- 
nosointng using highly specific aoii-ld-l (A-C). anti-Id- 2 (D-F) t and antMd-3 
(G-0 antibodies as described in the Methods section. Motleraie lo strong Id- ] 
immunoreactivity tv-js present in ihe cytoplasm of duct- like cancer cells (A 
and C, left panel). Jn the normal pancreas there was weak Id- ) immunore-. 
activity in the ductal cells (B). Pteabsorption with the Id-J-specific blocking 
peptide abolished the ld-1 immunoreaciiviry (C, righl panel). Strong Id- 2 
immunoreactivity W3s observed in the cytoplasm of the cancer cells that 
exhibited duct- 1 ike struaures I'D and F. left panel), whereas in the normal 
pancreiis. there was "only weak Id- 2 immunoi enciivjiy in the ductal cells (E). 
Preabsorption with ihe ld-2-specific blocking peptide abolished the Id- 2 
imimmoreaciiviry (F, tight paneO. Modern e to strong kl-3 immunorenciiviiy 
was present in (he duci-like cancer cells (G and I, left panel). Motleraie to 
strong ld-3 imrmtnor earth* it y was also present in the duct:tl cells of not mat 
pancreatic _ tissue samples (H). ld-3 immunoi eyciiviiy was completely abol- 
ished by pteabsoipiion with the Jd-3 specific blocking peptide ft, right 
paneO- A, D, and G constitute serial sections of a pancreatic cancer sample. 
■ revealing compression of the three Id proteins. Scale bars. 23 fim. 

Cruz, CA) polyclonal antibodies were used lor immunhis- 
lochemistry. These affinity-purified rabbit polyclonal anti- 
bodies specifically react with Id- 1 , ld-2, and ld-3, respec- 
tively, of human oiigin, as determined by Western 
blotting. Paraffin-embedded sections {4 ^m) were sub- 
jected to imrnunostaining using the streptavidin-peroxi- 
dase technique. Where indicated, imrnunostaining tor all 
three Id proteins was performed on seiiat sections. En- 



dogenous peroxidase activity was blocked by incubation 
for 30 minutes with 0.3% hydrogen peroxide in methanol. 
Tissue sections were incubate** for 15 minutes (23°C) 
with 10% normai goat serum and then incubated for 16 
hours at 4°C with the indicated antibodies in PBS con- 
taining 1% bovine serum albumin. Bound antibodies 
were detected with biotinylated goat anti-rabbit IgG sec- 
ondary antibodies and streptavidin-peroxidase complex.- 
using diarninobenzidine tetrahydrochloride as the sub- 
strate. Sections were counterstained with Mayer's hema- 
toxylin. Preabsbrptipn with ld-1-, ld-2-, or ld-3-specific 
blocking peptides completely abolished immunoreactiv- 
ity of the respective primary antibody. The immunohisto- 
chemical results were semiquantitative^ analyzed as de- 
scribed previously. 29 ; 30 The percentage of positive 
cancer cells was stratified into four groups: 0, no cancer 
cells exhibiting immunoreactivity; 1, <33% of the cancer 
cells exhibiting, immunoreactivity; 2, 33 to 67% of the 
cancer cells exhibiting immunoreactivity; 3 >67% of the 
cancer cells exhibiting immunoreactivity. The intensity of 
the immunohistochemical signal, was also stratified into 
four groups: 0, no immunoreaclivily; 1 , weak immunore- 
activity; 2, moderate immunoreactivity; 3. strong immu- 
noreactivity. Finally, the sum of the results or the cell, 
score and the intensity score was calculated. Statistical 
analysis was performed with SigmaStat software. The 
rank sum test was used, and P < 0.05 was taken as the 
level of significance. 



Results 

Northern blot analysis of total RNA isolated from 12 nor- 
mal pancreatic tissues and 16 pancreatic cancers re- 
vealed the presence of the 1.2-kb ld-1 transcript and the 
1.6-kb Id2 mRNAtranscript in 11 of the 12. normal pan- 
creatic samples, and the 1.3-kb ld-3 rnRNA transcript in 
- all normal pancreatic samples (Figure 1A, 2). In the can- 
cer tissues, ld-1 rnRNA levels were elevated in 8 of 16 
samples, ld-2 mRNA levels were elevated in 9 of these 
samples, and ld-3 mRNA levels were elevated in 6 of 
these samples (Figure 1A, 2). Concomitant overexpres- 
sion of all three Id species was observed in 6 of the 
cancer samples (38%). In contrast, none of the Id mRNA 
species were overexpressed in CP by comparison with 
normal controls (Figure 1B, 2). Densitometry analysis of 
all of the autoradiograms indicated that there was a 6.5- 
fold increase (P < 0.01) in ld-1 mRNA levels, a fivefold 
increase (P < 0.01) in ld-2 mRNA levels, and a twofold 
increase (P = 0.027) in ld-3 mRNA levels in the pancre- 
atic cancer samples in comparison to normal controls 
(Figure 2). In contrast, there was no statistically signifi- 
cant difference in the expression levels of ld-1, ld-2, and 
ld-3. in CP tissues in comparison to the corresponding 
levels in the normal pancreas (Figure 2). 

Next, we assessed the expression of the three Id 
genes in 5 human pancreatic cancer cell lines by North- 
ern and Western blot analyses. 'ld-1 mRNA was present 
at varying levels in all 5 cell lines (Figure 3). ASPC-1. 
CAPAN-1. MIA-PaCa-2, and PANC-1 expressed moder- 
ate to high levels ol Id- 1 mRNA, whereas COLO- 357 cells 
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Figure 5. Jmmunohistoeberoisiry of pancrentk: cancer and dysplastic ducts in CP tissues. Jn the pancreatic cancer Tissues CA-C) there was moderate to strong Jd-1 
(A), id-2 (B), and ld-3 (C) imrnunoreaciiviiy in the ductal cells in the areas adjacent to the oncer cells that exhibited CP-like alterations. Islei cells did not exhibit 
Id imrnunoreaciiviiy (outlined by solid arrowheads). Jn the CP samples, moderate to strong ld-1 (D), Jd-2 (E), and Jd-3 (F) immunoreactiviiy was present in the 
cytoplasm of epithelial cells forming large dysplastic ducts. Scale bar, 25 pm. ■ • 



expressed relatively low levels of this mRNA moiety. 
Western blotting with a highly specific anti-Id- 1 antibody 
confirmed the presence of the approximately 14-kd ld-1 
protein in the 4 cell lines that expressed high levels of 
ld-1 mRNA (Figure 3). Furthermore, the three cell lines 
with the highest ld-1 mRNA expression (CAPAN-1, MIA- 
. PaCa-2, and PANC-1) also exhibited the highest ld-1 
protein expression. Variable levels of the 1.6-kb ld-2 
mRNA transcript were present in all 5 cell lines. In addi- 
tion, a minor band of approximately 1.2 kb was visible in 
COLO-357 arid MIA-PaCa-2 cells. Immunoblot analysis 
with a highly specific anti-ld-2 antibody revealed two 
bands of approximately 16 and 18 kd at relatively high 
levels in all of the cell lines with exception of PANC-1 
cells, in which the 16-kd band was relatively faint (Figure 
.3). With the exception ol MIA-PaCa-2 cells, there was a 
good correlation between td-2 mRNA and protein levels 
(Figure 3). ld-3 mRNA was present at high levels in 
MIA-PaCa-2 cells, at moderate levels in COLO-357 cells, 
and at low levels in PANC-1 cells, ld-3 mRNA was not 
detectable in ASPC-1 and CAPAN-i cells (Figure 3). 
Irnmunoblot analysis with a highly specific anti-Id-3 anti- 
body revealed an approximately 14-kd band that was most 
abundant in MtA-PaCa-2 cells, and was also readily appar- 
ent in COLO-357 and PANC-1 cells. In contrast, only a laint 
ld-3 band was seen in ASPC-1 and CAPAN-i cells, thus, 
with the exception of PANC-1 cells, there was a good cor- 
relation between fd-3 rnRNA and protein levels. 



To determine the localization of ld-1, ld-2, and ld-3, 
immunostaining was carried out using the same highly 
specific anti-Id antibodies. In the pancreatic cancers, 
moderate to strong ld-1 immunoreaclivity was present in 
the cancer cells in 9 of 10 Randomly selected cancer 
samples. An example of moderate ld-1 immunoreactiviiy 
is shown in Figure 4A, and of strong immunoreactiviiy in 
Figure AC (left panel). In contrast, in the normal pancreas, 
faint ld-1 immunoreactiviiy was present only in the ductal 
cells of pancreatic ducts (Figure 4B, arrowheads): Pre- 
absorption with the ld-1-specific blocking peptide com- 
pletely abolished the ld-1 immunoreactivity (Figure 4C, 
right panel); The cancer cells also exhibited strong ld-2 
(Figure A, D and F. left panel) and moderate to strong ld-3. 
immunoreactiviiy. An example ol moderate ld-3 immuno- 
reactivity is shown in Figure 4G, and of strong immuno- 
reactivity in Figure 41 (left panel). In contrast, only faint 
ld-2 immunoreactivity was present in the ductal cells in 
the normal pancreas (Figure 4E), whereas ld-3 immuno- 
reactivity in these cells was more variable and ranged 
Irom moderate to occasionally strong (Figure 4H). Islet 
cells and acinar cells were always devoid of Id immunore- 
activity. Preabsorption ol the respective antibody with the 
blocking peptides specific lor ld-2 (Figure 4F, right panel) 
and ld-3 (Figure 4 1, right panel) completely abolished im- 
munoreactivity. Analysis ol serial pancreatic cancer sec- 
tions revealed that there was often colocalization of the 
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Figure 6. ImmuriohisiCKhemistiy of atypical papillary epithelium in CP tissues. Serial section analysis of some CP samples revealed ihe presence oflarge duct-like 
structures with atypical papillary epithelium. Mild to moderate JtM (A) and ld-2 (B) immunoreaciivity and weak ld-3 (C) immunoreactivity was present in the 
cytoplasm of (he cells forming these large ducts with papillary structures. Some CP samples also exhibited moderate ld-3 immunoreactiviiy in ihese Cells (D). Scale 
bar, 25 M" 1 - 



The immunohistochemical data foi ld-1, ld-2, and ld-3 
are summarized in Table 1. In the case of ld-1 and ld-2, 
the cancer cells as well as the dysplastic and atypical 
papillary ducts in CP exhibited a significantly higher 
score than the ductal cells in the normal pancreas. In 
contrast; due to the marked variability in ld-3 immuno- 
staining in the normal pancreas, Ihe differences between 
normal and cancer cells and normal and dysplastic cells 
did not achieve statistical significance. 



Discussion 

Id proteins constitute a family of HLH transcription factors 
Ihat are important regulators of cellular differentiation and 
proliferation. 1 2 To date, tour members of Ihe human Id 
family have been identified. 1 Their expression is 
enhanced during cellular proliferation and in response to 
mitogenic slimuli, 19 * 31 and overexpression of Id genes 
inhibits differentiation and/or enhances proliferalion in 
several different cell types. 153?34 The forced expression 
ol ld-1 in mouse small intestinal epithelium results in 



three Id proteins. An example of serial sections from a 
pancreatic cancer tissue is shown in Figure 4, A, D, and G. 

ld-1. Id-2, and ld-3 immunoreactivity was also present 
at moderate levels in the cytoplasm of ductal cells within 
CP-like areas adjacent to Ihe cancer cells (Figure 5, A-C). 
As in the normal pancreas, islet cells (outlined by arrow- 
heads) did not exhibit Id immunoreactivity. . In 4 of 9 CP 
samples, there were foci of ductal cell dysplasia of rela- 
tively large interlobular ducts, all ol which exhibited mod- 
erate to strong ld-1, ld-2. and ld-3 immunoreactivity (Fig- 
ure 5, D-F). Five of 9 CP samples also contained foci of 
large ducts exhibiting atypical papillary epithelium. Serial 
section analysis of one of those CP samples revealed 
mild to moderate ld-1 and ld-2 immunoreactivity arid 
weak ld-3 immunoreaciivity in the cells of these atypical 
papillary ducts (Figure 6, A C). In contrast, in some of 
these CP samples, moderate to strong fd-3 immunoreac- 
tivity was also observed (Figure 6D). However, most of 
the ductal cells forming the typical ductular structures of 
CP, such as large interlobular ducts and small proliferat- 
ing ducts, exhibited generally only weak to occasionally 
moderate Id immunoreactivity (data not shown). 



Id Proteins in Pancreatic Diseases 821 
AJP September 1999> Vol. 755, No. 3 



Table 1. Histological Scoring 







ld-1 


td-2 


ld-3 


Normal (n = 6) 
Cancer (n = 10) 
CP (n = 9) 


Ductal cells 
Cancer cells 

Typical CP lesions (n - 9) 
Dysplastic ducts {n = 4) 
Atypical papillary ducts [n = 5). 


2.0 + 0.4 
4.5* ± 0.5 
. 2.7 ± 0.5 
5.3* ± 0.2 
4.4* i 0.2 


2.3 ± 0.2 
5.2 5 ±0.3 

3.1 ±0.6- 
5.8* ± 0.2 
5.2* ± 0.2 


2.5 ± 0.9 
4.5 ± 0.6 
3.4 ± 0.7 
5.3 ± 0.4 
5.0 ±0.4 



Scoring of, the histological specimens was performed as described in the Patients and Methods section. Values are the means ± SD of the number 
di samptes indicated *m parenthesis. P values are based on comparisons with the respective controls in the normal samples. 
\ P < 0.02; *P.< 0.01; *P = 0.004; 5 P « 0.001. 



adenoma formation in these animals. 35 The growth-pro- 
moting effects of Id genes are thought to occur through.' 
several mechanisms. For example; ld-2 can bind to mem- 
bers of the pRB tumor suppressor family, thus blocking 
their growth-suppressing activity, 20 * 21 and ld-1 and ld-2 
can antagonize the bHLH-rhediated activation of known 
inhibitors of cell cycle progression such as the cyclirv 
dependent kinase inhibitor P21. 23 

In the present study, we determined by Northern blot 
analysis that a significant percentage of human pancre- 
atic cancers expressed increased ld-1, ld-2, and ld-3 
mRNA levels. Increased expression was most evident for 
ld-1 (6.5-fold) and ld-2 (fivefold). In contrast, ld-3 mRNA 
levels were only twofold increased in the cancer samples, 
partly because this mRNA was present at relatively high 
levels in the normal pancreas. Immunhistochemical anal- 
ysis confirmed the presence, of ld-1, ld-2, and ld-3 in the 
cancer cells within the tumor mass, whereas in the normal 
pancreas faint ld-1 and ld-2 immunoreactivity and mod- 
erate to occasionally strong ld-3 immunoreactivity was 
present in some ductal ceils. Pancreatic acinar and islet 
cells in the normal pancreas were devoid of ld-1, ld-2. 
and Id^3 immunoreactivity. In the cancer samples, all 
three Id proteins olten colocalized in the cancer cells. 
Coexpression of all three Id genes was also observed in 
cultured pancreatic cancer cell lines, which often exhib- 
ited a close correlation between Id mRNA and protein 
expression. However, in MIA-PaCa-2 there was a diver- 
gence of ld-2 mRNA and protein levels, and in PANC-1 
cells, ld-3 mRNA levels did not correlate well with ld-3 
protein expression. These observations suggest that in 
these cells, the hall-life ol either Id mRNA or Id protein 
may be altered by comparison with the other cell lines. 
Interestingly, ld-2 immunoblotting revealed two closely 
spaced bands of approximately 16 and 18 kd in 4 of 5 
cell lines. In view ol the fact that two possible initiation 
codons have been reported for the ld-2 gene, 36 our 
observation raises the possibility that the two ld-2-immu- 
noreactive bands may represent separate translation 
products ol the ld-2 gene. 

Pancreatic cancers often harbor p53 tumor suppressor 
gene mutations 37 and exhibit alterations in apoptosis 
pathways. Thus, these cancers often exhibit increased 
expression ol ant i-apopt otic proteins such as Bcl-2 38 and 
abnormal resistance to Fas-ligand- mediated apopto- 
sis 39 It has been shown recently that forced constitutive 
expression ol Id genes together with the expression of. 
anti-apoptotic genes such as Bcl-2 or BctX L can result in 



malignant transformation of human fibroblasts, 11 raising 
the possibility that the enhanced Id expression in pan- 
creatic cancers together with increased expression of 
antl-apoptotfc genes may contribute to the malignant 
potential of pancreatic cancer cells in vivo. 

In the CP tissues there was no significant increase in 
ld-1, ld-2, and ld-3 mRNA levels in comparison to the 
normal pancreas. Immunohistochemical analysis of pan- 
creatic cancer samples revealed colocalization of weak 
to moderate ld-1, ld-2, and ld-3 immunoreactivity in pro- 
(iterating ductal cells in the CP-like regions adjacent to 
the cancer cells, indicating that Id expression was not 
restricted to the cancer cells. Similarly, analysis of CP 
samples indicated weak Id- 1, ld-2, and ld-3 immunore- 
activity in the cells of small proliferating ducts and large 
ducts without dysplastic changes. In general, there was a 
correlation between weak immunoreactivity and low Id 
mRNA levels. However, in samples that harbored large 
ducts with papillary structures there was moderate Id 
immunoreactivity, and in the cells forming dysplastic 
ducts there was moderate to strong Id immunoreactivity. 
In these CP samples, Id mRNA levels were relatively 
higher than in the CP samples that were devoid of these 
histological changes. Overall, however, increased Id ex- 
pression, most notably of ld-1 and ld-2, distinguished a 
subgroup of pancreatic cancers from CP (Table 1). 

Epidemiological studies have shown that the risk of 
developing pancreatic cancer is increased up to 16-fold 
in patients with pre-existing CP in comparison to the 
general population. 40 The mechanisms that contribute to 
neoplastic transformation in CP are not known. Although 
there is no established tumor progression model lor pan- . 
creatic cancer, such as Ihe adenoma-carcinoma se- 
quence of colorectal carcinoma, 41 it is generally ac- 
cepted, that K-ras and pt6 mutations occur relatively 
early in pancreatic carcinogenesis, whereas p53 muta- 
tions occur late in this process. 37 - 41 ~ 43 Increased Id ex- 
pression may contribute to malignant transformation ol 
cultured cell lines in vitro 1 * and has been linked to cell 
invasion in a murine mammary epithelial cell line. 44 In 
view of the current findings that ld-1. ld-2. and ld-3 are 
overexpressed in pancreatic cancer and in dysplastic/ 
metaplastic ducts in CP. these observations raise the 
possibility that elevated levels ol ld-1 . fd-2. and. to a 
lesser extent, ld-3 may represent relatively early markers 
of pancreatic malignant transformation and may contrib- 
ute to the pathobiology of pancreatic cancer. 
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Tightly regulated and inducible expression of a yoked hormone-receptor 
complex m HEK 293 cells, leceproi 

M eehan TP, PuettD, Naravan P 



it was found IhatYHR expression was ttwUvrX^ H . mUedpaSSaSe - 

development of clonal lines of HEK ?of ^11 ■ , t ' ? m "' we descnbe fc 

. fte control of, te^cy uSI "lafed 5T , ^ ^'f 0 " ° f ' S °" der 

tight control of YHR «™«J- 1. Characterization of clonal lines revealed 

receptor and basal cAMP productiofMoTeoverTe rTuZn-^ ° f 

following doxycycline removal revealedTat S^n^ 

rates, again suggesting a strong linkage h^^iT^TJZVe * 
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c-fos and estrogen receptor gene expression pattern in the rat uterine 
epithelium during the estrous cycle. 

Mendoza-Rodripuey, C.A Merchant-Larios ff Segura-Valdez MI. Mnrgmw 
MendozaN, CruzME, Artea ga-Lopez P, Camacho-Arrovo f , Dominguez R C*rh«n 

Facultad de Quimica, Universidad Nacional Autonoma de Mexico, Ciudad Universitaria 
Coyoacan 04510, Mexico, D.F., Mexico. 

Different studies in ovariectomized estrogen treated animals support the idea that c-fos 
Plays a role m the proliferation of uterine epithelial cells. However, these studies invite us 
. to reassess the role played by c-fos in epithelial cell types of the endometrium during the 
estrous cycle: The present study was undertaken to determine the c-fos and estrogen 
receptor (ER) gene expression pattern in the rat uterine epithelium during the estrous 
cycle m which natural and cyclic changes Of steroid hormones occur, and correlate these 
changes W1 th the proliferation status of this cellular types. Proliferation was assessed 
during the estrous cycle using bromodeoxyuridine incorporation to DNA ERalpha and 
beta proteins were assessed by immunohistochemistry. The regulation of c-fos gene 
expression m the uterus of intact animals during the estrous cycle was evaluated using 
both ^in situ hybridization and immunohistochemishy. Estradiol (E(2)) and progesterone 
im» Plasma levels were assessed by radioimmunoassay. The results indicated that 
luminal (LE) and glandular epithelia (GE) presented maximal proliferation during the 
metestrus (M) and the diestrus (D) days. However, during the proestrus (P) day only LE 
presented proliferation, and during the estrus (E) day only the stromal cells proliferated 
A marked immunostatning for ERalpha was detected in both LE and GE cells during the 
early phases of the cycle but diminished on the P and the E day. In contrast, ERbeta was 
undetectable in both epithelia during all stages of the cycle. The highest c-fos mRNA 
level was detected in both epithelia on the M day, followed by a significant reduction 
during the other days of the cycle. The highest protein content was observed on the M 
and D days, and the minimal value was detected on the E day. The c-Fos protein level in 
LE was increased during M and D days, presenting a high correlation with the cellular 
proliferation pattern of this cell type. In conclusion, the overall results indicate that c-Fos 
protein presented a good correlation with uterine epithelial cell proliferation of LE In the 
case of GE, the same tendency was observed, although no significant correlation was ' 
found. Both in LE and GE, c-fos mRNA did not strictly correlate with its protein levels 
c-tos seems to have a postranscriptional regulation in uterine epithelial cells during the 
rats estrous cycle. Copyright 2003 Wiley-Liss, Inc. 

PMID: 12589649 [PubMed - indexed for MEDLINE] 
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[3H]MK-801 binding and the mRNA for the NMDARl subunit of the 
NMDA receptor arc differentially distributed in human and rat forebrain. 

Meooi P . Mugnaini Bunnemanu BH. Trist DG . Bowery NG . 

Department of Pharmacology, Medical School, University of Birmingham, UK. 
meohip@novel 1 5. bham.ac.uk 

The distributions of [3H]MK-801 binding and the NMDA NR1 subuait mRNA were 
studied using receptor autoradiography and in-situ hybridization in rat and human brain 
whole-hemisphere coronal sections. Receptor protein detected by radioligand 
autoradiography and the mRNA for the key subunit of the receptor presented similar 
distributions in the forebrain, with a few areas showing an imbalance between the levels 
of mRNA and receptor protein. Human frontal cortex showed a relative abundance of 
NMDARl mRNA as compared to [3H]MK-801 binding. The same area in rat brain did 
not show any difference in the two distributions. In comparison, the rat claustrum 
presented a relative excess ofNMDAJRJ mRNA which was not detected in human 
sections. Human caudate nucleus exhibited relatively high levels of [3H]MK-801 binding 
that were unmatched in rat caudate. The hippocampi of either species presented similar 
levels of [3H]MK~801 binding and NMDARl mRNA, but when the two signals were 
measured in specific subfields of the hippocampal formation, the differential distribution 
of the two signals reflected the anatomy of hippocampal connections assuming a 
preferential dendritic distribution for MK-801 binding. Interestingly, rat and human 
hippocampi also showed some important species-dependent difference in the relative 
distribution of the receptor protein and mRNA. The data presented show an overall good 
correlation between the mRNA for the key subunit of the NMDA receptor and the 
functional receptor detected with radioligand binding and highlight the presence of local . 
differences in their ratio. This may reflect different splicing of the mRNA for the 
NMDARI subunit in specific brain areas of rat and human. The species-dependent 
differences in the relative distribution of the mRNA for the key subunit of the NMDA 
receptor and that of a marker of functional receptors also highlights important differences 
in the NMDA function in rat and human brain. 
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Abstract 

Translation initiation is regulated in response to 
nutrient availability and myogenic stimulation and is 
coupled with cell cycle progression and cell growth. 
Several alterations in translational control occur in 
cancer* Variant mRNA sequences can alter the 
translational efficiency of individual mRNA molecules, 
which in turn play a role in cancer biology. Changes in* 
the expression or availability of components of the 
translational machinery and in the activation of 
translation through signal transduction pathways can 
lead to more global changes, such as an increase in 
the overall rate of protein synthesis and translational 
- activation of the mRNA molecules involved in cell 
growth and proliferation. We review the basic 
principles of translational control, the alterations 
encountered in cancer, and selected therapies 
targeting translation initiation to help elucidate new 
therapeutic avenues. - 

Introduction 

The fundamental principle of molecular therapeutics in can- 
cer is to exploit the differences in gene expression between 
cancer cells and normal cells. With the advent of cDNA array 
technology, most efforts have concentrated on identifying 
differences in gene expression at the level of mRNA, which 
can be attributable either to DNA amplification or to differ- 
ences in transcription. Gene expression is quite complicated, 
however, and is also regulated at the level of mRNA stability, 
mRNA translation, and protein stability. 

The power of translational regulation has been best recog- 
nized among developmental biologists^ because transcription 
does not occur in early embryogenesis in eukaryoles. For ex- 
ample, in Xenopus, the period of transcriptional quiescence 
continues until the embryo reaches midbfastula transition, the 
4000-cell stage. Therefore, all necessary mRNA molecules are 
transcribed dxaing oogenesis and stockpiled in a transJationally 
inactive, masked form. The mRNA are translationaliy activated 
at appropriate times during oocyte maturation, fertilization, and 



early embryogenesis and thus, are under strict translational 
control. 

Translation has an established role in cell growth. Basi- 
cally, an increase in protein synthesis occurs as a . conse- 
quence of mitogenesis. Until recently, however, little was 
known about the alterations In mRNA translation in cancer, 
and much is yet to be discovered about their role in the 
development and progression of cancer. Here we review the 
basic principles of translational control, the alterations en- 
countered in cancer, and selected therapies targeting transta- . 
tion initiation to elucidate potential new therapeutic avenues. 

Basic Principles of Translational Control 
Mechanism of Translation Initiation 
Translation nrtittation is the main step in translational regulation. 
Translation britiation is a complex process in which the initiator 
tRNA and the AOS and 60S ribosornal subunits are recruited to 
the 5' end of a mRNA motecuJe and assembled by eukaryotic 
translation initiation factors into an 805 ribosome at the start 
codon of the mRNA (Fig. 1). The 5' end of eukaryotic mRNA is 
capped, i.e., contains the cap structure m 7 GpppN (7-metbyt- 
o^anosine-triphospho-S'HtoiucleosW Most translation in 
eukaryoles occurs in a cap-dependent fashion, i.e., the cap is 
specifically recognized by the eIF4E. 3 which binds the 5' cap. 
. The elF4F translation initiation complex is then formed by the 
assembly of elF4E, the RNA heRcase elF4A, and eIF4G, a 
scaffolding protein that mediates the binding ot the 40S ribo- 
sornal subunit to the mRNA molecule through interaction with 
the eJF3 protein present on the 40S ribosome. elF4A and elF4B 
participate in melting the secondary structure of the 5' UTR of 
the mRNA. The 43S initiation complex (40S/elF2/Met-tRNA/ 
GTP complex) scans the mRNA in a 5'->3' direction until it 
encounters an AUG start codon. This start codon is then base- 
paired to the anticodort of initiator tRNA, forming the 48S initi- 
ation complex The initiation factors are then displaced from the 
48S complex, and the 60S. ribosome joins to form the 80S 
ribosome. 

Unlike most eukaryotic translation, translation initiation of 
certain mRNAs, such as the picomavtrus RNA, is cap inde-' 
pendent and occurs by internal ribosome entry. This mecha- 
nism does not require e!F4E. Ether the 43S complex can bind 
the initiation codon directly through interaction with the IRES En 
the 5' UTR such as in the enceptolomycca/difc virus, or it can 



Received V16/02; revised 7/1270?; accepted 7/22/02. 

1 F. M. is supported by The University ol Texas M. O Anderson Cancer 

Center Physician-Scientist Program and by N1H Grant 1K08-CA 97895- 

01. K. K. H. is supported by Department ol Defense Award DAMD- 17-97- 

1-7t62. 

7 To whom requests lor sprints should be addressed, at Department ol 
Surgical Oncology. Be* 444, The University ol Texas M. D. Anderson 
Cancer Center, 1515 Hotcombe Boulevard. Houston. TX 77030. Phone: 
(713) 745-1453; Fax: (713) 745-4926; E-mail: tmericermJanderson.org. 



3 The abbreviations used are: etF4f£, eukaryolic into tion factor 4£; UTR, 
untranslated region; IRES, interna* rfcosome entry site; 4E-BP1. eukaryotic 
initiation teeter 4E-bmding protein 1; S6K. ribosornal p70 S6 Kinase, mTOR 
mammalian target ol rapamycm; AIM, ataxia tetengrectaswi mutated; PBK, 
phosphatWytinosilol 3- kinase; PTEN. phosphatase and tensin homolpg de- 
leted horn ctvomosome 10. PP2A. protein phosphatase 2A; TGT- 03, bans- 
torming growth tactor-03; PAP. por/(A) polymerase; EPA. eicosopentaencic 
acid; mda-7. metanoma cWtefetitialioo- associated gene 7. 



•^AGE 3/11 ■ RCVDAT 5/75/2006 12:39:38 PNI [Pacific 



Dayhflhl Time] > SVR:SVCS01/0 ■ DN1S:6606 » CSID:613 957 9303 - DUEAT,n» 



tmm-ss):09-3Qp 



MAY 25 2006 15:39 FR CI ICIST 

72 Transbttoo Initia&m in Cancer 



613 952 9303 TO. 03246606 




BimBng of cDF4F complex to 
mRNA 5* cap 




Unwinding of secondary structure 
and formation of 48S 




Scanning, 

release of initiation factors, 
80S formation 



AAAAAA 



Fig. 1. Translation initiation in euxaryotes. The 4E-CPs are hyperphos- 
phorylated to release elF4E so that a can interact with the 5' cap. and the 
elF4F initiation complex is assembled. The interaction of poty(A) binding 
protein with the initiation complex and cuculanzatton ol the mRNA is not 
depicted in the diagram. The secondary structure ol the 5' UTR is melted 
the 4 OS ribosomat subonil is bound to elF3„ and the ternary complex 
consisting of eJF2. GTP. and the Met-tRNA are recruited to the mRNA. The 
ribosome scans the mRNA in a 5'~>3' direction until an AUG start codon 
is found in the appropriate sequence context The initiation factors are 
released, and the large ribosomal subunit h recruited 



initially attach to the IRES and then reach the initiation codon by 
scanning or transfer, as is the case with the poHovirus (1). 

Regulation of Translation Initiation 
Translation initiation can be regulated by alterations in the 
expression or phosphorylation status of the various factors 
involved. Key components in translational regulation that 
may provide potential therapeutic targets follow. 

elF4EL elF4E plays a central rote in translation regulation. 
It is the least abundant of the initiation factors and is con- 
sidered the rate-limiting component for initiation of cap- 
dependent translation. elF4E may also be involved in mRNA 
splicing, mRNA 3' 'processing, and mRNA nucteocytoplas- 
rriic transport (2). elF4E expression can be increased at the 
transcriptional level in response to serum or growth factors 
(3). elF4E overexpression may cause preferential translation 
of mRNAs containing excessive secondary structure in their 
5' UTR that are normally discriminated against by the trans- 



lations! machinery and thus are inefficiently translated (4-7). 
As examples of this, overexpression of elF4E promotes in- 
creased translation of vascular endothelial growth factor, 
fibroblast growth tactor-2, and cycRn D1 (2, 8, 9). 

Another mechanism of control is the regulation of elF4E 
phosphorylation. e)F4E phosphorylation is mediated by the 
mHogervactivated protein kinase- interacting kinase 1, which 
is activated by the mitogen-activaled pathway, activating 
extracellular signal-related kinases and the stress-activated 
pathway acting through p38 mitogen-acttvated protein ki- 
nase (10-13). Several mitogens, such as serum, platefet- 
derived growth factor, epidermal growth factor, insulin, 
angiotensin II, sre kinase overexpression, and ras over- 
expression, lead to elF4E phosrjhorylation (14). The phos- 
phorylation status of elF4E is usually correlated with the 
translation^ rate and growth status of the cell; however, 
elF4E phosphorylation has also been observed In response 
to some cellular stresses when translational rates actually 
decrease (15). Thus, further study Is needed to understand 
the effects of eIF4E phosphorylation on e!F4E activity. 

Another mechanism of regulation is the alteration of elF4E 
availability by the binding of elF4E to the eiF4E-binding pro- 
teins (4E-BP, also known as PHAS-I). 4E-BPs compete with 
elF4G for a.binding site in elF4E. The bfoding of elF4E to the 
best characterized elF4E-b!nding protein, 4E-BP1, is reguv 
lated by 4E-BP1 phosphorylation. Hypophosphorylated 4E- 
BP1 binds to elF4E, whereas 4E-BP1 hyperphosphorylation 
decreases this binding. Insulin, angiotensin, epidermal 
growth factor, platelet-derived growth factor, hepatocyte 
growth factor, nerve growth factor, insulin-like growth factors 
I and II, interleukin 3, granulocyte-macrophage colony-stim- 
ulating factor + steel factor, gastrin, and the adenovirus have 
all been reported to induce phosphorylation of 4E-BP1 and 
to. decrease the ability of 4E-BP1 to bind elF4E (15, 16). 
Conversely, deprivation of .nutrients or growth factors result*. . 
in 4E-BP1 dephosphorylation, an increase in elF4E binding, 
and a decrease in cap-dependent translation. 

p70 S6 Kinase. Phosphorylation of ribosomal 40S protein 
S6 by S6K is thought to play an important role in translational 
regulation. S6K -/- mouse embryonic cells proliferate more 
slowly than do parental cells, demonstrating that S6K has a 
positive influence on ceH proliferation (17). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
oRgopyrimidine tract (5' TOP) found at the 5' UTR of nlxosornal 
proteh mRNAs and other mRNAs coding for components of 
the translational machinery. Phosphorylation of S6K is regu- 
lated in part based on the availability of nutrients (18, 19) and is 
stimulated by several growth factors, such as platelet-derived 
growth factor and insulin-like growth factor I (20). 

elF2a Phosphorylation. The binding of the initiator tRNA 
to the small ribosomal unit is mediated by translation initia- 
tion factor elF2. Phosphorylation of the cr-subunil of elF2 
prevents formation of the elF2/GTP/Met-t RNA complex and 
inhibits global protein synthesis (21, 22). elF2o is phospho- 
rylated under a variety of conditions, such as viral infection, 
nutrient deprivation, heme deprivation, and apoptosis (22). 
etF2a is phosphcrytated by heme-regutated inhibitor, nutnent- 
regulated protein kinase, and the IFN-hxfuced. doubfe- 
stranded RNA-activated protein kinase (PKR; Ref. 23). 



MAY 25 2006 15:39 FR CI 



1CIST 



613 952 9303 TO- )03246606 P. 05 

Mqtecubr Cancer Therapeutics 973 



The mTOR Signaling Pathway. The macrolide antibiotic 
rapamycin (SiraBmus; Wyeth-Ayerst Research, Coflegevirte, 
PA) has been the subject of intensive study because it in- 
hibits signal transduction pathways involved in T-ceB activa- 
tion. The rapamycfo-sensitrve component of these pathways 
is mTOR (also called FRAP or RAFH). mTOR is the mam- 
malian homologue of Ihe yeast TOR proteins that regulate O, 
progression and translation in response to nutrient available 
ity (24). mTOR is a serine-threonine kinase that modulates 
. translation initiation by altering the phosphorylation status of 
4E : BP1 and S6K (Fig. 2; Ref. 25). 

4E-8P1 Is pbosphorylated on multiple residues. mTOR pbos- 
phorylates the Thr-37 and Thr-46 residues of 4E-BP1 m vitro 
(265; however, phosphorylation at these sites is not associated 
with a loss of eJF4E binding. F^osphorytetion of Thr-37 and 
* Thr-46 is required for subsequent phosphorylation at several 
CXX)B-ten™nai, serum-sensitive sites; a combination of these 
phosphorylation events appears to be. needed to inhibit the 
binding of 4E-BP1 to eIF4E £5)- "Wte product of the>»7M gene, 
p3a/MSK1 pathway; and protein, kinase Co also play a rote in 
4E-BP1 phosphorylation (27-29). 

S6K and 4E-BP1 are also regulated, in part, by PI3K and its 
downstream protein kinase Akt. PTEN is a phosphatase that 
negatively regulates PI3K signaling. PTEN nu0 cells have 
constitutive^ active of Akt, with increased S6K activity and 
S6 phosphorylation (30). S6K activity is inhibited both by 
PI3K inhalers wortmannin and LY294002 and by mTOR 
inhibitor rapamycin (24). Akt phosphorylates Ser-2448 in 
mTOR in vitro, and this site is phosphorylated upon Akt 
activation in vivo (31-33). Thus. mTOR is regulated by the 
PI3K/AW pathway; however, this does not appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also regulates S6K and 4E-BP1 phosphorylation 
independent of mTOR is controversial. 

Interestingly, mTOR ^©phosphorylation is blocked by wort- 
mannin but not by rapamycin (34). This seeming inconsistency 
suggests that mTOR-responsive regulation of 4E-BP1 and S6K 
activity occurs through a mechanism other than intrinsic mTOR 
kinase activity. An alternate pathway for 4E-BP1 and S6K phos- 
phorylation by mTOR activity is by the inhibition of a phospha- 
tase. Treatment with catycufin A, an inhibitor of phosphatases 1 
and 2A, reduces raparrrycin-induced dephosphorylation of 4E- 
BP1 and S6K by rapamycin (35). PP2A interacts with fuB-length 
S6K but not with a S6K mutant that is resistant to dephospho- 
rylation resulting from rapamycin. mTOR phosphorylates PP2A 
in wtro; however, how this process alters PP2A activity fe not 
known. These results are consistent with the model that phos- 
- phorylation of a phosphatase by mTOR prevents dephospho- 
rylation of 4E-BP1 and S6K, and conversely, that nutrient dep- 
rivation and rapamycin block inhibition of the phosphatase bv 
mTOR Y 
Polyaderrylation. The pory(A) ta3 in eukaryotfc mRNA is 
important in enhancing translation initiation and mRNA sta- 
bility. Pofyadenylation plays a key role in regulating gene 
expression during oogenesis and early embryogenesis. 
Some mRNA that are translationalry inactive in the oocyte are 
poryadenylated concomitantly with translational activation In 
oocyte maturation, whereas other mRNAs that are transit 
tionalry active during oogenesis are deadenylated and trans- 
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lationaHy silenced (36-38). Thus, control of poty(A) tail syn- 
thesis is an important regulatory step in gene expression. 
The 5' cap and pory(A) tail are thought to function synergis- 
tically to regulate mRNA translations! efficiency (39, 40). 

RNA Packaging. Most RNA-binding proteins are assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translational fate of the transcript (41). A highly 
conserved family of Y-box proteins is found in cytoplasmic 
messenger ribonucleoprotein particles, where the proteins 
are thought to play a role in restricting the recruitment of 
mRNA to the translational machinery (41-43). The major 
mRNA-associated protein, YB-1, destabilizes the interaction 
of elF4E and the 5' mRNA cap in vitro, and overexpression of 
YB-1 results tn translational repression in vivo (44). Thus, 
alterations in RNA packaging can ateo play an important role 
in translational regulation. 

Translation Alterations Encountered in Cancer 
Three main alterations at the translational level occur in cancer 
variations in mRNA sequences that increase or decrease trans : 
iattanal efficiency, changes in the expression or availability of 
components of the translational machinery, and activation of 
translation through abenantty activated signal transduction 
pathways. The first alteration affects the translation of an indi- 
vidual mRNA that may play a role in carcinogenesis. The sec- 
ond and third alterations can lead to more global changes, such 
as an Increase in the overall rate of protein synthesis, and the 
translational activation of several mRNA species. 

Variations in mRNA Sequence 

Variations in mRNA sequence affect the translational effi- 
ciency of the transcript- A brief description of these variations 
and examples of each mechanism follow. 

Mutations. Mutations in the mRNA sequence, especially 
in the 5' UTR, can alter its rransfational efficiency, as seen in 
the following examples. ' 
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c-myc Sa'rto et at. proposed that translation of fun-length 
c-myc is repressed, whereas in several BurWtt lymphomas 
that have deletions of the mRMA 5' UTR„ translation of c-myc 
is more efficient (45). More recently, it was reported that the 
5' UTR of c-myc contains an IRES, and thus c-myc transla- 
tion can be initiated by a cap- independent as well as a 
cap-dependent mechanism (46, 47). In patients with multiple 
myeloma, a C->T mutation in the c-myc IRES was identified 
(48) and found to cause an enhanced initiation of translation 
via internal ribosomal entry (49). 

BRCAh A somatic point mutation (117 G-»C) in position 
-3 with respect to the start codon of the BRCA1 gene was 
identified in a highly aggressive sporadic breast cancer (50). 
Chimeric constructs consisting of the wild-type or mutated 
BRCA1 5' UTR and a downstream luciferase reporter dem- 
onstrated a decrease in the transitional efficiency with the 5' 
UTR mutation. 

. CycSn-dependent Kinase Inhibitor 2X Some inherited 
melanoma kindreds have a G-*T transversion at base -34 
of cycfin-dependent kinase inhibrtor-2A, which encodes a . 
cyclin-dependent kinase 4/cyclin-dependent kinase 6 kinase 
inhibitor important in G, checkpoint regulation (51). This 
mutation gives rise to a novel AUG translation initiation 
- codon, creating an upstream open reading frame that com- 
petes for scanning ribosornes and decreases translation 
from the wild-type AUG. 

Alternate Splicing and Alternate Transcription Start 
Sites. Alterations in splicing and alternate transcripfon sites 
can lead to variations in 5' UTR sequence, length, and second- 
ary structure, ultimately impacting translation^ efficiency. 

ATM. The ATM gene has four noncoding exons in its 5' 
UTR that undergo extensive alternative splicing (52). The 
contents of. 12 different 5' UTRs that show considerable 
diversity in length and sequence have been identified. These 
divergent 5' leader sequences play an important role in the 
translational regulation of the ATM gene. 

mdm. In a subset of tumors, overexpression of the onco- 
protein mdm2 results in enhanced translation of the mdm2 
mRNA. Use of different promoters leads to two mdm2 tran- 
scripts that differ only in their 5' leaders (53). The longer 5' 
UTR contains two upstream open reading frames, and this 
mRNA is loaded with ribosornes inefficiently compared with 
the short 5' UTR. 

BRGA1. tn a normal mammary gland, BRCA1 mRNA is 
expressed with a shorter leader sequence (5'UTRa), whereas 
. in sporadic breast cancer tissue, BRCA1 mRNA is expressed 
with a longer leader sequence {5' UTRb); the translational 
efficiency of transcripts containing 5' UTRb is 10 times lower 
than that of transcripts containing 5' UTRa (54). 

TGF-fJ3> TGF-03 mRNA includes a 1.1 -Mb 5' UTR, which 
exerts an inhibitory effect on translation. Many human breast 
cancer cell lines contain a novel 7GF-03 transcript with a 5' 
LRR that is 870 nucleotides shorter and has a 7-fold greater 
translational efficiency than, the normal TGF-03 mRNA (55). 

Alternate Poryadenylation Sites. Multiple poryadenyl- 
ation signals leading to the generation of several transcripts 
w.th differing 3' UTR have been described for several mRNA . 
species, such as the RET proto-oncogene (56), ATM gene 
(52), tissue inhibitor of metaJloproteinases-3 (57). RHOA 



proto-oncogene (58), and calmodulin- 1 (59). Although Ihe 
effect of these alternate 3' UTRs on translation is not yet 
known, they may be important in RNA-protein interactions 
that affect translational recruitment. The role of these alter- 
ations in cancer development and progression is unknown. 



Alterations in ihe Components of the 
Translation Machinery 

Alterations in the components of translation machinery can 
take many forms. 

Overexpresssrpn of elF4E. Overexpression of.elF4E 
causes malignant transformation in rodent cells (60) and the 
deregulation of HeLa cell growth (61). Polunovsky ef aA (62) 
found that elF4E overexpression substitutes for serum and 
Individual growth factors in preserving viability of fibroblasts, 
which suggests that elF4E can mediate both proliferative and 
survival, signaling. 

Elevated levels of elF4E mRNA have been found in a broad ' 
spectrum of transformed cell lines (63). elF4E levels are 
elevated in all ductal carcinoma in situ specimens and inva- 
sive ductal carcinomas, compared with benign breast spec- 
irhens evaluated with Western blot analysis (64, 65). Prelim- 
inary studies suggest that this overexpression is attributable 
to gene amplification (66). 

There are accumulating data suggesting that elF4E overex- 
pression can be valuable as a prognostic marker. elF4E over- 
expression was found in a retrospective study to be a marker of 
poor prognosis in stages I to HI breast carcinoma (67). Verifica- 
tion of the prognostic value of elF4E in breast cancer is now 
under way h a prospective trial (67). However, in a different 
study, elF4E expression was correlated with the aggressive 
behavior of non-Hodgkin's lymphomas (68). In a prospective 
analysis of patients with head and neck cancer, elevated levels 
of elF4E in histologically tumor-free surgical margins predicted 
a significantly increased risk of local-regional recurrence (9). 
These results aH suggest that'eIF4E overexpression can be 
used to select patients who might benefit from more aggressive 
systemic therapy. Furthermore, the head and neck cancer data 
suggest that elF4E overexpression is a field defect and can be 
used to guide local therapy. 

Alterations in Other Initiation Factors. Alterations in a 
number of other initiation factors have been associated with 
cancer. Overproduction of elF4G, similar to elF4E, leads to 
malignant transformation in vitro (69). e!F-2a is found in 
increased levels in bronchioloatveolar carcinomas of the lung 
(3). Initiation factor elF-4A1 is overexpressed in melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunit of 
translation initiation factor 3 is amplified and overexpressed 
in breast and prostate cancer (72), and the elF3-p1 10 subunit 
fe overexpressed in testicular seminoma (73). The role that 
overexpression of these initiation factors plays on the devel- 
opment and progression of cancer, if any, is not known. 

Overexpression of S6K. S6K is amplified and highly 
overexpressed in the MCF7 breast cancer cell line, com- 
pared with normal mammary epithelium (74). In a study by 
Barlund ef a/. (74), S6K was amplified in 5£> of 668 primary 
breast tumors, and a statistically significant association was 
observed between amplification and poor prognosis. 
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Overexpression of PAP. PAP catalyzes 3' pofy(A). syn- 
thesis. PAP is overexpressed in human cancer ceKs com- 
pared with normal and viralry transformed cells (75). PAP 
enzymatic activity in breast tumors has been correlated with 
PAP protein levels (76) and, in mammary tumor cytosote, was 
found to be an independent factor (or predicting survival (76). 
Little is known, however, about how PAP expression or ac r 
tjvity affects the translations! profile. 

Alterations in RN A- binding Proteins. Even less is known 
about alterations in RNA packaging in cancer. Increased ex- 
pression and nuclear localization of the RNA-binding protein 
YB-1 are indicators of a poor prognosis for breast cancer (77), 
non-small cefl lung cancer (78), and ovarian cancer (79). How- 
ever, this effect may be mediated at least in part at the level of 
transcription, because YB-1 increases chemoresistaoce by en- 
hancing the transcription of a multidrug resistance gene (80% 



Activation of Signal transduction Pathways 
Activation of signal transduction pathways by loss of tumor 
suppressor genes or overexpression of certain tyrosine kinases 
can contribute to the growth and aggressiveness of tumors. An 
important mutant in human cancers is the tumor suppressor 
gene PTEN, which leads to the activation of the Pt3K/Akt path- 
way. Activation of PI3K and Akt induces the oncogenic trans- . 
formation of chicken embryo fibroblasts. The transformed cells 
show constitutive phosphorylation of S6K and of (81). 
. A mutant Akt that retains kinase activity but does not phos- 
phorylate S6K or 4E-BP1 does not transform fibroblasts, which 
suggests a correlation between the oncogenicity of PI3K and 
Akt and the phosphorylation of S6K and 4E-BP1 (81). 

Several tyrosine kinases such as platelet-derived growth 
factor, insulin-like growth factor, HER2/neu, and epidermal 
growth factor receptor are overexpressed in cancer. Be- 
cause these kinases activate downstream signal transduc- 
tion pathways known to alter translation initiation, activation 
of translation is likely to contribute to the growth and.aggres- 
siveness of these rumors. Furthermore, the mRNA for many 
of these kinases themselves are under translational control. 
For example, HER2/neu mRNA is translationally controlled 
both by a short upstream open reading frame that represses 
HER2/neu translation in a cell type- independent manner and 
by a distinct cell type-dependent mechanism that increases . 
translational efficiency (82). HER2/neu translation is different 
in transformed and normal cells. Thus, it is possible that 
alterations at the translational level can. in part account for 
the discrepancy between HER2/neu gene amplification de- 
tected by fluorescence in situ hybridization and protein levels 
detected by immunohistochemical assays. 



Translation Targets of Selected Cancer Therapy 
Components of the translation machinery and signal path- 
ways involved in the activation of translation initiation repre- 
sent good targets for cancer therapy. 

Targeting the mTOR Signaling Pathway: Rapamycin 
ond Tumstotin 

Rapamycin inhibits the proliferation of lymphocytes. It was 
initiaJry developed as an immunosuppressive drug for organ 



transplantation. Rapamycin with FKBP 12 (FK506- binding 
protein, M r 12,000) binds to mTOR to inhibit its function. 

Rapamycin causes a small but significant reduction in the 
Initiation rate of protein synthesis (83). It blocks cell growth in 
part by blocking S6 phosphorylation and selectively sup- 
pressing the translation of 5' TOP mRNAs, such as ribosomal 
proteins, and elongation factors (83-85). Rapamycin also 
blocks 4E-BP1 phosphorylation and inhibits cap-dependent 
but not cap-independent translation (17, 86). 

The raparrrycin-sensrtrve signal transduction pathway, acti- 
vated during malignant transformation and cancer progression, 
is how being studied as a target for cancer therapy (87). Pros- 
tate, breast, small cell lung, gfoblastoma, melanoma, and T-cefl 
leukemia are among the cancer lines most sensitive to the 
rapamycin analogue CO-779 (Wyelh-Ayerst Research; Ret 
87). In rhabdomycc«anx)rna cell fines, rapamycin is either cyto- 
static or cytocidai, depending on the p53 status of the cell; p53 
wild-type cells treated with rapamycin arrest in the G, phase 
and maintain their viability, whereas p53 mutant cells accumu- 
late in G, and undergo apoptosis (88, 89). In a recently reported 
study using human primitive neuroectodermal tumor and 
rr*edijBoblastoma models, rapamycin exhibited more cytotox- 
icity in combination with cisplatin and camptothecin than as a 
single agent In vivo. CC 1-779 delayed growth of xenografts by 
1 60% after 1 week of therapy and 240% after 2 weeks. A single 
high-dose administration caused a 37% decrease in tumor 
volume. Growth inhibition in vivo was 1.3 times greater, with 
cisplatin in combination with CCI-779 than with cisplatin alone 

(90) . Thus, preclinical stucSes suggest that rapamycin ana- 
logues are useful as single agents and in combination with 
chemotherapy. 

Rapamycin analogues CCt-779 and RAD001 (Novartis. 
Basel, Switzerland) are now in clinical trials. Because of the 
known effect of rapamycin on lymphocyte prolileration, a 
potential problem with rapamycin analogues is immunosup- 
pression. However, although prolonged immunosuppression 
can result from rapamycin and CCI-779 administered on 
continuous-dose schedules, the immunosuppressive effects 
of rapamycin analogues resolve in —24 h after therapy 

(91) . The principal toxicities of CCI-779 have included der- 
matological toxicity, myelosuppression, infection, mucositis, 
diarrhea, reversible elevations in Ever function tests, hyper : 
gtycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase II trials of CCI-779 have been conducted in 
advanced renal cell carcinoma and in stage II l/TV breast 
carcinoma patients who failed with prior chemotherapy. In 
the results reported in abstract form, although there were no 
complete responses, partial responses were documented in 
both renal cell carcinoma and in breast carcinoma (94, 95). 
Thus, CCI-779 has documented preliminary clinical activity in 
a previously treated, unsetected patient population. 

Active investigation is under way into patient selection for 
mTOR inhibitors. Several studies have found an enhanced 
efficacy of CCI-779 in PTEN-null tumors (30. 96). Another 
study found that six of eight breast cancer cell lines were 
responsive to CCI-779, although only two of these lines 
lacKed PTEN (97) There was. however, a positive correlation 
between Akt activation and CCI-779 sensitivity (97). This 
correlation suggests that activation of the PI3K-AM pathway. 
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regardless of whether it is attributable to a PTEN mutation or 
to overexpression of receptor tyrosine kinases, makes can- 
cer ceil amenable to mTOf> directed therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1, are associated 
with rapamycin resistance; thus, a lower 4E-BP1/elF4E ratio 
may predict rapamycin resistance (98). 

Another mode of activity for rapamycin and its analogues 
appears to be through inhibition of angiogenesis. This activ- 
ity may be both through direct inhibition of endothelial cell 
proliferation as a resuft of mTOR inhibition in these cells or by 
inhibition of translation of such proangiogenic factors as 
vascular endothelial growth factor in tumor cells {99, 1 00). . 

The angiogenesis inhibitor tumstatin, another anticancer, 
drug currently under study, was ateo found recently to inhibit 
translation in endothelial cells (101). Through a requisite in- 
teraction with integrin, tumstatrn inhibits activation of the 
PI3K/Akt pathway and mTOR in endothelial cells and pre- 
vents dissociation of elF4E from 4E-BP1, thereby inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOR-signafing pathway. 



ment also reduces the expression of angiogenic factors (115) 
and has been proposed as a potential ao^uvart therapy for head 
and neck cancers, particuJarty when elevated etF4E is found in 
surgical margins. Sma0 molecule inhibitors that bind the e!RG/ 
4E-BP1-binding domain of e!F4E are proapoptoiic (116) and 
are also being actively pursued. 

Exploiting Selective Translation for Gene Therapy 
A different therapeutic approach that takes advantage of the 
enhanced cap-dependent translation in cancer cells is the use 
of gene therapy vectors encoding suicide genes with highly 
structured 5' UTR. These mRNA would thus be at a competitive 
disadvantage in normal cells and not translate well, whereas in 
cancer celts, they would translate more efficiently. For example, 
the introduction of the 5' UTR of fibroblast growth factor-2 5' to 
the coding sequence of herpes simplex virus type-1 thymidine 
kinase gene, allows for selective translation of herpes simplex 
vims type-1 thymidine kinase gene in breast cancer cell fines 
compared with normal mammary cefl lines and results in se- 
lective sensitivity to ganciclovir (117). 



Targeting elF2cc EPA, Clotrimazole, mda-7, 
and Flavonoids 

EPA is an n-3 polyunsaturated fatty acid found in the fish- 
based diets of populations having a low incidence of cancer 
(102). EPA inhibits the proliferation of cancer cells (103), as 
well as in animal models (104, 105). It blocks cell division by 
inhibiting translation initiation (105). EPA releases Ca 2 * Irom. 
intracellular stores while inhibiting their refilling, thereby ac- 
tivating PKR. PKR, in turn pbosphorylates and inhibits elF2o, 
resulting in the inhibition of protein synthesis at the level of 
translation initiation. Similarly, clotrimazole, a potent antipro- 
liferative agent in vitro and in wvo, inhibits cell growth through 
depletion of Ca 2 * stores, activation of PKR, and phospho- 
rylation of elF2a (106). Consequently, clotrimazole preferen- 
tial decreases the expression of cyclins A, E, and D1, 
resulting in blockage of the cell cycle in 6 V 

mda-7 is a novel tumor suppressor gene being developed 
as a gene therapy agent. Adenoviral transfer of mda-7 f\d- 
mda7) induces apoptosis in many cancer cells including 
breast, colorectal, and lung cancer (107-109). Ad-mda7 also 
induces and activates PKR, which leads to phosphorylation 
of elF2a and induction of apoptosis (110). 

Flavonoids such as genistein and quercetrn suppress tu- 
mor cell growth. All three rnammalian elF2a kinases, PKR, 
heme-regulated inhibitor, and PERK/PEK. are activated by 
flavonoids, with phosphorylation of e!F2o and inhibition of 
protein synthesis (111). 

Targeting elF4A and e)F4E: Antisense SNA 
and Peptides 

Antisense expression of elF4A decreases the proliferation rate 
ot melanoma ceBs (1 12). Sequestration of eIF4E by overexpres- 
sion ot 4E-BP1 is proapoptotic and decreases tumorigenicrty 
(113, 114). Reduction of elF4E with antisense RNA decreases 
soft agar growth, increases tumor latency, and increases the 
rates of tumor doubling times (7). Antisense elF4E RNA treat- 
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Toward the Future 

Translation is a crucial process in every ceil. However, several 
alterations in translations! control occur in cancer. Cancer cells 
appear to need an aberrantly activated transiationaf state for 
survival, thus allowing the targeting of translation initiation with 
surprisingly low toxicity. Components of the translations! ma- 
chinery, such as elF4E, and signal transduction pathways in- 
volved in translation initiation, such mTOR, represent promising 
targets for cancer therapy. Inhibitors of the mTOR have already 
shown some preliminary activity in clinical trials. It is possible 
that with the development of better predictive markers and 
better patient selection, response rates to single-agent therapy 
can be improved. Similar to other cytostatic agents, however, 
mTOR inhibitors are most likely to achieve clinical utility in 
combination therapy. In the interim, pur increasing understand- 
ing of translation initiation and signal transduction pathways 
promise to lead to the identification of new therapeutic targets 
in the near future. 
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Overexpression of chemokines, fibrogenic cytokines, and myofibroblasts in 
human membranous nephropathy. 

Mczzano SA, Droguett MA, Burgos ME , Ardiles LG. Aros CA, jfWs[f f^jnjf 

Division of Nephrology, School of Medicine, Universidad Austral, Vaidivia, Chile. 
sme22ano@uach.cl 

Overexpression of chemokines, fibrogenic cytokines, and myofibroblasts in human 
membranous nephropathy. BACKGROUND: Proteinuria plays a central role in the 
progression of glomerular disease, and there is growing evidence suggesting that it may 
determine tubular cell activation with release of chemokines and fibrogenic factors, 
leading to interstitial inflammatory reaction. However, most studies on this subjecthave 
been performed in experimental models, and the experience in human kidney biopsies 
has been scarce. We analyzed the tissue sections of patients with idiopathic membranous 
nephropathy (IMN), a noninflammatory glomerular disease that may follow a progressive 
disease with heavy persistent proteinuria, interstitial cell infiltration, and decline of renal 
function. METHODS: Paraffin-embedded biopsy specimens from 25 patients with IMN 
(13 progressive and 12 nonprogressive) were retrospectively studied by 
immunohistochemistry [monocyte chemoattractant protein- 1 (MCP-1), regulated on 
activation normal T-cell expressed and secreted chemokine (RANTES), osteopontin 
(OPN)> platelet-derived growth factor-BB (PD-GF-BB)] and in situ hybridization [MCP- 
I , RANTES, PDGF-BB, transforming growth factor-betal (TGF-beta 1 )]. Moreover, we 
studied the presence of myofibroblasts, which were identified by the expression of alpha- 
smooth muscle actin (alpha-SMA), the monocytes/macrophages (CD68-positive cells), 
and T-cell infiltration (CD4+ and CD8+ cells). All of the patients were nephrotic and * 
without treatment at time of the biopsy. RESULTS: A strong up-regulation of MCP-1, 
RANTES, and OPN expression was observed, mainly in tubular epithelial cells, with a 
significant major intensity in the progressive IMN patients. A strong correlation between 
the mRNA expression and the corresponding protein was noted. The presence of these 
chemokines and OPN was associated with interstitial cell infiltration. TGF-beta and 
PDGF were also up-regulated, mainly in tubular epithelial cells, with a stronger 
expression in the progressive IMN, and an association with the presence of 
myofibroblasts was found. CONCLUSIONS: Patients with severe proteinuria and 
progressive IMN have an overexpression in tubular epithelial cells of the qhemokines 
MCP- 1 , RANTES, and OPN and the profibrogenic cytokines PDGF-BB and TGF-beta. 
Because this up-regulation was associated with an interstitial accumulation of 
mononuclear cells and an increase in myofibroblastic activity, it is suggested that those 
mediators are potential predictors of progression in IMN. Finally, based on experimental 
data and the findings of this article, we speculate that severe proteinuria is the main factor 
responsible for the up-regulation of these factors in tubular epithelial cells. 



Decreased uncoupling protein expression and intrarayocytic triglyceride 
depletion m formerly obese subjects. 
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r°rnPi CTIVE: T ° ^ amine the muscular uncoupling protein expression 2 (UCP2) and 

gene expresston in morbid obese subjects before and after bariatric surgery fbilio- 
pancreatic diversion (BPD)]. RESEARCH" METHODS AND PROCEDUREl^even 

BPD. Skeletal muscle UCP2 and UCP3 mRNA was measured using reverse 
fransenptase-competitive polymerase chain reaction and UCP3 protein by Western 

rnrl^T^ 0 ^f 1 ^ 65 were by high-performance liquid 

chromatography. Twenty-four-hour energy expenditure and respiratory quotient ( RO) 
were measured in a respiratory chamber. RESULTS: After BPD, the SJage wlS? oss 

Tam™ l n*? y , 3 fn°- NOnpr0te, ' n RQ WaS inCreased in tne P-^ese subSj 73 
1 0 t n It' n?/' P r ° ° 01) - ^ ^-yocytic triglyceride level dropM (3 66 

UCP2 and UCP2 mR'NA^ 00 "I ° f ^* ^ P < ° 000l > ^ BPD ' Expression of 
UCF 2 and UCP3 mRNA was significantly reduced (from 35.9 +/- 6.1% to 18 6 +/- 4 5% 

of cyclopham, p = 0.02; from 60.2 +/- I4.0H.to 33.4*- 8.5*. p = 0.03; re peodvely/ 
UCP3 p r o to„ content was also significantly reduced (272.19 +/- 84.13 vs. H5.78 -J- 

l^t-'annT 1 fu regression anaIysis = 0 - 90 > showed «•* IMTG 

"p * " r J, ^ eSented 1,16 most P owerfUI independent variable for predicting 
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Differential expression of the short and long forms of the gamma 2 subunit 
of the GABAA/beuzodiazepme receptors. 

MirallesCP , Gutierrez A , Khan ZCJ . Vttorica J , De Bias AL . 

Division of Molecular Biology and Biochemistry, School of Biological Sciences, 
University of Missouri-Kansas City 64110-2499. ' 

The distribution of the mRNAs encoding the gamma 2S and gamma 2L subunits of the 
GABAA receptor in the rat brain has been revealed by in situ hybridization, northern blot 
and dot blot analysis using specific antisense oligonucleotides. In addition, the 
quantitative distribution of the gamma 2S and gamma 2L subunit peptides participating in 
the fully assembled GABAA recepto^enzodiazepine receptors has been mapped by 
immunoprecipitation with specific anti-gamma 2S and anti-gamma 2L antibodies. 
Several neuronal types and brain regions ace enriched in gamma 2L such as neurons of 
the layer II of striate cortex and cerebellar Purkinje cells as well as the inferior colliculus, - 
superior colliculus, deep cerebellar nuclei, medulla and pons. Other neuronal types and 
regions are enriched in gamma 2S such as the mitral cells of the olfactory bulb, pyramidal 
neurons of the pyriform cortex, layer VI of the neocortex, granule cells of the dentate 
gyrus and pyramidal cells of the hippocampus. Other cortical areas and cerebellar granule 
cells express both gamma 2S and gamma 2L in comparable amounts: There is a good 
correlation between the relative expression of gamma 2S and gamma 2L mRNAs and the 
relative presence of these protein subunits in fully assembled and mature receptors in the 
studied brain regions. The differential distribution of gamma 2S and gamma 2L might 
result in differential ethanol sensitivity of the neurons expressing these GABAA receptor 
subunits. 
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Follicle-Stimulating Hormone Receptor and Its Messenger Ribonucleic Acid Are 
Present in the Bovine Cervix and Can Regulate Cervical Prostanoid Synthesis 1 

D. Mizrachi and M. Shemesh 2 

Department of Hormone Research, Kimron Veterinary Institute, Bet Dagan, Israel 50250 



ABSTRACT 

The hypothesis that FSH regulates the bovine cervical pros- 
taglandin E 2 (PGE 2 ) synthesis that is known to be associated with 
cervical relaxation and opening at the time of estrus was inves- 
tigated. Cervical tissue from pre-estrous/estrous, luteal, and 
postovulatory cows were examined for 1) the presence of bovine 
(b) FSH receptor (R) and its corresponding mRNA and 2) the 
effect of FSH on the PCE 2 regulatory pathway in vitro. The pres- 
ence of bFSHR mRNA in the cervix (maximal during pre-estrus/ 
estrus) was demonstrated by the expression of a reverse tran- 
scription (RT) polymerase chain reaction (PCR) product (384 
base pairs) specific for bFSHR mRNA and sequencing. Northern 
blotting revealed three transcripts (2.5, 3.3, and 3.8 kilobases 
[kb]) in cervix from pre-estrous/estrous cows. The level of FSHR 
(75 kDa) was significantly higher (p < 0.01) in Western blots of 
pre-estrous/estrous cervix than in other cervical tissues. There 
was a good correlation between the 75-kDa protein expression 
and its corresponding transcript of 2.55 kb throughout the es- 
trous cycle as described by Northern blot analysis as well as RT- 
PCR. Incubation of FSH (10 ng/ml) with pre-estrous/estrous cer- 
vix resulted in a 3-fold increase in the expression of FSHR and 
a 2-fold increase in both G protein (aj and cyclooxygenase II. 
FSH (5-20 ng/ml) significantly increased (p < 0.01) cAMP, ino- 
sitol phosphate (p < 0.01), and PGE 2 (p < 0.01) production by 
pre-estrous/estrous cervix but not by cervix at the other stages. 
We conclude that bovine cervix at the time of the peripheral 
plasma FSH peak (pre-estrus/estrus) contains high levels of 
FSHR and responds to FSH by increasing the PGE 2 production 
responsible for cervical relaxation at estrus. 

INTRODUCTION 

FSH and LH are the two known pituitary hormones that 
regulate gonadal functions. FSH is essential for female fer- 
tility, specifically for folliculogenesis, and human ovarian 
failure has been related to mutations both in the FSH re- 
ceptor (FSHR) and in the FSHp gene [1]. However, unlike 
LH, for which extra-gonadal binding has been documented 
for the uterus in the cow [2], pig [3], and human [4], no 
extra-gonadal effects of FSH have been reported. The uter- 
ine LH receptors have been shown to be physiologically 
active in the cow [2, 5]. The concentration of uterine LH 
receptors is related to the phase of the cycle, and binding 
of LH to uterine receptors increases uterine prostaglandin 
synthesis [2, 5]. Preliminary observations in our laboratory 
indicated that both FSH and LH receptors are also present 
in the cervix. We therefore investigated whether FSHRs are 
associated with the regulation of cervical prostaglandin, 
particularly around the time of estrus, when both cervical 
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PGE 2 production [6] and peripheral FSH concentrations are 
maximal. 

The present investigation was therefore conducted to de- 
termine the presence of FSHR and its mRNA, and its phys- 
iological activity, in the bovine reproductive tract. Since the 
cervix is a prostaglandin Ej (PGEi)- and PGE 2 -sensitive 
tissue and responds to hormonal stimulation, we also de- 
termined whether FSH could increase cervical cyclooxy- 
genase expression and production of PGE 2 in vitro, partic- 
ularly at the time near estrus. 

MATERIALS AND METHODS 

Animals 

Cervical and ovarian tissues were collected from Hol- 
stein cows at a local abattoir. The stage of the cycle was 
determined according to signs of ovulation and the status 
of corpora lutea (CL; weight, color). These parameters al- 
lowed the classification of the cervix into three groups: 1) 
pre-estrous/estrous (follicular, 18-20 days postovulation; 
regressed CL < 1.0 g; presence of cervical mucus; n = 
54); 2) postovulatory (1-4 days postovulation; presence of 
corpora hemorrhagica; signs of follicular rupture; h = 35), 
and 3) luteal (12-16 days postovulation; CL 4-6 g; n = 
44). Bovine granulosa cells were collected from pre-es- 
trous/estrous follicles and used as a positive control for 
FSHR expression [7]. 

The cervix external os segment was taken as the initial 
3 cm of the cervix (total length of cervix is about 9 cm). 
The cervical os segments, primarily luminally oriented 
muscularis with epithelium, were sliced into horizontal- 
strips (1 cm in length), pooled, and minced finely with a 
scalpel. The Kimron Veterinary Institute Animal Care and 
Use Committee approved all procedures. 

FSHR Gene Expression 

RNA isolation. Bovine cervical or ovarian tissues were 
rapidly dissected and frozen in liquid nitrogen (within 20 
min after slaughter). Total RNA was extracted using the 
acid phenol, guanidine thiocyanate technique [8], using 
TriReagent (Molecular Research Center, Inc., Cincinnati, 
OH) according to the manufacturer's instructions. 

Reverse transcription (RT)-polymerase chain reaction 
(PCR). Bovine FSHR mRNA was detected as previously 
described for the LH receptor mRNA [5]. Oligonucleotides 
corresponding to the published sequence of the bovine 
FSHR [7] were synthesized and used as primers in a PCR 
reaction. Bovine ovarian and cervical tissue cDNA ob- 
tained by RT of 2 \ig RNA was used as a template in the 
PCR reaction using a pair of primers corresponding to the 
transmembrane segments of the FSHR. The primers, 20- 
mer each, were selected using the OL1GO Program (Oligo 
Rl primer analysis software; National Biosciences, Plym- 
outh, MN). The forward primer corresponded , to position 
1399-1419 (5'CGGCTTTTTCACTGTCTTTG3') on the 
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bovine FSHR mRNA [7]. The reverse primer was a 20-mer 
oligonucleotide complementary to position 1763-1783 
(5'CGCTTGGCTATCTTGGTGTC3'). The predicted size 
of the RT-PCR product was 384 basepairs (bp). Four mi- 
croliters of the RT reaction were used as a template for 
PCR; the reaction was allowed to proceed 35 cycles using 
2 U Termus Aquaticus DNA polymerase (recombinant) 
provided by MBI Fermentas, Vilnius, Lithuania, and 200 
pmol of each primer. The cycling parameters of the PCR 
were 98°C for 20 sec, 56°C for 45 sec, and 72°C for 45 
sec. After amplification the samples were separated on a 
1% agarose gel, stained with ethidium bromide, and pho- 
tographed under UV light. The 384-bp fragment was ex- 
tracted using the Wizard PCR Preps Kit (Promega, Madi- 
son, WI) and sequenced with the upper primer using an 
automatic sequencer (Applied Biosystems, Foster City, 
CA). For internal control, a bovine 0-actin fragment of 890 
bp was produced from an upper primer (5'ACCA- 
ACTGGGACG ACATGGAG3 ' ; 21 mer) and a lower prim- 
er (5 'GCATTTGCGGTGGACAATGG A3 ' ; 21 mer) as pre- 
viously described [5]. Each PCR amplification was stan- 
dardized using ovarian granulosa cDNA to produce a spe- 
cific band of the predicted size. Standardization was based 
on temperature, cycle number, Mg level, and pH. 

Western Blots 

Cervical, bovine seminal vesicle, or muscle minces were 
sonicated for 45 sec, lysed in 500 u.1 lysing buffer (1% 
Nonidet-40 [Sigma, St. Louis, MO], 2 mM EGTA, 5 nM 
MgCl ? , 1 mM PMSF in PBS) and incubated for 2 h at 4°C. 
After centrifugation, aliquots were taken for protein deter- 
mination using gamma globulin (Sigma) as reference stan- 
dard [9] and a protein dye binding method (Bio-Rad Lab- 
oratories, Richmond, CA). Sixty micrograms of protein of 
the soluble tissue extract was. then separated by electropho- 
resis on 5-9%-gradient SDS-PAGE and electroblotted onto 
nitrocellulose paper (Amersham, Little Chalfont, Bucks, 
UK) as described previously [9]. The nitrocellulose mem- 
brane was washed with PBS containing 0.05% TWEEN-20 
and blocked with 10% horse serum (Kimron Veterinary In- 
stitute) in washing solution. The nitrocellulose membrane 
was then treated with either 1) an anti-peptide antibody 
(code name 179) raised against human (h) FSHR peptide 
265-295, diluted 1:500 (donation of Dr. J.A. Dias, Wads- 
worth Center, New York State Department of Health); 2) 
rabbit antiserum for bovine G protein (ot ? ) diluted 1:1000; 
UBI, Lake Placid, NY); or 3) rabbit anti-bovine cycloox- 
ygenase polyclonal antiserum (diluted 1 :200; Kimron Vet- 
erinary Institute) [9]. Different dilutions were used for each 
antibody, and the final dilution used was 75% of the dilu- 
tion, which gave a maximal signal as previously described 
[9]. The nitrocellulose paper was then incubated with horse- 
radish peroxidase-conjugated goat anti-rabbit IgG (Sigma 
Israel, Rehovot, Israel; diluted 1 :2000 in washing buffer) 
for 1 h at room temperature. The presence of FSHR, G 
protein (a s ), or cyclooxygenase was then visualized by 
means of a color reaction as. follows. The nitrocellulose 
paper was incubated in a substrate solution containing 3'3'- 
diaminobenzidine (0.5 mg/ml; Sigma) in a mixture of PBS 
containing 0.5% CaCl 2 and 6% H 2 0 2 - The antibody to 
hFSHR recognized the 75-kDa protein of the FSHR. The 
antibody to G (a s ) recognized both the 42-kDa and 87-kDa 
forms of this protein, and the antibody to cyclooxygenase 
recognized the 72-kDa form (cyclooxygenase II). Extract 
of bovine seminal vesicles that are known to have a high 



content of cyclooxygenase was used as positive control as 
previously described [9]. The densitometric scans were ob- 
tained using a bio-imaging system (B.I.S. 2020; Rhenium 
Dingo, Jerusalem, Israel) and processed with Tina 2.0 soft- 
ware (Fuji, Japan). Linearity of detection [9] was deter- 
mined for densitometry for both Western and Northern 
blots. Each Western blot was evaluated in the absence of 
the first antibody, and no signal was detected. 

Northern Blot Analysis 

For Northern blots, 10 \*>g of total RNA was denatured 
in 20 u.1 50% formamide/2.2 M formaldehyde in single- 
strength 3-(«-morphoIino) propanesulfonic acid (MOPS) 
buffer (0.04 M MOPS, 10 mM sodium acetate, 1 mM 
EDTA) at 65°C for 10 min. Samples were placed on ice, 
and 5 p,! of loading buffer (0.5% SDS, 0.25% bromophenol 
blue, 25% glycerol, 25 mM EDTA) was added. Samples 
were subjected to electrophoresis through a 1.2% agarose 
formaldehyde gel and transferred by capillary blotting to 
nylon membranes (Nytran; Schleicher and Schuell, Keene, 
NH). RNA was subsequently UV cross-linked to mem- 
branes. The RT-PCR product fragments of 384 bp for 
FSHR and 890 bp for p-actin were used to generate bioti- 
nylated probes using random primer biotin labeling of DNA 
for chemiluminescence (NEBlot phototype kit; Biolabs, 
Beverly, MA) according to the manufacturer's instructions. 
The blotted membrane was prehybridized at 65°C for 1 h 
in prehybridization solution (6-strength SSC [3 M NaCl, 
0.3 M sodium citrate], 5-strength Denhardt's reagent [1% 
ficoll, 1% polyvinylpyrrolidone, 1% BSA], 0.5% SDS, and 
100 p-g/ml denatured salmon sperm DNAs) and hybridized 
overnight in the same solution with the denatured biotiny- 
lated probe to the target RNA. The membrane was then 
washed in double-strength SSC, 0.1% SDS at room tem- 
perature for 10 min and washed again in 0.1-strength SSC, 
0.1% SDS at 68°C for 15 min followed by subsequent 
chemiluminescence, development, and detection on x-ray 
film (RX; Fuji Film, Tokyo, Japan). To obtain quantitative 
data for the specific mRNA, we used the densimetric value 
for p-actin mRNA to normalize each specific mRNA value. 
To remove FSHR probe, membranes were stripped using 
water for 15 min, then incubated in 0.4 M NaOH, 0.1% 
SDS at 80°C for 30 min. The membrane was- rinsed again 
in 0.2 M Tris-HCl, 0. 1 -strength SSC for 30 min at 25°C. 
After stripping, prehybridization and hybridization with (3- 
actin probe were performed as described for FSHR mRNA. 

Cyclic AMP Determinations 

Cervical tissue minces (50 mg/ml) were incubated for 
10 min in wells of 1.5 ml containing 1.0 ml of Tissue Cul- 
ture Medium-199 (TCM-199 without serum (Biological In- 
dustries, Beit Haemek, Israel) in the absence or presence 
of physiological (5, 10 ng/ml) and pharmacological doses 
(20, 40 ng/ml) of FSH (USDA-bFSH-8-1 ; no further in- 
crease in cAMP production was seen at 50 or 100 ng/ml) 
or forskolin, a stimulator of adenylate cyclase (10 jxM; Sig- 
ma). The dose of forskolin was selected to give a 3-fold 
increase in cAMP At the end of the incubation, tissues were 
removed, blotted on filter paper to remove mucus, and in- 
cubated overnight at 4°C with 400 |xl of 3% HC10 4 . The 
solution was then neutralized with 150 \xA of KHC0 3 and 
centrifuged, and 50-uJ aliquots were taken for the radiore- 
ceptor assay as described by Brown et al. [10] and modified 
for endometrium by Miyazaki et al. [11]. Standards (0- 
1000 pg of cAMP) or samples were incubated at room tern- 
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FIG. 1 . RT-PCR amplification of bovine ceryix FSHR and P-actin mRNA. 
RT-PCR was performed as described in Materials and Methods for FSHR 
and p-actin. Separate RT-PCR amplification products of FSHR (estimated 
384 bp) and (J-actin (estimated 890 bp) were combined and separated on 
agarose gel electrophoresis and stained with ethidium bromide. RT-PCR 
was performed using 2 jxg RNA of bovine granulosa cells as a positive 
control (lane 1), cervical tissue at the pre-estrous/estrous (lanes 3-4), post- 
ovulatory (lanes 5-6), and luteal phases (lanes 7-8). Lanes 9-1 0 represent 
reamplification, using as template the PCR product, cDNA, from the lu- 
teal -phase amplification. Lane 2 represents negative control in the ab- 
sence of RNA. The tissue was taken from 4 cows at each stage of the 
estrous cycle, and two representative samples from each stage are shown. 
SM, size markers. . . 



perature with 100 jxl of cAMP binding protein prepared 
from bovine adrenal extracts as described [10]. The buffer 
used for the assay was added to form a final reaction vol- 
ume of 400 ul After 2-h incubation in a cold room, the 
reaction was stopped by adding 300 |xl of a charcoal-dex- 
tran solution; centrifiigation followed, and the supernatant 
was removed for counting in a scintillation counter. The 
sensitivity of the assay was 60 pg/tube. Quadruplicate as- 
says were made for each value determined. The within- 
assay and between-assay coefficients of variance were 8% 
and 10%, respectively. 



1401 

BTFSH GCTTTTTCAC TGTCTTTGCC AGTGAGCTCT CAGTCTACAC TCTGACCGCC 
Cervix GCTTTTTCAC TGTCTTTACT AGTGAGCTCT CAGTCTACAC TCTGACCGCC 

BTFSH ATCACGCTGG AAAGATGGCA TACCATCACC CATGCCATGC AGCTCGAATG 
Cervix ATCACGCTGG AAAGATGCCA TCCCATCACC CGTGCCATGC AGCTCGAATG 

BTFSH CAAAGTGCAG CTCCGCCATG CTGCCAGCAT CATGCTGGTG GGCTGGATCT 
Cervix CAAAGTGCAG CTCCGCCATG CTGCCAGCAT CATGCTGGTG GGCTGGATCT 

BTFSH TTGCTTTTGC AGTTGCCCTT TTTCCCATCT TTGGCATCAG CAGCTACATG 
Cervix TTGCTTTTGC AGTTGCCCTT TTTCCCATCT TTGGCATCAG CAGCTACATG 

BTFSH AAGGTGAGCA TCTGCCTGCC CATGGACATT GACAGCCCCT TGTCACAACT 
Cervix AAGGTGAGCA TCTGCCTGCC CATGGACATT GACAGCCCCT TGTCACAACT 

BTFSH CTATGTCATG TCCCTCCTTG TGCTCAATGT CCTGGCCTTT GTGGTCATCT 
Cervix CTATGTCATG TCCCTCCTTG TGCTCAATGT CCTGGCCTTT GTGGTCATCT 

BTFSH GTGGCTGCTA CACTCACATC TACCTCACGG TGAGGAACCC CAACATCACA 
Cervix GTGGCTXCTT CACTCACATC TACCTCACGG TGAGGAACCC CAACATCACA 

1761 

BTFSH TCCTCTTCTA GTG 
Cervix TCCTCTTCTA GTG 

FIG, 2. Automated nucleotide sequencing and homology between Bos 
taurus mRNA FSHR (GenBank accession number L22319) from nucleo- 
tides 1401 to 1 763 and the PCR products obtained using the upper primer 
as described in Materials and Methods. A homology of 97.5% was found 
between Bos taurus and our amplified cDNA obtained by RT-PCR from 
cervical mRNA at pre-estrus/estrus, suggesting that they are complemen- 
tary. 



Measurement of Phospholipase C (PLC) Activity 

Cervical minces (30 mg) were incubated in 1 ml TCM- 
199 containing 5 mCi [ 3 H]myo-inositol for 90 min. Slices 
were rinsed with 1 ml TCM-199 and incubated for another 
60 min to remove unincorporated [ 3 H]myo-inositol. At the 
end of the preincubations, slices were incubated in 1 ml 
TCM-199 containing 10 mM LiCl to inhibit inositol phos- 
phate (IP) hydrolysis, allowing 3 H-IP, 3 H-IP 2 , and 3 H-IP 3 to 
accumulate in the tissue after stimulation of PLC, in ab- 
sence or presence of 0-20 ng/ml of FSH or 0-20 ng/ml 
LH (USD A bLH-I-1); Incubations were terminated by add- 
ing 2 ml ice-cold chloroform/methanol/hydrochloric acid 
(5:10:0.1, v:v:v) and then 1 ml chloroform and 1 ml EDTA 
(5 mM) and extracted as described by Kisielewska et al. 
[12]. 3 H-IP, 3 H-IP 2 , and 3 H-IP 3 were separated from the 
labeled inositol compounds by using Dowex AG 1-X8 (for- 
mate form). 3 H-Labeled IP was eluted from the column by . 
sequential elution into 10 fractions. Aliquots of each frac- 
tion were added to scintillation vials containing 5 ml scin- 
tillation fluid, and radioactivity was determined using a 
scintillation counter. Total activity of PLC is expressed as 
cpm of total IPs/30 mg/30 min. 

RIA for PGE 2 and PCF 2o . 

Aliquots of 100 \l\ were taken at the end of the incu- 
bation period for specific RIA of PGE 2 and PGF 2a , which 
were performed without chromatographic separation. The 
antisera for PGF 2a (Sigma) reacts preferentially with PGF 2a 
but cross-reacts with PGF la (60%) and to a negligible ex- 
tent (< 0.1%) with prostaglandins of the A, B, and E series. 
The antisera to PGE 2 (Sigma Israel) reacts preferentially 
with PGE 2 but cross-reacts with PGEj (20%), PGA,, PGA 2) 
PGF lu , and PGF 2a (< 10%) and to a negligible extent with 
PGB! and PGB 2 (< 0.1%). The intraassay coefficients of 
variation were 9% and 1 1%, and the interassay coefficients 
of variance were .12,3% and 13% for PGF 2ft and PGE 2 . 

Statistical Analysis 

ANOVA was performed with a significance level of p 
< 0.05. Data were further analyzed using Tukey's proce- 
dure (/? < 0.05 or p < 0.01) to assess significance between 
treatments. Student's /-test was used where appropriate. 
Values are expressed as means ± SEM. 

RESULTS 



RT-PCR demonstrated that the FSHR gene is expressed 
in bovine cervix as indicated by the presence of the 384- 
bp fragment corresponding to bFSHR mRNA. The mRNA 
was maximally expressed in the cervix during the pre-es- 
trous/estrous phase. In contrast, the expression of the gene 
was reduced in postovulation cervix and not detectable, in 
luteal-phase cervix (Fig. 1). However, reamplification of the 
PCR product, cDNA, from the luteal phase produced a de- 
tectable signal (Fig. 1). 

To demonstrate that the absence of FSHR mRNA in lu- 
teal-phase cervix was specific, bovine (3-actin mRNA was 
used as internal control. PCR demonstrated that the specific 
890 bp of the p-actin cDNA band was produced by the 
cervical tissues of all the stages. The band for (3-actin was 
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FIG. 3. Northern blot for analysis of total RNA (10 jig) extracted from 
bovine cervical tissues collected from pre-estrous/estrous, postovulatory, 
and luteal-phase cows, and bovine granulosa cells as a positive control. 
Relative migration of 28S and 18S ribosomal RNA is indicated. To localize 
rinRNA for FSHR and 0-actin, our RT-PCR products (384-bp and 890-bp 
fragments, respectively) were used to generate biotinylated probes using 
biotin random primer. Figure shows results for FSHR mRNA in bovine 
granulosa cells as a positive control (lane A); and in cervical tissues at 
pre-estrus/estrus (lanes B-C), postovulation, (lanes D-E) and the luteal 
phase (lanes F-G). The corresponding (3-actin band was obtained by re- 
probing of the membranes previously tested for FSHR mRNA as described 
in Materials and Methods. Columns show mean ± SEM of the optical 
density of the ratio FSHR:[3-actin, in which the luteal phase is given an 
arbitrary value of 1.0. 'Significantly different from luteal-phase value (p 
< 0.01); n = 5 for each stage. 
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FIG. 4. Presence of FSHR protein in cervical tissues as determined by 
Western blot. Soluble cell extracts (60 fig protein) were used to determine 
the amount of FSHR as described in Materials and Methods. Western blot 
proteins were transferred to nitrocellulose paper and probed with anti- 
serum to peptide hFSHR, 265-296. Western blot displays a representative 
result for bovine granulosa cells, used as positive control (lane A); bovine 
seminal vesicles (lane B) and muscle, used as a negative control (lane C); 
and cervical tissues from three stages of the estrous cycle: pre-estrous/ 
estrous (lane D), postovulatory (lane E), and luteal (lane F). Histogram 
summarizes the results for the three stages of the estrous cycle. Columns 
show mean ± SEM of the relative optical density in which the luteal phase 
is given an arbitrary value of 1.0. *Significantly different from luteal-phase 
value (p < 0.01); n = 7 for each stage. 



present even though the 384-bp band corresponding to the 
FSHR was absent (Fig. 1). 

The nucleotide sequence (Fig. 2) of the 384-bp fragment 
obtained from pre-estrous/estrous cervix was compared 
with that for the Bos taurus FSHR reported by Houde et 
al. [7]. It was found that there was a 97.5% homology be- 
tween the 384-bp fragment and the nucleotide range 1410- 
1763 found in the Bos taurus FSHR. 

The Northern blot contained the expected three tran- 
scripts of 2.55, 3.3, and 3.8 kilobases (kb) present in the 
bovine granulosa. However, the high (6.8-kb) and low (1.6- 
kb) transcripts found in the ovary were not found in the 
cervix. The three transcripts were found in pre-estrous/es- 
trous cervix, but only one transcript (2.55 kb) was seen in 
luteal cervix (1/8 of the pre-estrus/estrus level; n = 5) or 
postovulatory cervix (1/3 of the pre-estrus/estrus level; n = 
5) (Fig. 3). 
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FIG. 5. Effect of incubation of cervical tissues with FSH on the expres- 
sion of" the FSHR protein. Cervical tissue from 5 pre-estrous/estrous cows 
was incubated in the absence or presence of FSH (0 or 10 ng/ml). Western 
blot is representative for FSH protein concentration after 3 h of incuba- 
tion, with granulosa cell extract serving as positive control. Lane A, Gran- 
ulosa cells (CC); lane 8, cervical tissue at Time 0; lane C, cervical tissue 
after 3 h of incubation in TCM-199; lane D, cervical tissue after 3 h of 
incubation with 10 ng/ml of FSH. Lane E represents muscle tissue incu- 
bated with FSH (10 ng/ml) for 3 h as a negative control. Histogram sum- 
marizes the results of six experiments. Columns are means ± SEM. ♦Sig- 
nificantly different (p < 0.01) from the column representing the control 
at 3 h (a given value of 1 ). 



FSHR Protein in the Cervix 

The presence of FSHR protein in the cervix was dem- 
onstrated as determined by Western blot analysis using a 
specific antipeptide antibody for hFSHR. The antibody rec- 
ognized a 75-kDa protein in both bovine granulosa and 
cervical preparations identical with the predicted molecular 
size of FSHR. The signal for the 75-kDa protein was stron- 
gest in pre-estrous/estrous cervix when compared with 
postovulation- (3-fold) or luteal-phase (6-fold) cervix (Fig. 
4). With the use of 60 jxg protein in each lane, this protein 
was observed to be expressed throughout the estrous cycle. 

In Vitro Effect of FSH on Induction of FSHRs 

A time-course (0.75, 1.5, and 3 h) and dose-response (0, 
10, and 20 ng/ml) study was carried out to determine in 
vitro effect of FSH on its own receptor. Cervical minces 
(100 mg) from cows in the pre-estrous/estrous, postovula- 
tory, and luteal phases were used. It was found, as deter- 
mined, by Western blot, that FSH induced its own receptor 
in cervical tissues at pre-estrus/estrus. with a 3-fold increase 
(p < 0.01) observed at 10 ng/ml after 3 h of incubation 
(Fig. 5). However, no effect of FSH was seen at earlier 
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FIG. 6. Effect of incubation of cervical tissues with FSH on the synthesis 
of G protein. Cervical tissues from 5 pre : estrous/estrous cows were in- 
cubated in the absence or presence of FSH (10 ng/ml). Both the 87-kDa 
G protein (a, 0, 7 complex) and 42-kDa protein (otj are elevated at 3 h 
of incubation. Granulosa cell extract was used as a positive control. Lane 
A, Granulosa cells (GC); lane B, cervical tissue at Time 0; lane C cervical 
tissue after 3 h of incubation in TCM-199; lane D, cervical tissue after 3 
h of incubation with 10 ng/ml of FSH. Columns are means ± SEM of 42- 
kDa protein. *Significant|y different (p < 0.05) from the column repre- 
senting the control at 3 h (with an arbitrary value of 1 .0). 



times at the 10 ng/ml level. When 20 ng/ml was used, a 
stimulatory effect (2-fold) was seen after 1.5 h. No signif- 
icant response was observed when tissues of the postovu- 
latory or luteal phase were used at any dose or time tested 
(data not shown). 

Effect of FSH Treatment on C Protein (a J 

A time-course and dose-response study was carried out 
to determine whether cervical FSHR was coupled to G pro- 
tein. Cervical minces (100 mg) from cows in the pre-es- 
trous/estrous, postovulatory, and luteal phases were incu- 
bated in the presence of FSH (0-20 ng/ml) for 0.75, 1.5, 
and 3 h. It was found in pre-estrous/estrous cervical tissue 
that, in the presence of FSH (10 ng/ml), there was a sig- 
nificantly (p < 0.05) higher level of G protein (d s ) at the 
level of both the 87-kDa (a, (3, 7 complex) and the 42-kDa 
(a s ) proteins after 3 h of incubation (Fig. 6). FSH, at either 
10 or 20 ng/ml, did not have a stabilizing effect on the 
level of G protein (a s ) at shorter times of incubation (0.75 
or 1.5 h). No significant response was observed when tis- 
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FIG. 7. Effect of FSH on IP production in cervical tissue. Cervical tissues 
from six cows at each of three stages of the cycle were incubated for 30 
min in the absence or presence of FSH (10 ng/ml) or LH (ng/ml). IPs 
produced were determined in four replicates by amount of labeled com- 
pound (cpm) produced. Columns are means ± SEM (*p < 0.05; **p < 
0.01 ; significantly different from control value for each stage of the cycle). 



sues of the postovulatory or luteal phase were used at any 
dose or time tested (data not shown). 

IP Pathway Activation by FSH 

For IP measurement, cervical tissue minces (30 mg) 
from pre-estrous/estrous, postovulatory, and luteal-phase 
cows were incubated for 30 min in the absence or presence 
of FSH (10 ng/ml) or LH (10 ng/ml). FSH significantly (p 
< 0.01) increased the level of IP, IP 2 , and IP 3 at pre-estrus/ 
estrus. However, FSH significantly (p < 0.05) inhibited IP 
level in the postovulatory stage. In contrast, LH, but not 
FSH, stimulated IPs significantly (/? < 0 0 1 ) at the luteal 
phase (Fig. 7). 

Adenylate Cyclase Activation by FSH 

A dose-response analysis was used to determine whether, 
the cervical FSHR was coupled to adenylate cyclase. In 
these experiments, cervical tissue minces (50 mg) obtained 
from pre-estrous/estrous, postovulatory, and luteal-phase 
cows were incubated with FSH (0-40 ng/ml). FSH in- 
creased cAMP accumulation in a dose-dependent manner 
when incubated with pre-estrous/estrous cervical tissues, 
with maximal stimulation 2.5 times (p < 0.01) that of con- 
trol in the presence of 10 ng/ml (Fig. 8 A). In contrast, cer- 
vical tissue from luteal or postovulation did not respond to 
FSH even though tissues from all phases of the cycle re- 
sponded to forskolin with a significant (p < 0.01) increase 
in cAMP (Fig. 8B). 

Effect of FSH on PCE Production by Cervical Tissues 

A dose-response analysis was carried out to determine 
whether FSHR was associated with cervical prostaglandin 
production. PGE 2 and PGF 2a production by cervical minces 
(100 mg/ml) in the presence of FSH (0-20 ng/ml) was de- 
termined by R1A. FSH stimulated (p < 0.01) basal PGE 2 
production in a dose-dependent manner and caused a 3-fold 
increase in PGE 2 production at 10 ng/ml in pre-estrous/ 
estrous cervical tissues (n = 8, Fig. 9). FSH had no effect 
on the small amount of basal PGF 2a (< 1 ng/100 mg) pro- 
duced by cervical tissue (data not shown). FSH did not 
elevate PGE 2 in cervical tissues from the luteal phase; fur- 
thermore, a small but significant (p < 0.05) inhibition was 
observed at postovulation (Fig. 9). 
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FIG. 8. Effect of FSH on adenylate cyclase production in cervical tissue. 
A) Cervical tissues from six cows at pre-estrus/estrus were incubated (4 
replicates) for 10 min in the absence or presence of FSH (0, 5, 10, 20, or 
40 ng/ml) or forskolin (10 ^mol/L), and the amount of cAMP produced 
was measured by a protein binding assay. B) Cervical tissues from six 
cows at each of the three stages of the estrous cycle were incubated (4 
replicates) for 10 min in the absence or presence of FSH (10 ng/ml) or 
forskolin (10 u.M). The amount of cAMP produced was measured by a 
protein binding assay. Columns are means ± SEM. Columns with asterisks 
were statistically different from their own control (*p < 0.01; **p < 0.05). 
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FIG. 9. FSH dose response on production of PCE 2 by cervical tissues. 
Cervical tissues from 8 pre-estrous/estrous, postovulatory, or luteal-phase 
cows were incubated in the absence or presence of FSH (5, 10, or 20 ng/ 
ml) for 6 h, and the effect on the production of PGE 2 was measured by 
RIA. *p < 0.01; **p < 0.05. 
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FIG. 10. Effect of incubation of cervical tissues with FSH on the ex- 
pression of cyclooxygenase. Cervical tissues from 5 pre-estrous/estrous 
cows were incubated in the absence or presence of FSH (10 ng/ml). The 
72-kDa cyclooxygenase II protein was elevated at 3 h of incubation.. Lane 
A, bovine granulosa cells (GC); lane B, cervical tissue at Time 0; lane C, 
cervical tissue after 3 h of incubation in TCM-1 99; lane D, cervical tissue 
after 3 h of incubation with 10 ng/ml of FSH. Columns are means ± SEM 
of 72-kDa protein. ^Significantly different (p < 0.01) from the column 
representing the control at 3 h (with an arbitrary value of 1 .0). 



Cyclooxygenase Activation by FSH 

To determine whether the presence of FSHR was asso- 
ciated with the expression of cyclooxygenase, cervical 
minces were incubated for 3 h in the presence or absence 
of FSH (10 ng/ml). Tissues from pre-estrous/estrous, post- 
ovulatory, and luteal-phase cows were extracted, separated 
on SDS-PAGE, and tested for the cyclooxygenase II (72 
kDa) using a specific antibody. It was found that in cervical 
tissues from six pre-estrous/estrous cows, FSH induced a 
200% increase in the expression of cyclooxygenase after 3 
h of incubation (Fig. 10). In contrast, no effect on cyclo- 
oxygenase expression was seen in cervical tissues from lu- 
teal-phase cows, and an insignificant (p > 0.05) elevation 
at the postovulatory phase was observed (data not shown). 

DISCUSSION 

Results of this study demonstrate by both PCR amplifi- 
cation and Northern blot analysis that FSHR mRNA is pres- 
ent in the bovine cervix. The bands corresponding to the 
receptor (75 kDa) and its mRNA were maximally expressed 
during the pre-estrous/estrous phase. Incubation of cervical 
tissues with FSH increased the expression of cyclooxygen- 



ase and the production of PGE 2 (and possibly PGEO at the 
pre-estrous/estrous phase. The stimulation of cervical cy- 
clooxygenase was associated with activation of both the 
PLC and adenylate cyclase second messenger G-protein re- 
lated pathways. 

Northern blot analysis of bovine cervical tissues from 
pre-estrous/estrous cows revealed multiple transcripts for 
bovine cervical FSHR. The sizes of the major transcripts 
were 2.5 kb, 3.3 kb, and 3.8 kb, which were similar to those 
obtained in bovine granulosa cells [7], rat testis and ovary 
[13, 14], and human ovary [15] and myometrium [16]. Oth- 
er minor transcripts have been reported in all of these spe- 
cies. The RT-PCR product yielded a single band of 384 bp, 
which was the expected molecular size, so the different 
isoforms of the FSHR are probably the result of differential 
splicing of the same transcript. Isoforms of FSHR have 
been characterized in human [17] and primate [18] ovary 
and in ovine testis [19]. Since only a single 2.5-kb isomer 
was present in luteal-phase cervix (which did weakly pro- 
duce FSHR), it would appear that there are different iso- 
forms present in the bovine cervix as well. 

The nucleotide sequence homology between our RT- 
PCR 384-bp product was 97,3% identical to the comparable 
region of the bovine FSHR, showing that our amplified 
cDNA was complementary to the Bos taurus mRNA FSHR. 
The PCR product of 384-bp cDNA was found primarily in 
the. pre-estrous/estrous cervix. 

Using specific antibody raised in rabbits against hFSHR 
(amino acid sequence 265-296) for Western blot analysis 
resulted in a major signal for a 75-kDa protein. A protein 
of similar molecular mass has been reported in rat and hu- 
man ovary [20]. A strong signal for this protein was seen 
in the pre-estrous/estrous cervix compared with cervix from 
the other stages of the cycle. This cervical receptor was 
regulated by FSH itself, similar to the regulation of the 
FSHR in the rat ovary [14]. 

FSH increased cAMP production .by cervical tissue from 
the pre-estrous/estrous phase but did not elevate cAMP in 
the postovulatory and luteal phases. FSH elevation of IPs 
displayed the same pattern. It would therefore appear that 
the cervical FSHR is associated with signal transduction 
pathways in a way similar to the accepted mechanism of 
LH action in the ovary and testes, i.e., activation of the 
second messenger pathways — adenylate cyclase [21] and 
phosphatidyl-inositol (PLC) [22, 23]. The inhibitory effect 
of FSH on IPs and cAMP at postovulation occur when 
FSHR is low. Since FSH activity is biphasic, i.e., is stim- 
ulated by a low dose and inhibited by a high dose, an in- 
hibitory effect at low concentrations could be the result of 
the change in the ligand-receptor ratio. 

FSH induced about a 200% increase in the expression 
of cyclooxygenase at pre-estrus/estrus but had no effect on 
the two other stages of the cycle. This was similar to the 
response of cyclooxygenase to LH that was observed for 
the bovine endometrium [2] and uterine vein [6]. However, 
in the endometrium, LH increased cyclooxygenase during 
both the luteal and postovulatory phases but not at estrus 
[2]. Furthermore, the induction of cyclooxygenase by go- 
nadotropin in the endometrium was associated with an in- 
creased level of PGF 2fl , while in the cervix PGE 2 was the 
major product. 

The bFSH was of immunological grade and devoid of 
residual biological contamination of LH, thyroid-stimulat- 
ing hormone, growth hormone, prolactin, or ACTH. The 
effects on increasing cAMP, PGE 2? and cyclooxygenase as 
well as induction of FSHR were therefore specific. Fur- 
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thermore, it was found that highly purified hFSH (10-20 
ng/ml) (donation of J.A. Dias) had the same effects as 
bFSH when incubated with cervical tissue under the same 
conditions (2.5-fold increase in cAMP; 3-fold increase in 
PGE 2 ; 2-fold increase in cyclooxygenase). 

The relaxation and opening of the gravid cervix is due 
to active biochemical and structural changes in the cervical 
connective tissue that are mediated in part by prostaglan- 
dins [24]. It is thought that stretching of the cervix is a 
factor in causing the release of these prostaglandins [25, 
26]. The effect of PGE in causing cervical softening in the 
ewe is also well documented [27-29]. In both nonpregnant 
and pregnant cows, PGE 2 causes an increase in cervical 
opening within 3 h of treatment [30]. PGEi administered 
intra-cervically in a jelly was shown to decrease cervical 
resistance within 24 h [31]. 

Recently, we reported [6] that oxytocin caused a signif- 
icant stimulation of PGE 2 production in vitro in cervical 
tissues from pre-estrous/estrous cows but had no effect on 
PGE 2 production in cervical tissue from other stages of the 
estrous cycle of the cow. Similarly, in an initial report [31], 
it was shown that administration of oxytocin to pre-estrous 
cows increased the concentration of PGE 2 in the cervical 
exudate. However, peripheral oxytocin concentrations dur- 
ing estrus are lower than during the luteal phase [32, 33], 
and elevations of both peripheral oxytocin and cervical 
oxytocin receptor concentration are necessary for oxytocin 
to cause cervical softening towards parturition [24]. Fur- 
thermore, progesterone in vitro induced a dose-dependent 
inhibition of PGE 2 release by cervical tissues from pre- 
estrous/estrous cows, and this was associated with a de- 
crease in both basal and oxytocin-stimulated PGE 2 produc- 
tion [6]. It would therefore seem that hormones other than 
oxytocin are responsible for the increase in cervical PGE 2 
in the pre-estrous/estrous cow. The present report indicates 
that FSH, which has its peak peripheral concentration at the 
time of estrus, could be the hormone that increases cervical 
PGE 2 , as the FSHR expression is maximal at this time and 
FSH in vitro increases PGE 2 production by the cervix. 

The bovine cervix at pre-estrus/estrus has high levels of 
FSHR protein and its corresponding mRNA. Activation of 
the receptor by FSH is associated with the G-protein-cou- 
pled receptor family that mediates the cAMP and IP sig- 
nalling pathways. These signalling pathways then increase 
the expression of cyclooxygenase and production of PGE 2 . 
The expression of the FSHR was maximal at the time of 
the FSH peak in the blood and suggests a physiological 
role for FSH in the relaxation and opening of the cervix at 
estrus. 
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The alpha(v)beta6 integrin receptor for Foot-and-mouth disease virus is 
expressed constitutively on the epithelial cells targeted in cattle. 

M^SaghajLE, GojdS, Sjfflr^ouJ, ZhangZ, WcinrebPH, Violette SM, Alexandersen 

o, Jackson T . "~ 

h^tutefor Animal Health, Pirbright Laboratory, Ash Road, Pirbright, Surrey GU24 

Field strains of Foot-and-mouth disease virus (FMDV) use a number of alpha(v)- 
integnns as receptors to initiate infection on cultured cells, and integriris are believed to 
be the receptors used to target epithelial cells in animals. In this study 
immunofluorescence confocal microscopy and real-time RT-PCR were used to 
investigate expression of two of the integrin receptors of FMDV, al P ha(v)beta6 and 
alpha(v)bete3, within various epithelia targeted by this virus in cattle. These studies show 
that alpha(v)beta6 is expressed constitutively on the surfaces of epithelial cells at sites 
where infechous lesions occur during a natural infection, but not at sites where lesions 
are not normally formed. Expression of alpha(v)beta6 protein at these sites showed a 
good correlation with the relative abundance of beta6 mRNA. In contrast, alpha(v)beta3 
protein was only detected at low levels on the vasculature and not on the epithelial cells 
ot any of the tissues investigated. Together, these data suggest that in cattle 
alpha(v)beta6 rather than aI P ha(v)beta3,,serve S as the major receptor that determines the 
tropism of FMDV for the epithelia normally targeted by this virus. 

PMED: 161 8623 1 [PubMed - in process] 
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Uroldnase-mediated posttranscriptional regulation of iirokinase-receptor 
expression in non small cell lung carcinoma. 

Montuori N , MattiejjoA, Mancmi A, Taelialatcla P CgtiM Kneci^ i^qP 

Istituto di Endocrinologia cd Oncologia Sperimentale, Consiglio Nazionaledelle- 
Ricerche, Naples, Italy. 

The urokinase-type plasminogen activator (uPA) and its cellular receptor (uPAR) are 
involved in ^the proteolytic cascade required for tumor cell dissemination and metastasis 
and are highly expressed in many human tumors. We have recently reported that uPA 
independently of its enzymatic activity, is able to increase the expression of its own ' 
receptor m uPAR-transfected kidney cells at a posttranscriptional level. In fact uPA 
upon binding uPAR, modulates the activity and/or the level of a mRNA-stabili'zing factor 
that binds the coding region of uPAR-mRNA. We now investigate the relevance of uPA- 
mediated posttranscriptional regulation of uPAR expression in non small cell lung 
carcinoma (NSCLC), in which the up-regulation of uPAR expression is a prognostic 

, ° W that UPA is ab,e t0 increase uPAR expression, both at protein and 
mRNA levels, .n primary cell cultures obtained from tumor and adjacent normal lung 
tissues of patients affected by NSCLC, thus suggesting that the enzyme can exert its 
ofo m LT, 8 .? " S - We mvesti eated the relationship among the levels of uPA, uPAR and 
uPAR-mRNA binding protein(s) in NSCLC. Lung tissue analysis of 35 NSCLC patients 
shows an increase of both uPA and uPAR in tumor tissues, as compared to adjacent 
norma tissue* in 27 patients (77%); 19 of these 27 patients also show a parallel increase 
of the level and/or binding activityof a cellular protein capable of binding the coding 
region of uPAR-mRNA. Therefore, in rumor tissues, a strong correlation is observed 
among these 3 parameters, uPA, uPAR and the level and/or the activity of a uPAR- 
mRNA binding protein. We then suggest that uPA regulates uPAR expression in NSCLC 
at a posttranscriptional level by increasing uPAR-stability through a cellular factor that 
binds the coding region of uPAR-mRNA. Copyright 2003 Wiley-Liss, Inc. 

PMID: 12704669 [PubMed - indexed for MEDLINE] 
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Vascular endothelial growth factor expression correlates with matrix 
metalloproteinases MT1-MMP, MMP-2 aud MMP-9 in human 
glioblastomas. 

MunautC, Noel A. Hougrand O. Foidart JM. Boniv^rj IVgrM' 

Laboratory of Tumour and Development Biology, University of Liege, Liege, Belgium. • 

Vascular endothelial growth factor (VEGF) is the major endothelial mitogen in central 
nervous system neoplasms and it is expressed in 64-95% of glioblastomas (GBMs) 
Tumour cells are. the main source of VEGF in GBMs whereas VEGF receptors (VEGFR- 
1, .ts soluble form sVEGFR-1 , VEGFR-2 and neuropilin-1) are expressed predominantly 
by endothelial cells. Infiltrating tumour cells and newly-formed capillaries progress 
flirough the extracellular matrix by local proteolysis involving matrix metalloproteinases 
(MMPs). Recent studies have shown that VEGF expression and bioavailability can be 
modulated by MMPs. We reported previously that the expression of MT1-MMP in 
human breast cancer cells was associated with an enhanced VEGF expression We used 
quantitative RT-PCR, Western blot, gelatin zymography and immunohistochemistry to 

mmVZ£F^Z?? E T' VEGFR - 1 ' VEGFR -2. sVEGFR-1, neuropiIin-l r MTl- 

' MMP - 2 . MMP-9 and TIMP-2.in 20 human GBMs and 5 normal brains The 
expression of these MMPs was markedly increased in most GBMs with excellent 
correlaUon between mRNA and protein levels; activated foritis of MMP-2 and MMP-9 
were present in 8/18 and 7/1 8 of GBMs. A majority of GBMs (17/20) also expressed high 
52» 33 prev,ousl y re Ported, with strong correlation between VEGF and MT1 - 

MMP gene expression levels, and double immunostaining showed that VEGF and MT1- 
MMP peptides co-localize in tumour and endothelial cells. Our results suggest that the 
interplay between metalloproteinases and VEGF previously described in experimental 
riw 0 ^ may 3lS0 be °P erative in huma « GBMs. Because of its dual ability to activate 
MMP|-2 and. to up-regulate VEGF, MT1-MMP might be of central importance in the 

fJSTwr. *! MS ™ d represent ^ interesting target for anti-cancer treatments. Copyright 
2003 Wiley-Liss, Inc. 
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VASCULAR ENDOTHELIAL GROWTH FACTOR EXPRESSION CORRFI ATP* 
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Vasoi ar endothelial growth factor (VEGF) is the maior 
endothehal mitogen in central nervoJs system neopfcX 
and .t .s expressed in 64-93% of gHoMasto^u ^GBMsVT^ 

cent studies have shown that VEGF exwes^il/jK- * t 
aMity «„ bc modulated by MM* 'vvTrepTrte 
that the expression of MTI-MMP In human breast ca^fer 
celts was associated with an enhanced VEGF expresVio^£I 

mmpIw"* 1 Pr ° te ' n eve,s : "tivated forms of MMP-2and 
GBI* mX Sa,,i,, " , ^ ,n l . , ! 7/18 of GBMs. A majorityTf 

^MH^^^^^^^ 

"JlhreVtra^E^M^M^ 

tumour and endothelial ce.ls. D« ■ «iX2SS?S2?S2 

© 2003 Wiley-JLtss, Inc. 

Ke, words: VECF; MMP 5 ; gtioblos.omv: br oin:tum or 

Angiogenesis is critical for the development of nonnal tissue 
and solid rumours. This process include/the degrada™„ o/T 
extracellular matrix (ECM) and .he proliferaliorT mi ^ 0 n a „d 
d.ffcre„„aUon of endothelial cells, and is finely re«uCd by 
inhibitory and promoting factors." Among positive factors vas 
colar endothebal growth factor (VEGF) haf b£ n noSlsZ 
major endothelial mitogen in central nervous syXTAs) neo! 
plasms* Strong VEGF expression has been TiZZZ 
h«,ochem.s.ry ,„ 64-95% of glioblastomas (GBMs)' ™S 

m^,i8nan, P rimi,ive """ours of the CNS in 
adu s > Microvascular proliferation is characteristic of these ru- 
mors arrf ,s an essential WHO diagnostic criteria - Tumour ceUs 
are the mam source of VEGF in GBMs whereas VEGF recess 
^f?T n K n " y " P ' eSSed by «"»o.beJial cells.w.There Tc% 
VFrro't ^' h A n >ema 01 aff,ni 'y an<, "Auction signSn. 
VEGFR-1 (flt-1) and VEGFR-2 (flk/KDR) belon- to the Class II 

NRPtHV? roodula '« y EGF availability.*" Neuropilin-I 
(NRPI) ,s a co.,ecep l0f for VEGF that increases by 10-fold me 

o moauiaie VbGF-medwtcd tumour angiogenesis in hum-in ™^ 
hgoant astrocytomas - The coo^ij a ?£&£*£^g l 

Sogene"^ 5 ^ * ' "™ » «* — < * 



hfiitobng mmour cells and newly-formed capillaries progress 
through the ECM by local proteolysis involvmg matrix S 
protemases (MMPs)..*." MMP S m r^.eolytic enzymes ™at£ 
. synthesized as inactive zymogens. Their ZmS£$Z Z 
removal of a propeptide by proteinase cleavage and X* 
ited by various tissue inhibitors of MMPs fJIMPs). Most MMPs 

in the cell membrane by a transmembrane domain or by a gh-co- 
sylphosphalatjdyl-inositol anchor and are classified as mrinh^nl 
*pe MMPs (MT-MMPs).a.. ,„ eaaJr^SftE 
- ^ «o accumulate on.the cytoplasmic . membrane of 
invadopodia where they selectively mediate local rTrSuIar 

Sfe 0 ^ 5 ap ' aS "* b Ieveb ^MTI-MMP. MMP-2 and 
MMP-o. ^^Vmong these MMPs. MTI-MMP might play a cen- 
tral role m the remodeling of ,he ECM as this membfanlbou^ 
protease is able to activate MMP-2 ami MMP-13 ™ Mweover 
MTI-MMP has been shown to promote cell migration in S 
carc.noma cell lines by its ability to cleave larnW5 a malor 

in 0 "of t S, 4 H^ emen ' - d * ^ 

infegrin^ J ° r f °' b >' a,u '° ni ») 2S ^d of a v 3, 

• MT '- MM , P "involved in both developmental and tumour an- 
g3 ogenes,s^ MTI-MMP overexpression in human melanoma 
cells has been assocta.ed with enhanced in vitro invasion and 
increased m v,vo rumour growth and vascularization *« We have 
shown previously that in MCF-7 breast cancer cells, VEGF tran- 
scription is unregulated when MTI-MMP is overexpressed .» Up- 
regulation of VEGF by MTI-MMP has also bcen^eported in^ 
model of human gboma xenograft by Deryugina « at These 
expenmemal data suggest a link between MTI-MMP and the 

human glioblastomas. We compared the expression of VEGF and 
its receptors (VEGFR-1, sVEGFR-1, VEGFR-2 NRPI) with 
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MT1-MMP, MMP-2, MMP-9 and TIMP-2 in a series of 20 GBMs 
and 5 normal brains. Using quantitative RT-PCR, gelatin zymog- 
raphy, Western blot and immunohistochemistry, we showed a 
strong correlation between the expression of VEGF, MT1-MMP, 
MMP-2 and MMP-9 in GBMs. These results are in accordance 
with previous in vino studies and add to the evidence of an 
interplay between VEGF and MMPs in the progression of human 
GBMs. 



MATERIAL AND METHODS 

Patients 

We; studied 20 GBMs diagnosed at the Laboratory of Neuropa- 
tho!o£y-CHU Liege between 1997 and 2001. The series included 
17 primary GBMs (ie., no previous history, of lower grade diffuse 
astrocytoma) and 3 secondary GBMs previous history of 
lower grade diffuse astrocytoma). Clinical information on these 20 
cases have been reported previously as part of a larger series. 31 The 
gender ratio was 1/1, and the age at time of diagnosis ranged from 
41-79 years (mean 56 years). Normal brain cortex arid white 
matter were obtained from 5 patients with intractable epilepsy 
treated by partial temporal lobectomy. Histological examination of 
these specimens showed severe hippocampal sclerosis; frozen tis- 
sue was sampled from microscopically normal inferior temporal 
gyri. Our study was approved by the Ethical Committee of the 
Faculty of Medicine of the University of Liege. 



RNA extraction and cDNA synthesis 

Total RNA was extracted from cryosections with RNeasy Mini 
Kit (QIAGEN GmbH, Hilden, Germany) according to the manu- 
facturer's protocol Total RNA (1 jig) was reverse transcribed with 
a ThermoScript reverse. transcriptase (ThermoScript RT-PCR Sys- 
tem, Invitrogen, Carlsbad, CA) and random hexamers as primers. 

Primers 

: Primers pairs used in our study are described in Table 1. Primers 
for the VEGF gene were chosen to distinguish between VEGF )8S? , 
VEGF 165> VEGF I45 and VEGF, 2I mRNA isoforms. Intron-span- 
ning primers and probes for the TaqMan system (primers for 
VEGFR-1 (F)t-1 X sVEGFR- 1 , VEGFR-2 (KDR/Flk- 1) and NRPl ) 
were designed to meet specific criteria by using Primer Express 
software (Perkin Elmer, -Foster City, CA). All primers were syn- . 
thesized by Eurogentec (Liege, Belgium). The 5'^ and 3'-ead 
nucleotides of the probe were labeled with a "reporter (FAM. = 
6-carboxy-flu6rescein) and a quencher dye (TAMRA = 6-car- 
boxy-tetramethylrhodaroine). We conducted BLASTn (National 
Center for Biotechnology Information; Bethesda) searches against 
dbEST and the non redundant set of GenBank, EMBL, and DDB J 
database sequences to confirm the total gene specificity of the 
nucleotide sequences chosen for the primers. The specificity of the 
amplified PCR products was confirmed either by restriction digest 
or by sequencing, The 18S ribosomal RNA was measured using 
the Pre-Developed TaqMan Assay Reagents Endogenous control 
kit from Applied Biosystems (Foster City, CA). 





TABLE I- 


- SEQUENCE OF PRIMERS AND TaqMan PROBES USED FOR RT-PCR STUDIES 






Gei>e and accession 
number 


Pttsiiion 


' ' . j Sequence 


Sizt 


Cycles 


MMP-2FP 
MlViP-2-RP . 
NM_0O4530 


1740F 
1964R 


s'-acatcttcttcttca'aggaccggtto' 
5vggctggtcagtcgcttggggta-3' 

5'-ccggagattggcaa!ccagctgta-3' 


225 bp 


33 


MMP-9-FP 
MMP-9-RP 
J05070 


I592F 
J800R 


5'GACGCCCCTGTGTACACCCACA-3' 


208 bp 


37 


MMP-14-FP ' 

MMP-14-RP 

NM_004995 


1288F 
I508R 


5'-GGATACCCAATGCCCATTGGCCA-3' 
5'-CCATTGGGCATCCAGAAGAGAGC-3' 


221 bp 


32 . 


T1MP1-FP 
T1MP1-RP 
M 12670 


: 78F 
245R 


5 CATCCTGTTGTTGCTGTGGCTG AT- 3' 
5'-GTCATClTG ATCTCATA ACGCTGG-3 ' 


168 bp 


■ 33 


T1MP-2-FP 
TIMP-2- RP 
NM 003255 
VEGF-FP 
VEGF-RP 
AH001553 


78F 
245R 

J208F 
J687R 


5'-CTCGCTGGACGTTGGAGGAAAGAA-3' 
5^AGCCCATCTGGTACCTGTGGTTCA-3 , 

5 r -CCTGGTGGACATCTTCCAGGAGTA-3' 
5'-CTCACCGCCTCGGCTTGTCACA-3 r 


155 bp 

479 bp 
407 bp 
347 bp 
275 bp 


30 

33 


28S rRNA-RP 
28SrRNA-RP 
U 13369 


12403F 
J2614R 


5'-GTTCACCCACTAATAGGGAACGTGA-3' 
S'-GATTCTG ACTTAG A j3GCGTTCAGT- 3 ' 


212 bp 


19 


VEGFRJ-FP 
VEGFR1-RP 
VEGFR1 Probe 
AF063657 


2438F 
25J6R 
2469 


5'-TCCCITATGATGCCACCAAGT-3' 

S'-CCAAAAGCCCCTCTjTCCAAO' 

y-CCGGGAGAGACTTAAACTGGCCAAATCA-3' 

! 


79 bp 


.40 


sVEGFRl-FP 
sVEGFRl-RP 
sVEGFRl Probe 
U0I134 


2209F 
2388R 
2257 


5'-ACAATCAGAGGTGAGCACTGCAA-3' 

5'-TCCGAGCCrGAAACTTAGCAA-3' 

5'-TCCAAATTTAAAAGiCACAAGGAATGATTGTACCAC-3' 


180 bp 


40 


VEGFR2-FP 
VECFR2-RP 
VEGFR2 Probe 
AF063658 


79JF 
946R • 
820 


5'-CTTCGAAGCATCAGCATAAGAAACT-3' 

5 ' -TGGTCATC ACCCC ACTGG AT- 3 ' 

5'-AACCGAGACCTAA|AACCCAGTCTGCGAGT-3' 


156 bp 


40 


NRPI-FP 
NRPJ-RP 
NRPl Probe 
XM 034725 


my? 

1942R 
1883 


5'- CAC AGTGG A A CAGGTG ATGACTTC- 3 ' 

5'-AACCATATGTTGGWaACTCTGATTGT-3' 

5'-CCACAGAAAAGCCCACGGTCATAGACA-3' 


112 bp 


40 



850 



MUNAUT ETAL 



6000 



5500 



5000 



4500 



4000 



^ 3500 



< 
Z 

g 2500 



2000 



1500 - 



8 



VEGF VEGFR1 sVEGFRI VEGFR2 HHP 

, I ^ F 4nd VEGF mRNA quantification* scat- 

^Sfi*" r NomaI .«ck spots) and GBMs (whfte 
spote) mRNA levels are expressed as normalized values (as described 

"„^ n l? d A U ' = ""its). Each AfnT^re 

sents the mean of 3 separate experiments. 

T)MP-2mRNA and VEGF mRNA isoforms 

MTl-MMP, MMP-2, MMP-9, HMP-2 and VEGF mRNA iso- 
forrns (VEGF,,,, VEGF 1W , VEGF 145 and VEGF m ) weVe met 
sured ,» JO ng aliquots of cDNA using Tag polymerise (Taka^ 
Shiga, Japan) and 5 pmoi of each primers (Table J). The thermal 
cycling conditions included 2 min at 95X for denaruration and 
,,fica J ,0 "i?; Sec * 94°C. 20 sec at &6X and 20 sec at 
Tl'r ( ^f C fo ' VEGF » s ^orms) with a final incubation 2 min at 
- i /i»^J r0 ? UCtS WCTe ' esojVed on 2% Nusieve 3:1 agarose 
gels (BwWhittaker, Rockland, MD) and analyzed using a JWS 
MulnJmager (Bio-Rad, Hercules, CA) after ethidium bromide 
staining. Specfic mRNA levels were expressed as the ratio of 
specific transcnpts/28S transcripts. Experiments were repeated at 
least j times in duplicate. 

R ^J'^\^^^CRfor VEGFKJ. sVEGFX-J. VEGFR-2 
and NRPJ mRNA 

,,r^ in ? e 1 uanlilativ e RT-PCR analyses for VEGFR-1 
sVEGFR-1, VEGFR-2, NRPI mRNAs and IBS rRNA were cj: 
ned out using ibe ABJ PRISM 7700 Sequence Detection System 

IfT^o SOf ' Ware (PE App,ied ^systems). The sequences 
ol the PCR pnmer pairs and fluorogenic probes that were used for 
each gene are shown in Table 1. A standard curve was generated by 

wSri^ ,iU i iOP ° f p,3Cema CDNA lo Covc ' ,he of 
1 " S ? d WaS ran in du P ,ica * during every experiment. 
For each experimental sample, the amount of target gene was 
determined from this standard curve. The relative expression level 
of ihe target gene was normalized against I8S rRNA lo compen- 



S ?Ix,^ Var ! at,0n m **** qua,ity ofmA thc ai «ount of input 
cDNA (as descried bylhe manufacturer PE Applied Biosystems 

in U f iSf^ 2) ' Was ^ td 001 TaqMan Uni- 

V rSfI ^ ^f Cr Mix (Applied Bios y s *ms) ^ing 5 p.1 of diluted 
cDNA , (equivalent to 10 ng total RNA), 200 nM of the probe, and 
400 nM primers m a 25 pJ final reaction mixture. After a 2 min 
incubation at 50°C to allow for UNG cleavage, AmpliTaq Gold 
was activated by an incubation for 10 min at 95°C Each of the 40 
PCn cycles consisted, of 15. sec of denaruration at 95°C and 
hybridization of probe and primers for 1 min at 60°C 

To confirm amplification specificity, the PCR products were 
also examined by subsequent 2% agarose gel electrophoresis, 
fcxpenments were repeated at least 3 tiroes in duplicate. 

lmmwwhistochemistry for VEGF and MTl-MMP 

Sections (4 jim thick) were cut from fonnalin-fixed, paraffin 
embedded rumour tissue. They were hydrated through graded 
alcohols and incubated in (0.3% 15 min). Sections were ' 

aotoclaved for 11 mm at 126*C in citrate buffer pH6 for antigen 
retrieval (Dako, Glostrup, Denmark). For double LmunostainL 

aH? in primary """donal Ab anti-MTl- 

MMP Ab-4) 1:100 (Oncogene Research Products, San Dieoo 
CA) followed by peroxidase- conjugated EnVision (DakoV Inarmi- 
noreactivity was visualized with 33'diaminobenzidine (DAB+ 
vr^°p f C t C T,l Werc * en ^cubated with polyclonal Ab anti-* 
VEGF 1:150 (Santa Cruz, Santa Cruz, CA) for 1 hf at room 
K f° ,0Wed ^ Valine phosphatase-conju gated En- 
V, S ion (Dako). Immunoreactivity for VEGF was visualized with 
Fast Red chromogenic substrate (Dako). Single immunostaining 
was also earned out on serial sections using each primary antibody 
alone with the corresponding enzyme-chromogene combination 
Negative controls were obtained by omining the primary antibod- 
ies. J 

Gelaiin zymograpky assay 

MMP-2 and MMP-9 activities were quantified by gelatin zy- 
mography on 2 normal brains and 18 GBMs. Ten cryosections (JO 
IxM) were homogenized in buffer (0.1 M Tris-HCl pH 8 1 04% 
Triton X-100) and cenlrifuged for 20 min at 5,000*. The indlels 
were discarded. 25 u.g of total protein from homogenate supenia- 
tants were mixed with non reducing sample buffer (623 mM 
Tns-HCI pH 6.8; 2% SDS; 10% glycerol; 0.1% bromophenol 
« tectrophoresed directly on 10% SDS-polyacrylamide 
gels (SDS-PAGE) containing 0.1% gelatin (w/v).« After electro- " 
phoresis, gels were washed for 1 hr at room temperature in a 2% 
(v/v) Tnton X-100 solution to remove SDS, transferred to a buffer 
(50 mM Tris-HQ pH 7.6, containing 10 mM CaQj and incu- 
bated for 18 hr at 37°C Gels were stained for 30 min with 0.1% 
(w/v) Coomass.e bnlJiant blue G250in 45% (v/v) methanoV10% 
- (v/v) acetic acid and desiained in 10% (v/v) acetic acid/20% (v/v) 
methanol. Gels were analyzed with Quantity One software (ver- 
sion 4.2.2, Bio-Rad Laboratories, Hercules, CA) after densitomet- 
ry scanning of the gels using a Fluor-S Mullimager (BioRad). 

Western blot 

, c^IV^ir ^ r ° ,cin ,eVC,S Wcre analy^d 2 normal brains and 
k ,r rno A iaJ " e ' XlraC ' S {25 were w ^ ^ sample 

buffer (0.25 M Tris (pH6.8). 10% SDS (w/v), 4% sucrose (v/v), 
5% P-mercaptoelham>l (v/v) and 0. 125% bromophenol blue (w/v)J 
and boiled for 5 mm. They were separated on 10% SDS-PAGE 
gels and transferred ro a PVDF filler (NEN, Boston, MA) After 
blocking wnh 57c milk (w/v), 0.1% tween 20 (w/v) in PBS for 2 
hr at room lempeiarure. membranes were exposed lo the primary 
antibody (10 H g/ml f clone 1J3-5B7, AM, Oncogene Research 
Products, San Diego, CA) at 4°C overnight followed by incubation 
wiih a horseradish peroxidase-conjugaied rabbit anti-mouse anti- 
body (1.3 jig/ml, Dako, Glosrhip, Denmark). Signals were de- 
tected with an enhanced chemoluminescence (ECL) kit (NEN. 
Boston, MA). The relative intensities of the immunoreaclive bands 
were analyzed wiih Quantify One software (version 4 9 2 Bio-Rad 
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Figure 2 - MMPs and T1MP-2 mRNA quantifica- 
tioD. (a) Representative 2% agarose gels of RT-PCR 
products for MT.1-MMP, MMP-2, TTMP-2 a D d 
MMP-9 in 2 normal brains (NB) and 10 GBMs. (b) 
Scatter plots (as described in Fig. J). Experiment was 
repeated at least 3 times in duplicate. 



Laboratories) after densitometric scanning of ibe X-ray films using 
a Fluor-S Multimager (Bip-Rad). 

Statistics 

VEGF, . VEGFRs, MMPs and T1MP-2 expression values in 
GBMs were correlated using Spearman's test. Correlation was 
considered significant for 2-tailed p-value < 0-05. Statistical anal- 
ysis was carried out using tbe Prism 3.0 software (GraphPad, San 
Diego; CA). 

RESULTS 

Expression of VEGF and VEGF receptors 

VEGF mRNA was present in normal brains (295-375, arbitrary 
units; mean = 322) and in all GBM samples (2 17-5, 1 12; mean = 
1,774) as reported previously (Fig. I). 31 In most GBMs. VEGF 
mRNA levels were raised 2- 1 5- fold above normal brain values. 
Tbe most abundant isoform in all cases was VEGF, 65 , followed by 
VEGF, 2|I VEGF, 89 and VEGF lJt5 (data not shown). VEGFR-1 
expression was found at similar levels in GBMs (89-357: mean = 
182) and normal controls (154-198; mean = 181). There was no 
correlation between VEGFR-1 and VEGF. mRNA levels ip = 
0.35) in GBMs. VEGFR-2 was expressed in all GBMs (48-582: 
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mean = 210) and in 8720 cases at least twice normal values 
(87-111; mean = 103)- VEGF and VEGFR-2 expressions were 
correlated significantly ip = 0.0035) in GBMs: NRP1 expression 
varied broadly between GBMs (75-3,260; mean = 1.061) con- 
trasting with a constant baseline expression in normal controls 
(383-397; mean « 390). In rumours. NRP1 correlated with 
VEGFR-2 ip = 0.0119) but not witb VEGF ip = 0:084), DOr 
VEGFR-1 ip = 0.066). sVEGFR-1 was expressed at low levels 
both in normal brains (84-92; mean - 8?) and GBMs (25-205; 
mean = 101). sVEGFR-1, however, was found to correlate witn 
VEGFR-1 (p = 0.0289), VEGFR-2 (p = 0.0029), and NRP) <P -. 
0.0027) but not with VEGF ip = 0.053). 

Expression of MMPs and TIMP-2 

! MT1-MMP. MMP-2 and MMP-9 were expressed in both nor- 
1 mal brains and GBMs but at much higher levels in the 
2a). MT1-MMP mRNA levels were constantly higher id <j«ms 
(34-202; mean = 106) than in normal controls (3-^29; mean - 
15) MMP-2 and MMP-9 mRNA levels were higher than controls 
in 1 8720 and 14/20 cases respectively (Fig. 2b) and correlated witft 
each other ip - 0.0)87). MT1 -MMP mRN A levels correlated with 
MMP-2 (p « 0.0008) and MMP-9 ip = 0.005). * 
non-discriminative distribution in relation to the controls. TIMP-^ 
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13 14 17 18 20 22 16 + 








— 60 kDa 



FteuB* 3 1 - Zyroographic analysis of MMP-2 and MMP-9 in tissui 
dmoned by human m-1080 cells was included as positive contot 
STed^^oC ' MMP:9 * pro - MMP " 2 ^ WmS. 

TABLE II — ZYMOGRAMS AND WESTERN BLOT OUANlmCATmN OS 

MMPS W NORMAL BRAIN ASi%i" HCAT10N W . 



nttS? 4 --W«lera blot analysis using ihe ab4 antibody (dbne 
113-5B7) raised against (he catalytic domain tfNfTl-S C 
extracts from MTI-MMP transfecled A2058 beUs WoL SLs'^V^ 
^ai^°ffi + >- protein is^cd^ 

FROM SPEARMAN'S TT<rr 



MTI-MMP 



MMP-2 



MMP-9- 



VEGF 

VEGFR-J 

VEGFR-2 

NRPJ 

sVEGFR-1 



TIMP-2 



Cdaiia 2ymogrephy 



0.0250 
0.0073 

<o.oooi 

0.0053 
0.03)3 



0.0245 
0.0710 
0.0168 
0.1334 
0.0469 



. 0.0053 
<0.000I 
0.0004 
0.2457 
0.0194 



0.0094 
0.4542 
0.0153 
0.5480 
0.2563 



proMMP-9 


Nl 


0.41 


N2. 


0.28 


GB1 


9.00 


GB2 


4.28 


GB3. 


1.67 


GB4 


1.40 


GB5 


0.47 


GB7 


5.29 


GB8 


5.75 


GB9 


5:02 


GBJ2 


5.08 


GB13 
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0.0002) but not with MMP-9 {p = 0.1408). 

w' €l Mm-9 €tW€en mUMMF P™™ ond activated MMP-2 

By gelaGn zymography, pro-MMP-2 and pro-MMP-9 were de- 
eded in thelg GBMs and 2 controls examined (Fig. 3, Tab e U) 
In most GBMs, levels of these inactive for™ were higher than in 
? T a ° d WCfC COrTdaIcd '«P«^ mRNA 

MMP-2 and MMP-9 were not found in normal brain. By contrast 

Sir B k C,S W ™ qUan,ified b * Wes,cm W« in 15 
3Dd 2 Dorm31 brains 4, Table 11). Th ey WCTC sigmfi . 



cantly correlated with zymograrn- derived aclivatcd Jfivd 
(p - 0.0226) 1 but not with activated MMP-9 levels (p = 006) 
J) eresUDg^ MTI-MMP protein and mRNA levels were corre- 
cted ?1 gmfkaDUy (p = 0.089), arguing for a predominantly tran- 
scriptional regulation in GBMs! 

Correlation between VEGF network and MMPs 

J^ttTtll?! ^ GF and VEGF recc ^ were compared to 
MTI-MMP, MMP-2, MMP-9 and HMP-2 (Table Hi). TW was 

MSf"Si>o 0rre I a ^ n belWeen «P"*sion and MTJ- 

MM1\ MMP-2 and MMP-9. A similar correlation was also ob- 
served between VEGFR-2 and MMPs. Interestingly TIMP-2 ex- 

P T*TJS S conreIated wilh and VEGFR-2 but not with 

other VEGF receptors. 

Immunohistochemistryfor VEGF and MTI-MMP 

VEGF immnnoreactivity. was shown in both tumour and endo- 
thelial ce »^ a« previously reported (Fig. 5a W By single immu- 
nostainmg. MTI-MMP was detected in glioblastoma cells as a 
djffusecytoplasiruc staining (Fig. 5 Cf d). MTI-MMP posilivity was 
also seen ,n endothelial cells and perivascular cells (Rg 5c) By 
a!a^^^ D ^ wc observ «* the ^localization of VEGF 
andMTI-MMP m the cytoplasm of numerous tumour cells (Fig. 



DISCUSSION 

GBMs are highly malignant tumours with poor prognosis. They 
vP^i%^" 0Va ^ U,ar P roIifera 'i°n and express high levels of 
vtor-. VEGF is a strong mitogen for endothelial cells Oiereby 
promoting anagenesis. Previous reports".?* have suggested that 
VfcGr also stimulates tumour cell invasion, migration and survival 
m malignant epithelial cells through an autocrine loop by which 
oppression of MMPs induces VEGF secretion and leads to 
subsequent amplificauon of cell proliferation and protection 
against apoptosis We and others reported previously that in human 
melanoma and breast carcinoma cells, MTI-MMP upregulates 
VfcGF expression whereas TIMP-2 reduces it .- 8 - M - 35 Therefore the 
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Figure 5 - lmnjunohistocbemis- 
try (scale bar = 50 ujnX (a^) VEGF 
positive tumour and endothelial cells 
(plain arrows) show granular red 
staining of tbe cytoplasm. Negative 
cells (empty arrow) are seen in their 
close vicinity, (c) MT1-MMP posi- 
tive 'cells show a strong brown cyto- 
plasmic staining. They include tu- 
mours cells (plain arrow) and 
endothelial cells (empty arrow), {d} 
MTl-MMP positive tumour cells 
(plain arrow) are mixed with nega- 
tive cells (empty arrow), (tf) Double 
staining with VEGF (red) and MTI- 
MMPI (brown). Double positive tu- 
mour cells (plain arrows) contrast 
with negative or single weakly pos- 
itive cell (empty arrow). 



pericellular proteolysis mediated by MT1-MMP in GBMs could 
also induce an autocrine loop resulting in enhanced VEGF expres- 
sion. In rum, VEGF could act as a paracrine factor on endothelial 
cells to stimulate angiogenesis. or possibly as an autocrine faclor 
promoting glioblastoma cells survival-migration and invasion as 
demonstrated recently in the various rumour eel! culture models. 

We compared the expression of VEGF and its receptors wjih 
MTl-MMP,. MMP-2 and MMP-9 in 20 GBMS and 5 normal brains. 
The expression of these MMPs was maijcedly increased in most 
GBMs with excellent correlation between mRNA and protein levels. 
MT]1-MMP expression has been shown previously to correlate with 
glioma aggressiveness and its transfection in different tumour cell 
lines triggers an angiogenic phenotype and promotes tumour 
growth. 2 ^ 8 - 30 * 36 " 38 A majority of GBMs ( 17/20) also expressed high 
levels of VEGF, as previously reported, with a strong correlation 
between VEGF and MTl-MMP gene expression levels. Double in> 
munostainiDg studies showed co-expression of VEGF and MTl- 
MMP by the same tumour cells. These data suggest that tbe transcrip- 




rional control of VEGF by MTl-MMP could be operative not only in 
vitro but also in vivo in human GBMs. 

MTl-MMP could also promote the growth of GBMs by its 
ability to activate MMP-2 in the presence of low concentration of 
T1MP-2. J0 Pro-MMP-2 activation occurs after the formation of a 
ternary complex that contains pro-MMP-2 linked to cell surface 
MTl-MMP via a T1MP-2 bridge. Jn accordance with this hypoth- 
esis, we found that MMP-2 activation occurred in 8/18 of our 
GBMs 20 *" 1 among which 7/7 tested for MTl-MMP showed high 
contents of this protease. 

Activated MMP-9 was aiso found in 7/18 of our GBMs. This js 
an interesting finding as active MMP-9 is able to mobilize VEGF 
from its ECM reservoir. 39 Therefore, MMPs could promote 
VEGF-mediated angiogenesis in GBMs by both transcriptional 
(MTl-MMP) and post translational (MMP-9) mechanisms. 

VEGF binding to VEGFR-2 triggers the proliferation and mi- 
gration of endothelial cells whereas its binding to VEGFR-1 has 
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opposite effects on glioblastoma cell Hues. ,2 -« In our study VEGF 
mRNA levels were correlated with VEGFR-2 but not VEGFR-1 
NRP1 and sYEGFR-1. Collectively our data suggest that GBMs 
display a specific and complex pattern of VEGF receptors, trans- 
ducing VEGF signaling toward cell proliferation and migration. 

In conclusion, our study adds to the evidence for an interplay 
between meralJoprotdnases and VEGF in human GBMs as previ- 
ously documenied in experimental tumours. Because of its dual ability 
to acti vate MMP-2 and to up-regulate VEGF, MT1 MMP might be of 
central importance m the growth of human glioblastomas and repre- 
sent an interesDng target for anti-cancer treatments. 
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D NA hyper methyla tio n is a mechanism for loss of expression of the HLA 
class I genes in human esophageal squamous cell carcinomas. 
Ni*Y> YlSKG, SongJV, ZhaoX, SoC, LiaoJ, WangJLD, Vang^CS. 

NelTJZtip 7 R f Search ' C 0 "^ of Pharmacy, Rutgers-The State University of 
New Jersey, 164 Frehnghuysen Road, Piscataway, NJ 08854-8020, USA. 

The three human leukocyte antigen (HLA) class I antigens, HLA-A, HLA-B and HLA-C 
play .mportant roles in the elimination of transformed cells by cytotoxic T cells Frequent 
loss of expression of these antigens at the cell surface has been observed in many human 

bu ^ ^^ cri P tional ^'ation have been proposed and 

tested bu, die molecular mecnanisms for transcriptional regulation are not clear We show 
by immunoustochenusliy that the HLA class I antigens are absent in 26 of 29 (89%) 
samp es of human esophageal squamous cell carcinomas (ESCC). Eleven of the 26 ESCC 
samples lostmRNA expression for at least one of the HLA genes, as shown by R.T-PGR 
DNA fix>m Hie 29 pans of ESCG and neighboring normal epithelium were examined for 
C P G island hypermethylation, homozygous deletion, microsatellite instability (MSD and 
loss of heterozygosity (LOH). DNA from normal epithelial tissues had no delectable 

of tlxeCpG islands of any of these gene loci: Thirteen of 29 ESCC samples- 

exhibited ^ethyauon of all «Kee Io<i. The HLA-B gene locus was most frequently 
methylated 38%). HLA-B mRNA expression in an ESCC cell line, where HLA-B was 
hypermethylated and did not express mRNA, was activated after treatment with 5-aza-2'- 
deoxycyttdme. Homozygous deletion of these three gene loci was not observed 
SJ^^S^f ^ MSI WCre ° bseiVed for me m^rosatellite markers 
' So of OH and H D6S1666 ' Cl0se to HLA-A, -B and -C loci, although a 

Z^u^k OH u WaS ? se ^ aUn ^y ,oc ^(rep r esentedb y .themarkersD6S105l 
andD6S1560, where the tumor suppressor gene p21(Wafl) resides. A strong correlation 
between genetic alterations and mRNA inactivation was observed in the ESCC samples 
Our resulte indicate that HLA class I gene expression was frequently down-regulated in 
ESCC at both the protem and mRNA levels and that hypermethylation of the promoter 
reg,ons of the HLA-A, -B and -C genes is a major mechamsm of transcriptional 
inactivation. r 

PMID; 11577000 [PubMed - indexed for MEDLINE] 
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Molecular and Immunohisfochemical 
Analysis of HER2/neu Oncogene in Synovial 
Sarcoma 

Ef22f SXJJf" NUCIFORO ' MD. CATERINA PELLEGRINI, PhD 
ROBERTA FASANL MD. MARCO MAGGIONI MD 
GUIDO COGGl MD, AMTONINA PARAFIORm MD 
AND SILVANO BOSARI, MD . ' 



^^0»^/or 0 yer«xj>,e S aoD0fliER2/ B euh.v e b«n 
documented m many type, bfepithellal tumor and mBo0 . bM ^ 

" T 0 ^' » """-comas. Bo, the role of 

-T r 100 " 1 10 *°* * Um0 " V»°*J under- 

stood. Tins the pre^n, study tawtfg^ „p r<=aion .fHn,,, 

DeU ,n ,J P»oa>« «>k s-ynovial sarcoma (SS). In this stud, I IFTR ?/ 

'^ 0m ' tmn * f^iptJoo-polymerasc chain ruction assay; pro. 
it^T^""^ hy «»™^o.l« m ta r y .vringanaai- 

HER2/ DM polyclonal antibody, su ^ j ^eleJrZl speci. 
^f,7 < " "" d '° basal Icds of HER2/neu mRNA. 

HER2/n~ .ranscnpt, we de.ecled in all norma) rj»« s and SSs. 
Four of 1} stomas ( Jl 56) demonstrated HEW/-™ mRNA levels 
above themea,^,**^ i tumors*) displayed HER V ne„ 
pro eu, oppression. Bod, membra™,, and cytoplasmic patterns 
of ^..staining were observed, and , stZ^orrelanC^ 

.Urnr^"^ SaTCOm ? (SS> " ™ W«ive Soft tissue 

tumor that accounts for up to 10% of sarcoma with a 
peak modencc in adolescents and young adults. This 

WHf n , \* < * l0 *« e <**">>' characterized by the i(X; 
1 8) (pH^ll) translation, found in >95% of cases 
Although tracbuonally considered to be a higb^rade 
feren rccen n inves,i S ati °"s have suggested ihSi dif- 
ferent factors tnfluence. prognosis, including morpho- 
og.cal zn6 cytogenettc features, treatment strategies 
the plo.dy status. ar,d the apoptolic index.' ' 
,,,,-h hc ^ cvel °P'" c " t of new therapeutic advancements, 
such as the specie targeting of molecular alterations 
present ,„ human malignancies, has broyght to light .he 
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found between pioiein expression and mRNA le^J (J> a n oil 
Weased HFJ^/neu n^RNA ,eveh We ^gnificnUy 
.K ^ reCWTCnCe5 < P = Moreo^r, none of 

Our dau don pnstralc 0,3, HER2/neu « «pVe«ed In SSc and that 
^^^^^^ >IER2 /»- «P^-n correbte 
leveb of HER2/neu m SSs is associated with a n,or e f.voraMe clinical 
course. Further studies are needed ,o assess the roJe of this oncogene 

doS ^ T aPpBCari0 ° ° f ^^bu-auizedtnon^ 

dona! anbbocfies n, the trea^ent regimes for this mahVnancy 

re.™ tr^fpoc^^^ cbain ^ 

need to identify not only prognose feciore, bm also 
tumor features that are predictive of response to therapy. 
One of the most extensively studied molecular 

£f C W^o/ ^ iS lhe H£R " 2 /" eu protooncogene. 
ine HtR-2/neu oncogene (also kno>vn as c^rbB-2) 
located on chromosome I7q2l, is a member of the 
tyrosine kinase receptor family and encodes for a -J 85. 
Wodahon proLein that shows 50% homology with the 
epidermal growth factor receptor.*- 3 This gene is am- 
phhed,and/or over expressed in"20% to 30% of breast 
caranomas • and in various other tumors, 6 and usually 
is associated with tumor aggressiveness and poor pro K - 
Scveral slu *« have supported the value of 
WtK-2/ncu to predict the response to chemotherapy in 
breast cancer, and the use of recombinant humanized 
anybodies to HER-2/neu protein (Trastuzuxnab) in the 
care of patients ^vith advanced, metastatic breast tumors 
nas been approved. 

The role of HER-2/net. activation in soft tissue 
tumors remains poorly understood, and scarce molec- 
ular data backing immunohistocheinica) studies have 
been reported. HER-2/neu protein expression was im- 
inunohisiochemically studied in 204 sarcomas, includ- 
ing b SSs, and overexpression was absent in all these 
malignant mesenchymal neoplasms. ,p 

Recently, HER-2/neu alter aijons have been dr- 
senbed m osteosarcoma, with a high incidence of pro- 
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TABLE 1. ainfcopathdogic Features of 13 Patients With SS 



No. 


Age(y)/G 


Site 


Subtype 


Surgery 


Chcmo/Radio 


Rcc 


Mets 


HER2 


FU( ro o) 


) 


68/F- 


Kl 


MF 


RR 


No 


Yes 


Yes 


L 


108 


2 


71/F 


KE 


. BF 


V/A 


No 


No 


No 


L 


32 


3 


27/M 


Kl 


. BF 


' ME 


Yes 


Yes 


Yes 


L 


24 


4 


39/M 


T 


MF 


WE. 


Yes 


No 


No 


H 


48 


5 


15/M 


N 


MFJ*D 


WE 


Yes 


No 


No 


H 


48 


6 


41/F 


F 


MF^D 


WA 


Yes 


No 


* No 


H 


48 


. *7 


57/F 


Kl 


MF 


WA * 


No 


N/A 


N/A 


L 


N/A 


8 


48/M . 


. F 


BF,PD 


ME 


Yes 


Yes 


No 


L 


36 


9 


29/F 


T 


PD 


WE 


Yes 


Yes 


Yes 


L 


36 


10 


27/M 


KE 


BF 


WE 


No 


No 


No 


H 


24 


ii 


62/F 


A . 


MF 


WE 


No 


Yes 


No 


L 


24 


12 


62/F 


F 


MF 


WA 


No 


No 


No 


L 


2 


13 


34/F 


L 


MF 


RR 


No 


No . 


No 


L 


2 



Abbreviations: Age, age at diagnosis; G,. gender; M, male; F. female; Site, anatomic location; Kl, knee, intra-articulax; KE, knee, 
extra-articular; T, thigh; N, neck;. F, foot; A, arm; L> leg; Subtype, histological subtype; MF, roonophasic fibrous; BF, biphasic; MF-PB r 
monophasic fibrous with poorly differentiated areas; BF-PD, biphasic with pooi fy differentiated areas; PD. poorly differentiated; Surgery, 
primary surgical therapy; RR, radical en bloc resection; WA, -wide through-bone amputation; ME, marginal en bloc excision; Wfcjwide en bloc 
excision; Chcmo/Radio, adjuvant postoperative chemotherapy and/or radiotherapy; Rcc, recurrence; Mcts, presence of metastasis; HER2, 
HER2 roRNA expression; L, low expression; H, high expression; FU, follow-up status; N/A, not avaible. 



tein expression, ranging from 42% to 61 %. n ' 15 Indeed, 
despjie limited information on Her2/neu in this type 
of malignancy, based mostly on irnmunohistocbernicaJ 
findings, 2 clinical trials of Trastuzumab have been 
initiated for recurrent and metastatic osteosarcoma pa- 
dents (http://ww.cancer.gov/clinical_mals: MSKCG- 
99097/NCI-T9cKK)83 and COG-AOST01 21 ) . 

Therefore, we evaluated the mRNA expression and 
the gene product of HER-2/neu in 13 SS patients using 
realtime reverse transcription-polym erase chain reac- 
tion (RT-PCR) and immuriohistochemistry. To the best 
of our knowledge, this is the first report documenting 
increased levels of HER-2/neu mRNA and protein 
in SS. 



MATERIALS AND METHODS 

Potient Population 

Thirteen patients with primary SSs, obtained from (he 
files of the Department of Pathology, Gaetano Pini Orthope- 
dic Institute, were included in this study. Cases were chosen 
based on the availability of frozen primary rumor. Patient age 
ranged from 15 to 71 years (mean, 44.6 years). Anatomic sites 
included the knee (5 patients, 3 intraarticular), thigh (2 
patients), foot (3 patients), arm (1 patient), neck (1 patieDt), 
zud lower )eg (2 patient). The histologics) subtypes were 3 
biphasic (BF), 6 monophasic fibrous (MF), 2 monophasic 
fibrous with poorly differentiated areas (MF-PD), 1 biphasic 
with poorly differentiated areas (BF-PD), and } poorly differ- 
entiated (PD). Clinical staging was 1IB for all the patients. 
Local surgical excision was .performed in 9 patients; amputa- 
tion, in 4 patients. Recurrence was observed in 5 cases with 
subsequent amputation (3 patients) and local excision (2 
patients). Metastasis occurred to lung (2 patients) and ingui- 
nal lymph nodes (1 patients). Five patients received adjuvant 
postoperative chemotherapy, and 1 patient (case 4) was 
nested with chemotherapy associated with radiotherapy. Fol- 
low-up ranged from 2 to 108 months (mean, 3-1.3 months). 
Clinic opathologic data are summarized in Table 1. 



Pathologic Studies and Frozen Tissue 
Selection 

In all cases, the primary tumor was avaibble'Tor study. 
Surgically resected tumor tissues were partly snap-frozen in 
liquid nitrogen and stored at -80°C for RNA extraction, and 
partly fixed in buffered formalin and embedded in paraffin 
blocks. Hematoxylin and eosin-stained sections were re-eval- 
uated and graded according to FNCLCC grading system. 16 
Frozen tissue blocks were handled as follows: 4-/Mhick frozen 
sections were cut and stained with hematoxylin and eosin to 
determine the percentage of tumor cells present 'in the spec- 
imen. We used tissue blocks with tumor cells comprising 
more than 80% of the specimen. About 1 0 20-u.-thick sections 
were collected into Eppendorf rubes. Another 4-ix-thick fro- 
zen section was cut after the serial sections and examined by 
light microscopy to guarantee the percentage of tumor cells 
collected. Trizol (life Technologies; Gibco BRL, Gaithers- 
burg, MD) was used for RNA extraction, according to the 
manufacturer's protocol. RNA was quantified spectrophoto- 
metrically. . „ 



cDNA Synthesis 

Total RNA (200 ng) was re verse- transcribed in a total 
volume of 50 /iL containing 1 X TaqMan buffer, 5.5 mmol 
MgClj, 1 mmol deoxynucleotides, 2.5 fxmol random hexam- 
ers, 20 U RNase inhibitor, and 62.5 U MuLv reverse transcrip- 
tase. The samples were incubated at 25°C for 10 minutes, 
48^ for 30 minutes and 95°CfoT 5 minutes. 



PCR Amplification 

Amplification reactions were performed with the Univer- 
sal TaqMan 2 X PCR master mix in a volume of 25 /jL con- 
taining 300 nmo! of each primer, 100 nmol of probe, and 5 
uL of cDNA: Both 0-actin and HER2/neu amplification were 
done in duplicate for each sample. 

The thermal cycling conditions included 2 minutes at 
50 C C and 1 0 minutes at 95 °C followed by 40 cycles of 95=C for 
15 seconds and 50°C for I minute. /VI) reagents used for 
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RT-PCR were purchased from Applied Biosystems (Foster 
Gry,CA). 

Primers ond Probes 

Primers and probes for £acrin and HER^/neu mRNA 
were chosen using the computer program Primer Express 
(Applied Biosystems). Sequences of the forward primer for 
HER2/neu mRNA (CenBank accession number X03563) 
were 5'-TCC TGT GTG GAC CTG GAT CAC3' and the 
reverse primer 5'-CCA AAG ACC ACC CCC AAG A-3*; the 
sequence of the TaqMan probe was 5'(FAM)-AGC AGA ATG 
CCA ACC ACC GCA GA-(TAMRA)-3\ Sequences of the for- 
ward primeT for £acrin mRNA (GenBank accession number 
X00351) were 5-TCC TTC CTG GCC ATG GAG-3* and the 
reverse primer 5VAGG ACG AGC AAT CAT CTT GAT CTT- 
3'; the sequence of the TaqMan probe was 5*(FAM)-CCT GTG 
GCA TCC ACG AAA CTA CCT TC(TAMRA)3\ Probes were 
purchased from Applied Biosystems. 

ReoKFime RT-PCR 

To measure HER2/neu expression in these tumors we 
used a Teal-rime quantitative RT-PCR based on TaqMan meth- 
odology, as previously described, 17 with minor modifications. 
Bnefty, this technique allows, by means of fluorescence emis- 
sion; to find the cycling point when PCR product is detectable 
(Ct ifaJue or threshold cycle). As previously reported, the Ct 
value correlates to the starting quantity of the target mRNA. 1 s 
To normalize the amount of total RNA present in each reac- 
tion, we amplified the housekeeping gene ^actin, which is 
assumed to be constant in both normal samples and tumor 
tissues. 

Our results are expressed as relative levels of HER2/neu 
mRNA, referred to a sample, called a 'calibrator," chosen to 
represent IX expression of this gene. The calibrator was a 
breast cancer cellular line <MC^7) ,S that was analyzed on 
every assay plate wiih the unknown samples. All of the ana- 
lyzed rumors expressed n-fold HER2/neu mRNA relative to 
the calibrator. 

The amount of target, normalized to an endogenous 
reference (p-aenn) and relative to the calibrator, was defined 
by die AACt method as described by Livak K (Sequence 
Detector User Bulletin 2; Applied Biosystems). Specifically 
the formula is applied as follows: 

target amount = 2 _AAO ' 

where AA Ct = |[Ct (HER2/neu sample) - Ct (0-actin 
sample)) - [Ct (HER2/neu calibrator) - Ct (0-acrin calibra- 
tor))). 



. 25.0 



Immunohistochemistry 

Formalin-fixed, paraffin-embedded tissue sections were 
deparaffimzed, lehydrated, and exposed to the primary anti- 
body using, the EnVision+ system (Dako, Carpimeria, C\) 
Primary ano-HER-2/neu antibody (rabbit polyclonal anti- 
body, catalog number A0485; Dako) was applied in a dilution 
of 1:2000 for 60 minutes ai room temperarurc. Before expo- 
sure to the primary antibody, sections were mioowave-pre- 
neated in EDTA, P H 8.0. to retrioe antigemciry, and incu- 
bated with endogenous peroxidase-blocking solution for 10 
mmuies at room icmperarure. Positive control, constituted bv 
a breast carcinoma showing more ihan 80% posibve staimng 
for HER2/neu, as well as negative control, in which the 
primary antibody was omit led, were stained in parallel. 




rx)rmal tissue n=6 synoviaJ^rcormsv syrwte'ssxcamaa 
(few expression) (Hgh expression)' . 

FIGURE 1. Distribution of HER2/neu mRNA levels* in normal 
muscle tissues, ond in tow- and high-expression sarcomas. 
Data ore expressed as mean ond standard error of the mean 
for each group. 



All cases were examined for both cytoplasmic and mem- 
brane mimunoreactiviry. Cytoplasmic staining was evaluated 
ori a semiquantitative scale, according to Kilpairick et al with 
minor modifications, 20 and reported as 0 (no staining or 
staining in <10% of cells), 1+ (weak staining in >10% of 
cells), 2+ (moderate staining. in >1Q% of cells), or 3+ 
(strong staining in >J0% of cells). The presence of a mem- 
branous pattern of staining was recorded separately and 
scored as absent.. (no staining or weak staining in <)0% of 
cells) or present (complete and/ or incomplete staining in 
> 10% of cells). Tumors with a cytoplasmic score of 3+ were 
considered to have high HER-2/neu protein expression. 

Stotisticol Anolysis 

Statistical differences were calculated by Fisher's exact 
test. The Mest method was used to evaluate the differences 
between groups. Differences were considered statistically sig- 
nificant when Pvras <0.05. 



RESULTS 

HER2/neu mRNA Evoluotion 

All of the tissues analyzed contained detectable 
)eve)s of mn2~nev mKNA. Szt norma) rissve sampfes 
(skeletal muscle) were used lo establish basal level of 
HER2/neu mRNA. All the normal samples expressed 
very low levels of HER2/neu mKNA, ranging from 0.9 
«o 1.9 n (mean, 1.4 n). Among the 13 tumor samples, 
HER2/neu levels varied greatly, ranging from 2.1 to 
21 n. Selling a cutoff level at 7.9 n (a value that repre- 
sents the mean value of expression distribution of the 
SSs), 9 cases (69%) had Jow HER2/neu expression and 
-1 cases (31 %) had high HER2/neu expression (Fig 1; 
Table 1). The difference between the 2 groups (low 
and high FTER2/neu tumors) was statistically significant 
[P= 0.0004). 
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FIGURE 2. Immunohistochemicol locoltzotion of HER?/neu in SS. (A) Positive control (breast cancer) showing typical strong 
membrane pattern of posrtivity. (B) Case 2. a Diphasic SS. displaying very focal membrane staining limited too gland. Inset: local 
weak cytoplasmic positrvity in the epithelial component ot the same cose. (C) Case 10, the epithelial component of this biphasic 
SS. displaying strong membrane pattern of staining. (D) Cose A. an epithelioid area in o monophaslc fibrous SS. showing strong 
cytoplasmic positively. 



HER2/neu Protein Expression 

Staining with HER2/neu antibody revealed a vari- 
able cytoplasmic and membrane staining pattern. 
Three tumors (23%) showed strong staining involving 
both the cell membrane and the cytoplasm (cases 4, 5, 
and 10); weak to moderate, exclusively cytoplasmic 
staining was observed in 7 cases (cases 1, 3, 6, 7, 8, 9, 
and J 1). No staining was detected in 2 tumors (cases 12 
and ]3). In 1 case (case 2), a cluster of glandular 
structures representing <5% of the mmoi showed 
weak cytoplasmic and very focal membrane staining, 
the latter limited to a single gland. The epithelial/ 
epithelioid components exhibited stronger cytoplasmic 
staining compared with the spindle-cell component of 
the tumors. Membrane staining was predominantly in- 
complete and limited to the epitbelial/epithelioid ar- 
eas. AJI 3 cases with high immunohistochemical expres- 
sion of HER2/ncu (cases 4, 5, and 10) were grade 11J 
sarcomas, including ] ME, ] JiK, and 1 MFPD SSs. 



Examples brHER2/heu cyibplasmic membrane 
staining are depicted in Figure 2. 

Correlation of Moleculor ond 
Immunohistochemical Results 

A strong, statistically significant association was 
present between protein expression, for both mem- 
brane and cytoplasmic staining, and HER2/neu mRNA 
levels {P ~ 0.03), although ] case (case 6) displayed 
discordant results. Interestingly, this neoplasm showed 
high >]ER2/neu mRNA levels, whereas only weak stain- 
ing, limited to the cytoplasm of a minority of rumor 
cells, was detected by immunohistochemical analysis. 

HER2/neu Expression ond Clinicopolhoiogic 
Porometers 

Both HER2/neu protein expression and mRNA 
levels were evaluated to establish the relationships to 
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TABLE' 2. Correlation Behyeen Cltnicopc3thobgic 
Features and HER2/neu Expression as Delected by 
IHC and RT-PCR 



HER2/n 



IHC 



PCR 



Variable 


JL 


H 


lvalue 

i 


L 


H 


lvalue 


Age t years J 














<40 


3 


3 




3 


3 




>40 


7 


0 


. NS 


6 


1 


. NS 


Sex 












FcroaJe 


8 


0 




7 


1 




Male 


2 


3 


0.03 


2 


3 


NS 


Tumor size (cm) 












<5 


3 


2 




3 


2 




>5 


7 


1 


NS 


6 


2 


NS 


Histological grade 












11 


3 


0 




3 


0 




111 


7 


3 


NS 


6 


4 


NS . 


Histological type 












MF 


5 


1 




5 


1 




BF 


2 


1 




2 


] 




PD • . . 


3 


1 


NS 


2 


2 


NS 


Cb«o o/Radiotherapy* 












Yes 




2 




3 


3 




No 


5 


1 


NS 


5 


1 


NS 


RccuiTencef 










Yes 


5 


0 




5 


0 




No 


2 


3 


NS 


1 


4 


0.02 


Metastasis! 










Yes 


3 


0 




3 


0 




No 


4 


3 


NS 


3 


4 


NS 



. Abbreviations: L, low expression; H, high expression; NS, not 
signiScant; MF, monophasic fibrous; BF, biphasic; PD, poorh/ differ- 
entiated (including MF and BF with poorly differentiated areas). 
•Information not available for case 7. 
1 Cases 7, 12, and 33 were excluded from the analysis. 

ciinicopathologic features, including local recurrence 
and metastatic disease. Two cases (cases ] 2 and ] 3) with 
follow-up Jess than 12 months and 1 case (case 7) for 
which clinical information was not available were ex- 
cluded from the analysis of recurrences and metastatic 
behavior. 

No correlation was observed between HER2/neu 
mRNA expression and age, sex, tumor size, tumor 
grade, histotype, and metastasis. A correlation between 
sex of the patients and HER2/neu protein expression 
was found. In fact, none or the female patients showed 
high HER2/nen protein expression \P = 0.03). Pa- 
. tients with high Her2/neu mRNA levels had a lower 
risk of recurrence- than those with low Hci2/ncu 
mRNA levels {P = 0.02). None of the cases with high 
HER2/neu mRNA levels developed metastatic foci, al- 
though the small number of observations precluded 
reaching statistical significance (P = 0.1). Results are 
detailed in Table 2. 

DISCUSSION 

The present work provides the first combined mo- 
lecular by real-time RT-PCR and immunohistochemical 
evidence thai HER2/neu over expression occurs in SSs 



Our results indicate that this parameter may provide 
prognostic information and suggest that a specific ther- 
apy with humanized monoclonal antibodies against 
HER2/neu may be considered in a significant number 
of SSs. 

The HER2/neu oncogene has teen ; extensively 
investigated as.a prognostic factor arid more recently as. 
a predictor of response to therapy. It has been demon- 
strated in breast cancer, where HER2/neu overexpres- - 
sion is usually associated with gene amplification, 2 *and 
in other epithelial tumors, including ovarian, gastric, 
lung, and urinary bladder cardnomas. 

HER-2/neu amplification/overexpfession appears 
to be an early event in oncogenic transformation by 
interacting with other members of the HER family. 5 In 
breast cancer, it is involved in cell cycle and apoptotic 
pathways through the antiapoptotic effects mediated by 
p53 and p2I deregulation/ 2 ' 2 * 

Whether HER2/neu overexpression plays an im- 
portant role in mesenchymal neoplasms remains con- 
troversial. An immunohistochernical study of sarcomas; 
using a monoclonal antibody, reported no evidence of 
immunoreactivity for HER-2/neu in 6 SSs as well as in 
other 197 mesenchymal tumors, with cytoplasmic reac- 
tivity observed only in 1 case of peripheral neuroepi- 
ihelioma. A recent investigation reported gene ex- 
pression profiles of 41 soft tissue tumors with cDNA 
microarray analysis. Among these sarcomas, 6 
monophasic SSs were characterized by a unique expres- 
sion pattern of a cluster of 104 genes, induding the 
epidermal growth factor receptor, which shows 50% 
homology with the HER2/ neu gene. 24 These data also 
suggest that the erb-B receptor family plays a significant 
role in SS. It has been : demonstrated that a variable 
number of osteosarcomas overexpress HER2/neu. 1M5 
However, more recent studies 20 - 25 - 26 were unable to 
detect any HER2/neu gene amplification and/or over- 
expression using fluorescence in situ hybridization 
(FISH), RT-PCR, and immun ©histochemistry. 

Differences in the techniques used may play an 
important role and explain (at least in part) these 
discrepancies. HER2/neu alterations can be evaluated 
using different techniques irrtluding immunohisto 
chemistry, FISH, Southern hybridization, Northern 
blot, and competitive, differential, or real-time PCR. 27 
I mmun ©histochemistry is the most common method 
for detection of HER2/neu overexpression, but it is 
significantly affected by the sensitivity and specificity of 
the antibodies used, ihe type of tissue (frozen versus 
formalin-fixed), and the various interpretative criteria 
and scoring systems used to evaluate cases. Indeed, 
most studies of HER2/neu expression in osteosarcoma 
used immunohistochemical techniques, with different 
monoclonal or polyclonal antibodies. The discrepancy 
in results may stern from the use of different antibodies, 
as well as a lack of standardized evaluation. 

For these reasons, to evaluate HER2/neu imrnu- 
noreactiviry in our study, we used a polyclonal antibody 
(Dako, Cai pimeria, CA), arguably the most diffuse and 
thoroughly tested antibody for H£R2/neu assessment. 
Furthermore, we investigated HER2/neu mRNA ex- 
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pression with real-time RT-PCR, because it has been 
demonstrated that mRNA levels correlates tightly with 
protein expression. 28 At present, real-time RT-PCR 
probably represents the most powerful tool for quanti- 
tative analysis, because it allows better internal control 
and reduction of sample contamination, and provides 
more objective results. 18 

We analyzed HER2/neu gene expression at the 
mRNA and protein level in 13 cases of SS. HER2/neu 
expression was found in aJI of the cases investigated, 
and mRNA content in the tumors varied from 2.1 to 
24 n. . 

The variability of mRNA levels in SSs is reflected 
on heterogeneity of protein expression pattern as de- 
tected by irnxnunohistochemistry. We found that 
HER2/neu imrnunoreactivity correlates strongly with 
mRNA levels. A convincing cytoplasmic imrnunoreac- 
tivity was documented with the polyclonal antibody in 
10 of 13 sarcomas. Distinct membranous staining was 
observed in 3 cases, although it was never comparable 
to the positive breast control. ]t was predominantly 
incomplete and identified in the epithelial/ epithelioid . 
component of SS. Interestingly, all of the cases with 
. strong cytoplasmic staining also exhibited a membrane- 
staining pattern. 

In breast cancers, a membranous pattern of stain- 
ing is thought to be specific for HER2/neu protein 
expression and correlated with gene amplification, 
whereas cytoplasmic staining is usually considered non- 
specific. 29 However, cytoplasmic positivity for HER2/. 
neu has been reported to be prognostkally significant 
in other tumor types, including bladder, colon, pan- 
creas, thyroid, and nasopharyngeal carcinomas, and 
even in breast cancer. 3 

Patients with high HER2/neu mRNA expression 
had a significantly lower risk of recurrence. Similarly, 
all of the cases with high HER2/neu expression did not 
metastasize, although this correlation did not reach 
statistical significance due to the small number of cases 
studied. These data suggest that HER2/neu plays a roJe 
in the biology of SS and that HER2/neu over expression 
may be linked to a less-aggressive clinical behavior. 
Indeed, unlike many cancers where HER2/neu overex- 
pression has been shown to correlate with poor prog- 
nosis, HER2/neu levels are linked to a more favorable 
clinical course in other malignant rumors, such as thy- 
roid carcinoma and osteosarcoma. 1534 

The molecular mechanisms responsible for the 
action of HER-2/neu in SSs are unknown. A possible 
interaction between H£R-2/neu and the other mem- 
bers of HER family could be important in tumorigen- 
esis. Derangements of other oncogenes, tumor suppres- 
sors, and apoptosis regulators have been described in 
SSs. For instance, many SSs have been shown to be 
diffusely positive for be 1-2 family proteins (be 1-2, bax, 
bcl-x, and bak). These members of the bcl-2 family are 
involved in the regulation of apoptosis in SS. 36 This 
raises the hypothesis that complex aJteiations in apop- 
tosis-controlling mechanisms are present in ihese neo- 
plasms, with HER-2/neu interacting with Bel- 2 family 



members. Further studies, are needed to clarify the 
mechanisms of apoptosis in SS. 

Depending on the size and location, the therapy of 
choice for SS is radical local excision or amputation.. 
Whenever radical surgery cannot be performed, radio- 
therapy in concert with local exdsion is suggested in an 
attempt to avoid the need for amputation. Only re- 
cently has a study evaluated the possible role of chemo- 
therapy in the treatment of SS. 

The role of molecular markers in predicting treat- 
ment responsiveness is currently the focus of extensive 
investigation. Breast cancer patients with high HER2/ 
neu expression appeared to benefit from high-dose 
CAF (cyclophophamide, adriamycin, and 5-fluoro ura- 
cil) therapy. 38 In our study, 3 of 4 patients with high 
HER2/neu expression received adjuvant chemother- 
apy with ifosfamide; these patients had a favorable clin- 
ical outcome. These data raise the possibility that 
HER2/heu may have value in predicting. which patients 
are likely to respond to specific adjuvant chemotherapy 
regimens. Whether the favorable significance of 
HER2/neu expression depends on predicting clinical 
recurrence, response to chemotherapy, or both re- 
mains to be fully elucidated in SS patients. ■ 

To the best of our knowledge, this is the first 
report that shows expression of HER2/neu in primary 
SS by real-rime RT-PCR Elevated levels of HER2/neu 
hiRNA and protein are found in a significant group of 
SS pa den is, and these levels appear to correlate with 
features of good prognosis. Furthermore, our results 
suggest that this mechanism of disease in SS may be the 
target of specific inhibitory therapies based on human- 
ized monoclonal antibodies. Considering the small 
number of patients examined,, further investigation is 
needed to confirm these preliminary findings. 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. 0rntoft:(:§, Thomas ThykjaerTI, Frederic M. Waldman||, Hans Wolf**, 
and Julio E. Celisift: 



Gain and loss of chromosomal material is characteristic 
ot bladder cancer, as well as malignant transformation in 
general. The consequences ot these changes at both the 
transcription and translation levels is at present unknown 
partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 
two-dimensional gel electrophoresis. The results showed 
that there is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms. This ef-. 
feet depended (p < 0.015) on the magnitude ot the com- 
parative genomic hybridization change. In general (18 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels. Be-, 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations in relatively few cases of well focused 
abundant proteins. With few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The implications, as well as limitations, 
of the approach are discussed. Molecular & Cellular 
Proteomics 1:37-45, 2002. 



Aneuploidy is a common feature of most human cancers 
(1), but little is known about the genome-wide effect of this 
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phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas (2), and studies of individual genes in solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels in the case of c-erb-B2, cyclin d1 t 
emsl, and N-myc (3t-5). However, a high cyclin D1 protein 
expression has been observed without simultaneous am- 
plification (4), and a low level of c-myc copy number in- 
crease was observed without concomitant c-myc protein 
overexpression (6). 

In human bladder tumors, karyotyping, fluorescent in situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non-invasive pTa transitional cell carcinomas (TCCs), 
this includes loss of chromosome 9 or parts of it, as well as 
loss of Y in males. In minimally invasive pT1 TCCs; the fol- 
lowing alterations have been reported: 2q-, 11 p-, 1q + , 
11q13+, 17q+, and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this investigation we have combined genome- wide tech- 
nology for detecting genomic gains and losses (CGH) with, 
gene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non- invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 

Material— Bladder lumor biopsies were sampled after informed 
consent was obtained and after removal ol tissue lor routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary); 

1 The abbreviations used are: CGH, comparative genomic hybrid- 
ization; TCC. transitional cell carcinoma; LOH, toss ol heterozygosity; 
PA-FABP, psoriasis- associated fatty acid-binding protein; 2D, 
two-dimensional. 
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Fig. 1. DNA copy number and mRNA expression level. Shown from left to right are chromosome (Chr.), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA in invasive tumor 733 as 
compared with the non-invasive counterpart tumor 335. B„ expression of mRNA in invasive tumor 827 compared with the non-invasive 
counterpart lumor 532. The average fluorescent signal ratio between tumor DNA and normal DNA is shown along the length of the chromosome 
(ten*). The bold curve in the ratio profile represents a mean of four chromosomes and is sunounded by thin curves indicating one standard 
deviation. The central vertical line {broken) indicates a ratio value of 1 (no change), and the vertical lines next to it (dotted) indicate a ratio of 
. 0.5 {/en*) and 2.0 {fight). In chromosomes where the noninvasive tumor 335 used tor comparison showed alterations in DNA content, the ratio 
prpfile of thai chromosome is shown to the right of the invasive tumor profile. The colored bars represents one gene each, identified by the 
running numbers above the bars (the name of the gene can be seen at www.MDL.DK/sdata.htmi). The bars Indicate the purported location of 
the gene, and the colors indicate the expression level of the gene in the invasive tumor compared with the non-invasive counterpart; > 2- fold 
increase [black), >2-fold decrease (blue), no significant change [orange). The bar to the far right, entitled Expression shows the resulting change . 
tn expression along the chromosome; the colors indicate that at least halt of the genes were up- regulated (b/ac/c), at least half of I he genes 
down-regulated (blue), or more than half of the genes are unchanged [orange). It a gene was absent in one of the samples and present in 
another, it was regarded as more than a 2- fold change. A 2- fold level was chosen as this corresponded to one standard deviation in a double 
determination of —1800 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade I and II, respectively, tumors 733 and 827 were staged as pT1 
(invasive into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparation— Tissue biopsies, obtained fresh from surgery, 
were embedded immediately in a sodium- guanidinium thiocyanate 
solution and stored at -80*C. Total RNA was isolated using the 
RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). 
poly(A)"* RNA was isolated by an oligo(dT) selection step (Oligotex 
. mRNA kit; Oiagen). 

cRNA Preparation— 1 **g of mRNA was used as starting material. 
The .first and second strand cDNA synthesis was performed using the 
Superscript® choice system (Invitrogen) according to the manufac- 
turer's instructions but using an oligo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled c RNA was prepared using the ME- 
GAscrip® in vitro transcription kit (Ambion). Biot in- labeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs in the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Oiagen). 

Array Hybridization and Scanning— Array hybridization and scan- 
ning was modified from a previous method (13). 10 ^g of cRNA was 
fragmented at 94 °C for 35 min in buffer containing 40 mM Tris 
acetate, pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization, 
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 m NaCI, 
10 mM Tris, pH 7.6, 0.005% Triton), was healed to 95 °C lor 5 min, 
subsequently cooled to 40 °C, and loaded onto the Aflymelrix' probe 
array cartridge. The probe array was then incubated for 16 h at 40 C C 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes in 6x SSPE-T at 25 C C followed by 4 washes in 0.5 x SSPE-T 
at 50 *C. The biot iny la ted cRNA was stained with a streptavidin- 
phycoerythrin conjugate, 10 /tg/ml (Molecular Probes) in 6x SSPE-T 
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Fig. 1 — continued 



for 30 min at 25 °C followed by 1 0 washes in 6.x SSPE-T al 25 °C. The 
probe arrays were scanned al 560 nm using a conlocal laser scanning 
microscope (made for Atfymetrix by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Asymetrix gene 
expression analysis software. 

Microsatellite Analysis— Microsatellite Analysis was performed as 
described previously (14). Mrcrosatelliles were selected by use of 
www.ncbLnlm.nih.gov/genemap98/ and primer sequences were ob- 
tained fr6m the genome data base at www.gdb.org. DNA was extracted 
from tumor and blood arid amplified by PCR in a volume of 20 lor 35 
cycles. The ampficons were denatured and eteclrophoresed for 3 h in an 
ABI Prism 377. Data were collected in the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele detected in tumor amplicons compared wilh blood. 

Proteomic Analysis— TCCs were minced into small pieces and 
homogenized in a small glass homogenizer in 0.5 ml of lysis solution. 
Samples were stored at -20 *C until use. The procedure lor 2D gel 
electrophoresis has been described in detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or Coomassie Brillianl Blue. Pro- 
teins were identified by a combination of procedures that included 
microsequencing, mass spectrometry, two-dimensional gel Western 
immunoblotting, and comparison with the master two-dimensional gel 
image ot human keratinocyte proteins; see biobase.dk/cgi-bin/cefis. 

CGH— Hybridization ot differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Fluorescein- labeled tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 pg) were 
denatured at 37 "C for 5 min and applied to denatured normal met- 
aphase slides. Hybridization was at 37 °C lor 2 days. Afler washing, 
the slides were count erst ained with 0.15 jig/ml 4,6-diamidino-2-phe- 
nylindole in an anti-fade solution. A second hybridization was per- 
formed tor all tumor samples using fluorescein- labeled reference DNA 
and Texas Red-labeled lumor DNA (inverse labeling) to confirm the 
aberrations delected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ralios, indicating regions of DNA gains and losses. The average 
greemred fluorescence intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the greemred fluorescence intensity ratio for the 
entire metaphase and background correction; Chromosome identifi- 
cation was performed based on 4,6-diamidino-2-phenylindote band- 
ing patterns. Only images showing uniform high intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
rinatic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization —The CGH analysis 
identified a number of chromosomal gains and losses in the 
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Table 1 

Correlation between alterations detected by CGH and by expression monitoring 

what expression ratio was found); bottom, altered expression used as 



Top, CGH used as independent variable (il CGH alteration 





Tumor 733 vs. 335 


Concordance 


CGH alterations 


Tumor 827 vs. 532 


Concordance 


CGH alterations 


Expression change clusters 


Expression change clusters 


13 Gain 
10 Loss 


10 Up-regu)ation 

0 Down- regulation 

3 No change 

1 Up-regulation 

5 Down- regulation 

4 No change 


77% 
50% 


10 Gain 
12 Loss 


8 Up-regulation 
0 Down-regulation 

2 No change 

3 Up-regulation 

2 Down regulation 
7 No change 


80% 
• 17% 


Tumor 733 vs. 335 

"Expression change clusters 

K CGH alterations 


Concordance 


Tumor 827 vs. 532 
Expression change clusters < 
• CGH alterations 


Concordance 


16 Up-regulation 
21 Down-regulation 
15 No change 


11 Gain 

2 Loss 

3 No change 
1 Gain 

8 Loss 

. 12 No change 
3 Gain 
3 Loss 

9 No change 


69% 
• 38% 
60% 


17 Up-regulation 
9 Down-regulation 
21 No change 


10 Gain 

5 Loss , 

2 No change 

0 Gain 

3 Loss 

6 No change 

1 Gain 
3 Loss 

17 No change 


59% 
33% 
81% 



two invasive tumors (stage pT1, TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-, and 7+ ; 9q~, 
and Y-, respectively. Both invasive tumors showed changes 
(1q22-24+, 2q14.1-qter-, 3q12~q13.3-, 6q12-q22-, 
9q34-f , 11q12-q13+, 17 + , and 20q11.2-q12+) that are typ- 
ical (or their disease stage, as well as additional alterations, 
some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4- fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubting of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig. ~\A) and 
20q12in TCC 827 (Fig. 18). 

mRNA Expression in Relation to DNA Copy Number— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
fTCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to. 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the invasive versus the non-invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fbld were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts in both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often regis- 
tered as having unaltered RNA levels (Table I, top). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2- fold as the independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no atter- 
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F»G 2 Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array 
monitoring. The aberration is shown .as a numerical -told change in ratio between invasive tumors 827 (A) and 733 ( ♦ ) and - their noninvasive 
counterparts 532 and 335. The expression change was taken from the Expression line to the right in Fig. 1, which depicts the resulting 
expression change for a given chromosomal region. At least half of the mRNAs from a given region have to be either up- or down- regulated 
to be scored as an expression change. AD chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of one were included. . 



ation in expression. No alteration was delected by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81%; see 
Table I, bottom). Because the ability to observe reduced or 
increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations in the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2). For both tumors TCC 733 (p.< 0.015j,and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array 
based technology (Fig. 2). Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction in expres- 
sion level, which is at the cut -oft point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
. much larger extent. 

Microsatellite-based Detection of Minor Areas of Loss : 
es— In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1 , TCC 733 
chromosome 1q32, 2p21, and 7q21*and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Rg. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript ihcr ease/decrease/in- . 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. 1/1). As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying that one of -the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomai material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D11S1760, D11S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2 t 12q21,1, and 16q24 
and in TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, 
and 18ql2 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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F(G. 3. Microsatellite analysis of loss of heterozygosity. Tumor 
733 showing loss ol heterozygosity a! chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocompatibility antigen (gene 
number 38 in Fig. 1), (6) by D1S2735 close to cathepsin E (gene 
number 41 in Fig. 1), and (c) at chromosome 2p23 by D2S2251 close . 
to general /J-spectrin (gene number 1 1 on Fig. t) and of (d) lomor 827 
showing loss of heterozygosity at chromosome 18q12 by S18Stt18 
close to mitochondrial 3-oxoacyf-coenzyme A thiolase (gene number 
12 in Fig. 1). The upper curves show the eleclropherogram obtained 
. from normal DNA Irom leukocytes (A/), and the tower curves show the 
eleclropherogram Irom tumor DNA (7). In all cases one allele is 
partially tost in the tumor amplicon. 

showing reduced mRNA transcripts. Only the microsatellite 
positioned at 18q12 showed LOH {Fig. 3), suggesting that 
transcriptional down- regulation of genes in the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels — 
2D-PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all lour tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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FtG- A.. Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided in 
three groups, unaltered in level or up- or down-regulated horizontal 
axis). The mRNA ratio. as determined by oligonucleotide arrays was 
plotted for each gene {vertical axis). A, mRNAs that were scored as 
present in both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent in the invasive tumors (along horizontal axis) or 
as absent in non-invasive reference (fop of figure). Two different 
scalings were used lo exclude scaling as a confounder, TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (#0) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences in mRNA ratios 
" between the groups. Proteins shown were as lollows: Group A (from 
left), phosphoglucomutase 1 , glutathione transferase class ft number 
4, fatty acid-binding protein homologue,. cytokeratin 15, and cyto- 
keratin 13; S (from left), fatly acid-binding protein homologue, 28-kDa 
heat shock protein, cytokeralin 13, and calcyclin; C (from left), a-eno- 
lase, hnRNP B1, 28-kDa heat shock protein, t4-3-3-«, and 
pre-mRNA splicing factor; D, mesothelial keratin K7 (type II); B (Irom 
fop), glutathione S-transferase-n and mesothelial keratin K7.(type II); 
F (from fop and left), adenytyl cyclase- associated protein, E-cadherin, 
keratin 19, calgi2zarin, phosphogfycerate mutase, annexin IV, cy- 
toskelelal 7-actih, hnRNP A1, integral membrane protein calnexin 
(IP90), hnRNP H, brain-type ctathrin light chain- a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucteoprolein A/B, 
Iranslationally controlled tumor protein, liver glyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na,K- 
ATPase 8-1 subunit; 6, (Irom fop and /eft). TCP20, calgizzarin, 70- 
kDa heat shock protein, calnexin, hnRNP H, cytokeratin 15, ATP 
synthase, keratin 19, triosephosphale isomerase, hnRNP F, liver glyc- 
eraldehyde-3- phosphatase dehydrogenase, glutathione S-transfer- 
ase-tf, and keratin 8; H (Irom left), plasma gelsolin, autoantigen cal- 
reticulin. thioredoxin, and NAD+ -dependent 15 hydroxyproslaglandin 
dehydrogenase; / (from top), prolyl 4-hydroxylase 0-subunit, cyto- 
keratin 20, cytokeratin 17, prohibition, and fructose 1,6-blphos- 
phatase; J annexin II; K, annexin IV; JL (Irom fop and left), 90-kDa heat 
shock protein, prolyl 4-hydroxylase 8-subunil, o-enolase. GRP 78, 
cyclophilin, and cofrlin. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures")- In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except lor a group of cyto- 
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keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal' 
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (fig. 4). For example, PA-FABP was highly ex- 
pressed in the non-invasive TCC 335 but lost in the invasive 
counterpart (TCC 733;. see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes in the protein level (Table II). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area in invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence. as to the effect of these gains 
and losses on gene, expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect. In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-ofl point 
for detection of expression alterations was a 2-fold change, 
thus being a! the border of detection. In several cases, how- 

Table ll 



Proteins whose expression level correlates with both mRNA and gene dose changes 


Protein 


Chromosomal location 


Tumor TCC 


CGH alteration 


Transcript alteration* 


Protein alteration 


Annexin II 


1q21 


733 


Gain 


Abs !o Pres" 


, Increase 


Annexin IV 


2p13 


733 


Gain 


3.9-Fold up 


Increase 


Cytokeratin 17 


17q12-q21 


827 


Gain 


3. 8- Fold up 


Increase 


Cytokeratin 20 


17q21.1 


827 


Gain 


5.6- Fold up 


* Increase 


(PA-)FABP 


8q21.2 


827 


Loss 


10-Fold down 


Decrease 


FBP1 


. 9q22 


827 


Gain 


2.3- Fold up 


Increase 


Plasma gelsolin 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heal shock protein 28 


15qt2-q13 


827 


Loss 


2.5-Fold up 


Decrease 


Prohibit in 


17q21 


827/733 


Gain 


3.7-/2.5-Fold up* 


Increase 


Prolyl- A -hydroxy I 


I7q25 


827/733 


Gain 


5.7-/1 .6 : Fold up 


Increase 


hnRNPBI . 


7p15 


827 


Loss 


2.5-Fbld down 


Decrease 



° Abs, absent; Pres, present. 

b In cases where the corresponding alterations were tound in both TCCs 827 and 733 these are shown as 827/733. 




F»G. 5. Comparison of protein and transcript levels in invasive 
and non-invasive TCCs. The upper pari ol the figure shows a 2D gel 
(te/f) and the oligonucleotide array {fight) ol TCC 532. The red rectan- 
gles on the upper gel highlight the areas. that are compared below. 
Identical areas of 2D gels ol TCCs 532 and 827 are shown below." 
Clearly, cytokeratins 13 and 15 are strongly down-regulated in TCC 
827. (red anno/at/on). The tile on the array containing probes for 
cytokeratin 15 is enlarged below the array [red arrow) from TCC 532 
and is compared with TCC 827. The upper row ol squares in each tile 
corresponds to perfect match probes; the lower row corresponds lo 
mismatch probes containing a mutation (used for correction lor un- 
speciFic binding). Absence of signal is depicted as black, and the 
higher the signal the lighter the color. A high transcript level was 
detected in TCC. 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13, a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom of the figure (left) show levels ol PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (invasive), respectively- Both proteins are 
down-regulated in the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript. A medium transcript level was de- 
tected in the case of TCC 335 (1277 units) whereas very low levels 
were delected in TCC 733 (166 units). IEF. isoelectric focusing. 
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectivery. Some of these, transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors (e.^ in TCC 733 transcript from cellular ligand of 
annexin II gene (chromosome 1q21) from absent to 2670 
arbitrary units; in TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in Ihis area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie in the loss of controlled 
methylation in tumor cells (17-19). Thus, it may be possible 
. that in chromosomes with increased DNA copy numbers two 
or more alleles could be.demethylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but in this case all 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p-;9q-, 1q+,Y- 
(2, 6). and in pTl tumors, 2q-,11p~, 11q~, 1q+, 5p+, 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q- and Y-, respectivery. Likewise, the two minimal invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 1 1q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p-, often 
linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal instability using this technique/ Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with increased copy numbers. Analysis ol these regions 
by positioning heterozygous microsatellites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss ol heterozygosity in several cases. II seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploidy and cannot predict whether- 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an impact on the expression level in normal cells and is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to translational 
regulation, post -translational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertrahslated mRNA pools, which are associated with few 
translationally inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important in the case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells. as deter- 
mined by arrays and 2D- PAGE (25), and a moderate correla- 
tion was recently reported by Ideker et a/. (26) in yeast. 

Interestingly, our study revealed a much better correlation 
between gained chromosomal areas and increased mRNA 
levels than between loss of. chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript. One possible 
explanation could be that by losing one allele the change in 
mRNA level is not so dramatic as compared with gain ol 
material, which can be rather unlimited and may lead to a 
severalfold increase in gene copy number resulting in a much 
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss ol chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
. quently amplified area in chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger-number of sam- 
ples. One factor making such studies complicated is the large 
extent of protein modification that occurs after translation, 
requiring immunoidentification and/or mass spectrometry to 
correctly identify the proteins in the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high, reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived. from genes with 
known locations, and 2D gel analysis, to obtain information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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Expression of bcr-abl mRNA ia individual chronic myelogenous leukaemia 
cells as determined by in situ amplification. 

Pachmanu K, Zhao S , SchenkT . Kantarjian H . El-Naggar AfC Siciiiano MX Guo 
JO, Arlinghaus RB . Andreeff M . 

The University of Texas M.D. Anderson Cancer Center, Department of Molecular. 
Haematology and Therapy, 1515 Holcombe Boulevard, Houston, TX 77030, USA. 

We present the results of a novel method developed for evaluation of in situ 
amplification, a molecular genetic method at the cellular level. Reverse transcription 
polymerase chain reaction (RT-PCR) was used to study bcr-abl transcript levels in 
individual cells from patients with chronic myelogenous leukaemia (CML). After 
hybridizing a fluorochrome-labelled probe to the cell-bound RT-PCR product, bcr-abl 
mRNA-positive cells were determined using image analysis. A dilution series of bcr-abl- 
positive BV173 into normal cells showed a good correlation between expected and actual 
values. In 25 CML samples, the percentage of in situ PCR-positive cells showed an 
excellent correlation with cytogenetic results (r = 0.94, P < 0.000 1 ), interphase 
fluorescence in situ hybridization (FISH) (r = 0.95, P = 0.001) and hypermetaphase FISH 
(r- 0.81, P < 0.001). The fluorescence intensity was higher in residual CML cells after 
interferon (IFN) treatment than in newly diagnosed patients (P = 0.004), and was highest 
in late-stage CML resistant to IFN therapy and lowest in CML blast crisis (P - 0.001). 
Mean fluorescence values correlated with bcr-abl protein levels, as determined by 
Western blot analysis (r = 0.62). Laser scanning cytometry allowing automated analysis 
of large numbers of cells confirmed the results. Thus, fluorescence in situ PGR provides a 
novel and quantitative approach for monitoring tumour load and bcr-abl transcript levels 
. in CML. 
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Correlative Immunohistochemical and Reverse 
transcriptase Polymerase Chain Reaction Analysis of 
Somatostatin Receptor Type 2 in Neuroendocrine . 
Tumors of the Lung 



— Mam u P apottirMJ^rS^br^ 

Aola Funaro, Pb.D:, Carta Pecchioni, Marcus Schindler, mjx,- arid 
7 Gianni Bussolati, M.D., F.R.C.Path. 



" Somatostatin receptors type 2 (sst2) have been frequently de- 
" lected in neuroendocrine tumors and bind somatostatin ana- 

- logues, such as octreotide, with high affinity. Receptor autora- 
diography, specific mRNA detection and. more recendy, anri- 

: ssQ polyclonal antibodies are currently, employed to reveal 
• ■ sst2rThe aim of the present study was to investigate by three 
' ■ diflejeni techniques the presence of sst2 in a series of 26 neu- 
■ roendocrine rumors of the lung in which fiesh frozen tissue and 
paraffin sections were available. It was possible, therefore, to 
compare, in individual cases. RNA analysis studied by reverse 

- transcriptase polymerase chain reaction (RT-PCR), in situ hy- 
' bridization (ISH), and jjiununohistochemistry. A scries of 20 

nonneurbendocrine lung carcinoma samples served as controls. 
" RT-PCR was positive for sst2 in 22 of 26 samples, including 15 
of. 15 typical carcinoids, 5 of 6 atypical carcinoids, and 2 of 5 
small-cell carcinomas. Tbe sst2 mRNA signal obtained by RT- 
PCR was strong in the majority (87%) of typical carcinoids and 
of variable intensity in atypical carcinoids and small-cell car- 
cinomas. A weakly positive signal was observed in 5 of 20 
control samples. In irnmunohistocheinistry, two different arjti- 
bodies (anti-sst2) were employed, including a monoclonal an- 
tibody, generated in the Department of. Pathology, University 
of Turin. In the majority of samples a good correlation between 
. sst2 mRNA (as detected by RT-PCR) and sst2 pjotein expres- 
sion (as detected by immunohisiochemisDy) was observed. 
However*, one atypical carcinoid and one small-cell carcinoma 
had focal immunostaining but no RT-PCR signal. 1SH jxi- 
formed in selected samples paralleled the results obtained with 
the oOiei techniques. A low ssl2 expression was associated with 
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high grade neuroendocrine tumors and with aggressive behav- 
ior. It is concluded that 1) neuroendocrine rumors of the lung 
express ssi2 > and there is a correlation between the mRNA 
amount and the degree of differentiation; 2) imrounohistochem- 
istry and 1SH are reliable tools to demonstrate" sst2 in these 
tumors; and 3) sst2 identification in tissue sections may provide 
information on the diagnostic or therapeutic usefulness of so- 
matostatin analogues in individual patients with neuroendO; 
crine rumors. 

Key Words: Neuroendocrine — Lung — Tumors — Somato- 
statin receptors — Immunohistcxhemistryv— Small cell carci- 
noma — Reverse transcriptase polymerase chain reaction. 

Diagn Mol Pathol 9(1): 47-57, 2000. 



The somatostatin receptor family '(sst) includes at least 
five isoforms that have been recently identified and char- 
acterized (18,32,41). The ssts are widely distributed in 
normal human tissues and in human tumors. Sst type 2 is 
more commonly detected in neuroendocrine rumors 
(32,37) and binds the somatostatin analogue octreotide 
with high affinity! 

Sst localization had originally been demonstrated by 
means of binding assays of radiolabeled somatostatin 
analogues (20,25,31). Subsequently, specific sst messen- 
ger KNA (mRKA) detection was obtained by means of in 
siru hybridization (1SH) and reverse transcriptase poly- 
merase chain reaction (RT-PCR) (14,32,37). Recently, 
polyclonal antibodies specific for different isoforms of 
sst were produced and used in irjimunohistocbemistry 
(10,12,15,1830,35,36). Giventhe well-known heteroge- 
neity of neoplastic. populations, in situ methods (uxunu- 
nohistocbeiiiistry and JSH) allow a more definite map- 
ping of Ore distribution of ihe receptor in such tissues. 
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This is potentially useful for predicting the responsive- 
ness of a given neoplastic cell population to medical 
treatment with somatostatin analogues, which are used in 
the clinical setting for both diagnostic and therapeutic pur- 
poses with special reference to neuroendocrine tumors. 

The spectrum of neuroendocrine tumors of the lung 
includes well-differentiated neoplasms (so-called typical 
carcinoids) and poorly differentiated small-cell carcino- 
mas (SCCs). Intermediate forms sharing features of both 
the aforementioned types also belong to this spectrum 
. (so-called atypical carcinoids or well-differentiated neu- 
roendocrine carcinomas). Finally, large-cell neuroendo- 
- — Gririe-caTGinoma-ha£-been4dentifjed and includedin-this— 
tumor group (4,40). The tissue distribution of sst2 in 
neuroendocrine tumors of the lung has not been thor- 
oughly characterized, although individual samples of 
: bronchial carcinoids were found to express sst (30). 
SCCs (but riot non- small-cell types) were also shown to 
be sst2 positive by receptor binding assay (33). More- 
over, sst2 has been detected in in vitro cell cultures of 
. human SSC of the lung (39,42). No study on a series of 
neuroendocrine tumors of the lung including all neuro- 
endocrine lung tumor types has been reported to date. 

The aim of this study was therefore to investigate the 
presence of ss l2 mRNA and protein in a series of 26 
neuroendocrine tumors of the lung, employing different 
technical approaches, such as RT-PCR, JSH, and immu- 
nobistochemistry. To this purpose a monoclonal anti- 
v body to sst2 (N-terminal) was generated in the Depart- 
ment of Pathology, University of Turin. The results were 
then compared and related to the rumoT grade and to 
. other clinicopathologic parameters. 

MATERIALS AND METHODS 

Case Series and RNA Extraction 

Twenty-six samples of neuroendocrine rumors of the 
lung, in which fresh frozen tissue was available, were 
' retrieved from the surgical pathology fjJe of the Univer- 
sity of Turin, Italy. All samples were reviewed applying 
currently accepted criteria of classification (4,40), and 
the neuroendocrine nature was confirmed by positive im- 
munostaining for chromogranin A (CgA) (with or with- 
out antigen retrieval) or synapiophysin, and by positive 
RT-PCR for CgA mRNA. According to the classi- 
fications described here, these included J 5 well- 
differentiated neuroendocrine tumors (typical carci- 
noids), 6 well-differentiated neuroendocrine carcinomas 
(atypical carcinoids), and 5 SCCs. 

A series of 20 non- small- cell lung carcinomas (30 
squamous, 9 adenocarcinomas, and 1 lajge-cel) anaplas- 
tic) lacking neuroendocrine differ cniiaiion, as demonstrated 
by ne<? alive irnmunohistochemistry and RT-PCR for CgA 
(1), served as a control group. Clinicopathologic data and 
follow- up informaiioD were obtained for all pauents. 



For hybridization ana]ysis, total RNA was extracted 
using the guanidine thiocyanate-cesiura chloride method 
(5). The concentration of RNA was estimated by spec- 
trophotometry, and RNA degradation was assessed by 
agarose gel electrophoresis, as previously reported (31). 

Reverse Transcriptase Polymerase Chain Reaction 
for ssL2 and Chromogranin A 

Total RNA (2 p,g) was first digested, with 10 units of 
RNase-free DNase (Boehringer, Mannheim, Germany) 
in a 10-mJL solution containing 20 mmol/L MgCl 2> to 
. avoid DNA contamination. The solution was kept at 
room temperamre fortO miDutesrihen healed for 5 min- 
utes at 70 D C to inactivate the DNase molecules; 40 
pmol/L of oligodeoxythymidine primers (oligo-dT16) 
were added and the solution was heated again at 70°C for 
10 minutes, then chilled on ice to allow the primer hy- 
bridization. The resulting solution was reverse tran- 
scribed using 1 00 units of reverse transcriptase (Gibco 
BRL, Gaiiersburg, MD). Complementary DNA (cDNA) 
was generated in a 50-jjlL final reaction volume contain- 
ing 50 mmol/L Tris-HCl pH 8.3, 75 mrool/L KCl 3 
mmol/L MgCl 2 , JO mmol/L dithiothreitol, 1 rrtmol/L de- 
oxynucfeodde triphosphates (dNTPs), and 20 units of 
RNasin (Proroega, Madison, W3). The solution was 
heated at 37 °C for 90 minutes. Finally, the enzymes were 
inactivated by hearing to 70°C for 10 minutes. 

The efficiency of the reverse transcription was deter- 
mined by performing a PGR reaction having the 0 2 - 
microglobulin "housekeeping gene" as a target. PCR was 
carried out in a 10-p.L final reaction volume containing 
1 pi, of cDNA template, 10 pmol of sense and antisense 
oligonucleotide primers, 67 mmol/L Tris-HQ pH 8.8, 16 
mmol/L (NH 4 )2S0 4 , 0.03 % polysorbate 20, 2 mmol/L 
dNTPs, 1 mmol/L MgCl 2 , and.Q.5 units of Taq polymer- 
ase. 0 2 - Microglobulin, sst2, and CgA PCR reactions were 
performed using the same protocol at the following PCR 
conditions: 35 cycles, each cycle consisting of denaruration 
at 94°C for 2 minutes, annealing at 55 D C for 1 minute for 
(V microglobulin, at 61 D C for sst2, and at 68°C for CgA; . 
extension was performed at 72°Cfor 1 minute. The primers 
used for RT-PCR (9,1 3 ,2337) are reported in Table 1 . 

The amplified fragments were run in a 1% agarose gel, 
containing .ethidium bromide. Strict precautions against 
contamination weje undertaken (19) and negative con- 
trols (a no- template control and a no^-reverse transcriptase 
control and distilled water to replace the RNA) -were in- 
cluded. The RNA extracted from an H716 neuioendocrine 
colon carcinoma cell line, and from a neuroblastoma (37) 
served as positive controls for CgA and sst2, respectively. 



Antibodies 

Two different antibodies specific for ssi2 were em- 
ployed: The first one was a monoclonal antibody raised 
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TABLE 1. Sequences of primers used lor reverse 


transcriptase polymerase chain reaction 




Size of PCR 
product (bp) 


Position 


Study 


1IITR2 anfcense: 5' GCA AAG , ACA XAT GAT GGT 6A 3" 
« rnA sense- 5' GCT CCA AGA CCT CGC TCT CC 3 
6 ST. !!?t!hL.^RAC CGA CTC 7C6 CCT TTC CG 3' 


120 
2B4 
583 


286-305 
389-408 
402-421 
665-684 
316-335 
878-807 


Gussow et aL (B) 
Sestin!etal.(37) 
Helman et a). (11) 





V ■ - pCBi polymerase chain reaction. 

^in -me Department of Pathology (University of Turin) 

^fir for an N-tenmp^e fl u^rjC^of ■ftejsgisharsd- 
>7ty both A and B receptor isoforrns). The octapeptide 
;; EPYYDLTS corresponding to amino acids 35 to 42 of 
r the human receptor (and differing by one amino acid 
; *rroirj the mouse sequence), was synthesized, having a 
; ' ^<dn added to tbe N-terrriinaL This sequence was similar 
: totbat used by other groups to produce polyclonal ant> 
bodies (17,18,27). This .sequence was rather short but 
made it possible to avoid extensive homology with sstl. 

• In addition, according to a genbank search using FASTA 

- (28) this protein sequence is unique to human sst2 and 
."-•has a partial homology only with rat and human nuclear 

V receptor retinoid orphan nuclear- receptor-beta (a protein 
: - bavin- nuclear localization). Three Balb/c mice were im- 

- munizwl with the peptide conjugated to keyhole limpets 
Vhemocyanin (KLH) (Sigma, St. U)uis, MO) following 

the standard procedure. After the first intrasplemc rajec- 
tion (1.00 ixg of protein) at time 0. the mice were intra- 
peritoneally injected six times with the pepbde-KLH 

• conjugate (150 ixg) in the presence of Freund adjuvant. 
The inactivity of the sera from each animal was evalu- 
ated using an enzyme-Jinked immunosorbent assay, us- 

■ - ing ihe peptide coated onto the plasric. The bybridomas 
/. were produced by somatic fusion of immunized spleno- 

cytes with the mouse myeloma cell line Ag8.X63.653, 
^.following the standard jecbriique (21). Tbe monoclonal 
^ anbbodies of interest were selected on tbe basis of the 

■ reactivity with the target peptide and with appropriate 
tissue sections. The latter included formal in-fixed and 

\ parafTm-erobedded sections of pituitary, gland and pan- 
cieatic islets and were analyzed by means of iramuno- 
peroxidase staining. Parallel control experiments were 
also rxrforrned by staining serial sections of these tis- 
sues, emitting tbe primary antibody oi with the preim- 
raune serum oi widi the antibody pjeadsoibed with high 
concentrations (1 rog/mL) of the antigenic peptide. In 
addition, the selected monoclonal antibodies (coded 
)0C6and l0G4),.boib oflgM isotype, weie further char- 
acterized by Western blotting. Membranes were prepared 
from stable uansfected Chinese hamster ovary (CHO> 
K] cells, individually expressing recombinant human so- 
matostatin receptors (sstl to sst5). ^Western b)oiUn S was 
performed as previously described (36). Tbe monoclonal 



antibody was used as culture supernatant at 1:3 dilution 
Jr^> 'horns at room temperatuTe. in Tris-bufTered saline 
(TBS) supplemented with 0.1% poJysorbate 20. Blots 
were washed in TTBS and incubated with peroxid.ase- 
conjugated goat antimouse IgM, diluted 1:1-500 for 90 
minutes at room temperature. Then, blots were washed in 
TTBS and immunocoroplexes were visualized using 
ECt followmg.roanufacrurer's instructions (Amexsbam, 

Bucks, UK). " 

A second polyclonal antibody was produced that bad 
been characterized previously (35,36). This antibody 
(coded K230) was raised in sheep and was specific for a 
sequence of the C- terminal portion of the sst2A^(KSRL- 
OTTTETQRTLLNEDLQ, amino acids 347 to 366). 

. Irjocmunobistocbeirastry 

Sections 4 or 5 u, thick, adjacent to those used for 
conventional histopathologic examination and irnmuno- 
stoning for neuroendocrine markers, were collected onto 
poly-L-)ysine-coated slides. The proliferative activity of 
the tumors was assessed by means of K567 immunosta- 
nining (clone MB31. Imrounotecb, Marseille. France), 
diluted 1:10 after microwave -based antigen retrieval in 
citrate buffer). Tbe ascitic fluid of monoclonal antibody 
10G4 was used in this study and was applied to tissue 
sections with prior antigen retrieval (ibree 3-minute pas- 
sages in a.roicTOwave oven at 800 "W in citrate buffer pH 
6.0), at the dilution of 1:10,000 or 1:12,000 for 30 min- 
utes' at room temperature, tbe antiserum coded K230- 
was applied overnight at a dilution of 1:300 with rx> prior 
antigen retrieval. The immune reactions were then re- 
vealed with the immunoper oxidase technique (13) using 
the streptavidin-peroxidase kit and diaminobenzidme as 
chroroogen. A weak nuclear coumeistain or no countei- 
stain was used in parallel sections. Control stagings for 
both antibodies included immunopei oxidase of serial 
sections using preirnroune serum oi antibody piead- 
sorbed with the antigen or buffer instead of the primary 
antibody. 

In Siru Hybridization 

Selected tumors (12 samples) wcie also analyzed fo* . 
sst2 inRNA expression by means of a nonradioactive* 
tyramide deposition-based 3SH technique. Tbe procc; 
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dure of amplification was modified from procedures re- 
ported by Kerstens et al. (16); Speel e| ah (38), and the 
GenPoint (biotinyMynamide) manufacturer (Dako, Glos- 
trup, Denmark). Briefly, 5-ujn-thick paraffin sections 
were collected onto silane-coated slides and deparaf- 
flni2ed through xylene and graded alcohols to phosphate 
buffer saline (PBS). The slides were then incubated for 5 
minutes in a microwave oven at 800 W in citrate buffer 
pH 6.0. After washing in PBS, they weie digested with 
proteinase K (J p-g/mL) for 10 minutes at 23°C Endog- 
enous peroxidase activity was blocked with 3% hydro- 
gen peroxide and endogenous biorin was blocked using 
'^dhr-blockrDg j ea gentrfor-15 miputes-foBow ed by 



washing in PBS and biotin-blocking reagent for 15 min- 
utes (3). Sections were then prebybridized for 1 hour at 
room temperature in a mixture composed of 4x SSG, 
50% foiroamide, Denhardt's lx, dextrane sulfate 5x, 500 
fig/mL salmon sperm DNA, and 250 M-gvmL tRNA. Hy- 
bridization took place overnight at A2°C in a solution 
containing the specific probe at a concentration of I 
pmol/mL The probe was a digoxigenin-labeled 4 8- base 
oligonucleotide (32), complementary to positions 91 lo 
139 of the human sst2 gene (41). After hybridization,, 
excess hybridization buffer and coverslips were removed 
by a rapid wash in 4x SSC followed by striogent washing 
in 0.1 x SSC for 10 minutes at 42°C The hybrids were 
revealed by the following incubation steps: peroxidase- 
labeled antidigoxigenin (diluted 1:100 in PBS) for 30 
minutes at room Temperature, bioiinyjated tyramide (di- 
luted 3:5 in PBS) for 15 minutes at room temperature, 
and peroxidase-labeJed streptavidin for 15 minutes at 
room temperature. Diaminobenzidine was used as chro- 
mogen. Controls for 1SH included staining of serial sec- 
tions with sense probe, an unrelated probe (EBER-l of 
the Epstein-Barr virus), and omission of the probe in the 
hybridization mixture, with all other experimental con- 
ditions identical to the procedure described here. 



RESULTS 

Reverse Transcriptase Polymerase Chain Reaction 

All neuroendocrine tumors, but no nonneuroendocrine 
rung carcinomas, were positive for CgA mRNA (Fig- 1). 
Sst2 mRNA was amplified in 22 of 26. samples of neu- 
roendocrine rumor. The signals had variable intensities 
(Fig. 2) and were weak in moderately or poorly differ- 
entiated tumors (mostly in SCCs). No amplification was 
obtained in no-template or no-reverse transcriptase ex- 
periments. Control samples (nonneuroendocrine lung 
. carcinomas proven by negative CgA RT-PCR) were 
weakly positive for sst2 in 5 of 20 samples only (includ- 
ing 3 adenocarcinomas, 1 squamous, and the large-cell 
anaplastic carcinoma) (Fig. 3). These, differences were 
statistically significant (P < 0.01) by X 2 test 

Characterization of Monoclonal Antibodies io sst2. 

Several clones were identified having a positive bind- 
ing by enzyme-linlced immunosorbent assay and a par- 
allel immun ore activity' on formalin-fixed paraffin- 
embedded human endocrine tissues (pituitary and pan- 
creatic islets). In Western blotting experiments, two 
clones (coded 10C6 and 10G4) specifically developed a 
band at approximately 70 WX When the antibodies were 
used against CHO-transfecied cells expressing recombi- 
nant somatostatin receptors 1 through 5, a specific band 
corresponding io sst2 (at approximately 70 kD) was re- 
vealed by the monoclonal antibody 10G4. Monoclonal 
antibody J0C6 developed a strong band with sst2 but 
displayed a weaker reactivity also with sstl, 3, and 5, al 
least in the present experimental conditions (Fig. 4 AJB). 
The same antibodies were also tested by means of irn- 
rounoper oxidase staining, on formalin- fixed, paraffin- 
embedded samples of normal human pituitary gland and 
pancreas. Monoclonal anubody 1QG4 gave good results 
in immunohistochemisiry and .was used at increasing; di- 




1 2 3 4 5 6 7 8 9 JO 11 12 13*14 15 16 17 18 19 20 21 22 23 24 25 26 C C* 

. FIG. 1. Reverse transcriptase polymerase chain reaction for chromograniri A (CGA) mRNA 
in 26 samples ot neuroendocrine tumor ot the lung. Numbers in each lane correspond to 
sample numbers in Table 2. CGA mRNA is amplified at 429 bp. C and C* stand lor positive 
(neuroendocrine colon carcinoma cell line, H716) and negative (distilled water) controls, 
respectively. The las! column to ihe right represents the molecular weight marker. All samples 
are positive with a variable intensity ol the amplification band.. 
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FIG. 2. Reverse transcriptase polymerase chain reaction lor sst2 mRNA in 26 samples of 
neuroendocrine tumor of the lung. Numbers in each lane correspond to sample numbers in 
Table 2. sst2 mRNA is amplified at 284 bp. C and CVstand lor positive, (a neuroblastoma) ■ 
and negative (distilled water) controls, respectively. The last column lo the right represents 
the molecular weight marker. Twenty-two of 26 samples are positive with a variable Intensity 
of the amplification band. 



lotions (up to 1:15,000) with specific staining. Using thin 
' sections (approximately 4 u-m), a strong merhbrane- 
bound and peripheral cytoplasmic imrnunoreactivity was 
found in an adenohypopbyseal cell population (corre- 
sponding to growth hormone- secreting cells, as con- 
firmed by double immunohistochemical analyses) and in 
pancreatic islets (Fig. 4 C,D). In the latter, the staining 
was apparently not restricted to a specific hormone- 
producing cell type and had a peripheral cytoplasmic or 
membrane distribution. Exocrine pancreatic cells (both 
acinar and ductal) were only occasionally immuno- 
stained. lrmTjunohistc>chernisrry performed on serial con- 
trol sections, either omining the primary, antibody or us- 
ing the preimmune serum or antibodies preabsorbed with 
the synthetic peptide, was negative in both tissues. 
Monoclonal antibody 10C6 had a relatively higher back- 
ground staining at similar dilutions. 

Irnniunohistochemistrj* " 

The antibodies tossi2 (monoclonal antibody J0G4 and 
polyclonal K230) gave slightly different imrnuDoreac- 
lions in 25 samples, and staining was not done in J 
sample because of lack of residua) paraffin blocks. The. 
monoclonal antibody 10G4 stained 21 of 25 samples, the 



negative samples being 1 atypical carcinoid and 3 SCCs 
(Fig. 5). The rumors had 5% to 25% of the neoplastic 
cells immunoreactive. The staining was at the periphery 
of the cytoplasm, and omitting the counterstain its mero- 
brane-bound distribution was better outlined in most 
samples (Fig. 6). One 1 sample of atypical carcinoid <no. 
21) was focally immunoreactive for sst2, despite nega- 
tive RT-PCR findings. Conversely, sample no. 26 was 
imrnunohistochemistry negative and RT-PCR positive. 
The antiserum anii-sst2A (code K230) gave positive sig- 
nal in 1 9 of 25 samples, in 5% to 60% of the neoplastic 
cell population (Fig. 7). The location of the staining was 
at the membrane level associated with a weak cytoplas- 
mic reactivity. The same pattern was seen in positive 
controls, e.g., pancreatic islets (Fig. 7, inset). Two 
samples (nos. 19 and 26) were negative in spite of a 
positive RT-PCR signal. Two other tumors (nos. 2] and 
-22), apparently devoid of sst2 mRNA,. showed a small 
percentage of immunoreactive cells. Incidentally, one of 
these latter samples (no. 21) was also immunoreactive 
with monoclonal antibody 10G4 (Table 2). 

The five control samples positively RT-PCR were 
also reactive with the antibodies. The type of imrnuno- 
cytocbemical location of sst2 receptors was similar to 
that described here, being a peripheral cytoplasmic stain- 




M 1 2 3 4 5 6 7 8 9 10 11 12 13 14 35 16 17 18 19 20 C C* M 

HG. 3. Reverse transcriptase polymerase chain reaction for.sst2 rnRNA in 20 control 
samples cl nonneuroendocrine tung carcinoma. Five ol 20 samples show a weak band at 
204 bp corresponding to sst2 mRNA. Control columns (C and C") are identical to those in 
Fig. 2. 
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FIG. 4. Western blot analysis of monoclonal antibody 
clones 10G4 and 10C6 against sst2 in Chinese hamster 
ovary cells transfected with recombinant sst 1 through 5 
(numbers of each column "correspond to receptor type). 
Monoclonal antibody 10G4 shows a specific band at ap- 
proximately 70 kD for sst2 only (A) as opposed to mono- 
clonal antibody 10C6, which strongly reacts with sst2 but 
also has some degrees of cross- re activity with sst 1, 3, 
and 5 (B). The lower figures show control forma lin-fixed 
pa raff in- embedded pancreatic islets immunostained with 
monoclonal antibody 10G4 without (C) and with (D) 
preadsorption with the peptide antigen, respectively. The 
majority of endocrine cells show a membrane- bound irn- 
munoreactivity (C) (immunoperoxidase). Bar; 90 pm. 



ing present in 40% to 70% of neoplastic cells. A weak 
and focal staining was also observed in five of the re- 
maining RT-PCK- negative samples, when the antibody 
K230 was used (but not when the monoclonal was em- 
ployed). 

Several cells in peritumoral tissues were occasionally 
stained. Ciliated cells of bronchial mucosa had a periph- 
eral staining at the cilia border. Mucous glands were 
negative. Kaie chondrocytes had a membrane staining. 
The wall of peritumoral as well as of occasional distant 
vessels was stained at the endothelium level and in oc- 
casional smooth muscle cells. 

.. The reactivity 'of both antibodies was abolished in se- 
rial sections when ihe reagents were preabsorbed with 
the respective synthetic peptides, but not when an unre- 
lated pepiide was used. The peri rum oral bronchial mu- 
cosa had a foca) staining of ciliated cells with both an- 
tibodies. This reactivity disappeared when the pre ab- 
sorbed antibody was applied. 



In Situ Hybridization 

Eight of 12 samples stained by 1SH were positive for 
sst2 mRNA. The mRNA was present in a percentage of 
cells (ranging from 10% to 40%) and gave a weak signal 
(Fig. 8), despite the amplification provided by the tyra- 
mide-based procedure. The background level was mini-, 
mal using diluted biotinylated tyramide. Control sections 
stained with sense probe or an unrelated probe, or omit- 
ting the probe, were consistently negative. 

Clinical Data 



Qinicopathologic data are summarized in Table 2. At 
follow-up, the majority of patients with typical carci- 
noids are free from disease 1 to 1 1 years after surgery. 
Two patients are alive with stable .metastatic disease. 
Patients affected by atypical carcinoids had disease pro- 
gression in one third of samples. Finally, patients with SCC 
had fatal outcomes* within 1 year from diagnosis (except the 
recent sample). Eight patients had preoperative octreotide 
scintigraphy performed at the time of diagnosis. All patients 
had positive octreoscan findings; *and, in these patients, 
also the tumor was positive by RT-PCR and immu- 




FIG. 5. sample no. 25 (small cell carcinoma). Absence of 
innmunoi eactivity lor sst2 with the monoclonal 1 0G4. This 
sample was also negative by reverse transcriptase poly- 
merase chain reaction and in situ hybridization. (Immuno- 
peroxidase in a formalin- fixed par a (fin- embedded sample. 
Ntfclei slightly counterstained with hemalum.) Bar: 45 prn. 
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nohistochemistry or ISH. In addition, three of these pa- 
tients received octreotide therapy administered at the 
time of tumor recurrence or metastatic spread. Stable 
disease is recorded at follow-up more than 5 years after 
diagnosis. 

'Correlations 

Overall, complete overlapping (i.e., RT-PCR, ISH, 
and immunohistochernistry with two antibodies) be- 
tween sst2 gene and protein expression was obtained in 
21 of 25 samples (84%) and between RT-PCR results 
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. the antibodies in 24 of 25 samples (96%). The rnonoclo- 
. pal antibody 10G4 looked highly sensitive, being able to 

"■- * stain all but one sample (no. 26) (95%) positive for sst2 
;.**.*. niRNA by RT-PCR. Sst2 expression, at mRNA as well 

. , as at protein levels, was reduced in high grade tumors, 
* ' with SCCs being weakly positive in only two of five 
samples. Decreasing expression of sst2 appears to cor- 




FIG. 6. sample no. 16 (typical carcinoid). Immunohisto- 
chemical detection ol sst2 by means ot monoclonal anti- 
body 10G4. The neoplastic cells have a peripheral cyto- 
plasmic staining and membrane positivity in some xells, 
whereas the peribronchial gland adjacent to the tumor is 
unreactrye. (Immunopef oxidase in a foimalin- fixed paraf- 
fin-embedded sample. Nuclei slightly counterstained with 
hemalum.) Bar: 45 pm. The membrane hound distribution 
ol the immunostaining is better outlined in a parallel sec- 
tion stained tor monoclonal antibody 10G4 omitting ', 
nuclear counterstain (inset). 




FIG. 7. Same sample as in Fig. 6. tmmunohistocbemtcal 
detection ol sst2 by means of the polyclonal antibody 
K230. The immunostaining is more intense at the cell bor- 
der (arrows), as observed with the monoclonal antibody. 
In the inset, a pancreatic islet, used as positive control, 
shws a predominant membranerbound immunostaining of 
many neuroendocrine cells. (tmrnunoperoxidase in a for- 
malin-fixed paraffin-embedded sample. Nuclei slightly 
counterstained with bemaJum.) Bar 45 pm. 

relate with high tumor grade and elevated proliferative 
activity, but not with other parameters such sex, age, or 
tumor size. 

DISCUSSION 

In this study, the presence of sst2 mRNA has been 
demonstrated in a series of resected neuroendocrine ru- 
mors of the Jung by.means of RT-PCR and confirmed by 
a sensitive nonradioactive tyramide-based JSH procedure 
and by immunohistochernistry with anti-sst2 antibodies. 
Samples of both carcinoid tumors and SCCs were ssi2 
positive, although a reduced, or absent signal was ob- 
served in poorly differentiated (small : cci]) carcinomas. 
This is the first siudy of sst2 expression in a relatively 
larpe series of neuroendocrine tumors of the lung. Single 
samples of human carcinoids and SCCs (including cell 
lines of ihe latter) had previously been analyzed and 
found to express ssi2 (7,15:30,32,33,39,42). Several 
methods have been used to detect these receptors and 
partially overiappmg results-were obtained. 

hi the present study, the expression of high amounts of 
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TABLE 2. Clinicopaihologic data and somatostatin receptor type 2 (sst2) expression in 26 cases of neuroendocrine 

- lung tumors 



Patient 






Size 


Follow-up 


CgA 


CgA 


SYP 


Ki67 


sst2 


sst2IHC 


sst2 Jr 


no. 


Diagnosis 


Sex/age 


. («?>. 


(mo) 


IHC 


RT-PCR 


IHC 


IHC* 


RT-PCR 


Mab(10G4) 


(K230j 


1 


WD NET 


F/35 


35 


NED 90 




44+ 


+ F 


1.5 


44-4 


4 


. 4 


2 . 


WD NET 


F/2? 


4 » 


NED 45* 




44 


4 F 


3 


++ 


4 


4 


3 


WD NET 


F/41 


2,5 


NED 70 


+ 


44 


4- 


0.1 


444 


"4 


4 F 


4 


WD NET| 


F25 


4 


NED 21 


-f 


44 


4 


2.6 


4+4* 


4F 


+ F 


5 


WD NET| 


M/58 


3.8 


NED 23 


+ 


+ 44 


4 


13 


4+4 


4 F 


+ 


6 


WD NET 


M/52 


Z5 


NED 42 




+4 


4 


. 4.5 


+4+ 


+ p 


+ F 


7 


WD NET 


F/69 


-3,5 


NED 47 


+ 


44 


4 


1 


4 


*4 


4 


a 


WD NET 


W29 


3 


NED 70 




444 


4. 


1.5 


444 


"4 


4 


9 . 


WD NET 


W27 . 


4 


NED 108 


+ 


444 


4 


NT 


444 


■ 4 


4 


10 


WD NE Cat 


M/66 


*B 


AWD55 




44 


4F 


1.1 


44 


4F . 


4 


11 


\ iir\ til" i 

WD NETi 


F/29 


2 


A WD 56 


+ 


444 


4 


25 


44+ 


4 




12 


WD NET 


F/32 


"""3 


NED.26 




44 


4 


4 *" 


4+4 


4 ■ 


4 


13 


WD NE Ca 


M/60 


3 


NED 133 


4 


444 


•4 


1 


444 


NT 


NT 


14 


WD NET 


M/28 


' 4 


NED 130 


4 


444 , 


4 


1 


444 
+44 


4 


4 


. 15 


WD NET4 


M741 


1.3 


A WD 53 


4 


444 


+ 


1.5 


4 


4 


16 


WD NET| 


F/31 


1 


NED 13 


+ 


44 


+4 


2.6 


4 


.4 


4 


17 


WD NET 


F/53 


4 


NED 24 . 


. 4 


44 + 


+ F. 


4 


+++ 


4F 


4 


16 


WD NECa 


M/62 . 


3 


NED 6 




44 




13- 


+44 


4 


4F 


19 


WD NE CA 


F/73 


5 


DOD20 


4 


4+4 


4 


3 


44 


4 




20 


sec 


M/57 


6 


DOD12 


+ 


+4 


+ 


45 


+ 


+ F 


+ F 


21 


sec . 


M/51 


4.5- 


DOD5 


4 


4 


4 


35 




+.r 


+ F 


22 


WD NE Ca 


M/60 


6 


NED 51 


4 


44- 


4 


1.5 






4 


23 


sec 


F/56 


6 


DOD11 


4 


4 


4 


50 








24 


WD NE Ca 


W77 


2.5 


NED 21 


4 


+ 


4 


24 


+ 


4F 


+ F 


25 


see 


W57 


5 


DOD10 


4 


++ 


44 


80 








26 


sect 


M/68 


.11 


recent case 


4 


4 


44 


71 


4 







AWD, alrve with disease; CgA, chromogranin A; DOD. died of disease; * F, focal: positive in <5% of celts; IHC; Immunohistochemistyy; 
Mab, monoclonal antibody; NECa, Neuroendocrine carcinoma; NED, no evidence of disease; NET, neuroendocrine tumor; NT, not tested; 
RT-PCR. reverse transcriptase polymerase chain reaction; SCC, SmalKcell lung carrinoma; SYP, synaptophysin; Wp, well differentiated. 

' Ki67 IHC: values correspond to percentage of positive nuclei of neoplastic cells. 

t Patients who had preoperative odreoscan performed. 

+ Patients who had octreoscan performed and octreotide treatment. 



sst2 mRNA was cojnrfirmed in well to moderately differ- 
entiated neuroendocrine rumors, in agreement with the 
results obtained by Reubi et a). (32) by means of radio- 
active 1SH. The presence of sst2 mRNA in SCC had 
never been reported in human specimens, except for two 
samples included in Reubi et al's series (32). Although 
the data on cell lines support the observation that SCCs 
contain sst2 (42), slightly discrepant results were found 
in some of samples described here. Unfortunately, SCCs 
are rarely operated on, and therefore it is difficult to 
collect a- large number of surgical- specimens. The Jive 
samples studied in the current series by means of RT- 
PCR had a low amount (iwo samples) or absent (three 
samples) sst2 mRNA. This could be the result of the 
extensive necrosis commonly present in such rumor 
types. However, because care was taken to freeze frag- 
ments thai weje .macroscopicaJly devoid of necrotic ar- 
eas, a more likely hypothesis is that sst2 expression is 
reduced in poorly differentiated tumors. Recently, Reis- 
inger et a). (29) showed that the uptake of somatostatin 
analogues in patients with SCC undergoing chemo- 
therapy is significantly lower, and therapeutic externa] 
factors" may affect rne receptor status of individual ru- 
mors. In addition, the upiake of somatostatin analogues 



in metastatic deposits of SCC has been shown to be low 
or absent (29,2). The present findings suggest that the 
sst2 mRNA content is related to the degree of rumor 
differentiation. These. data roust be corifirmed in larger 
series of nonneurbendocxine rumors to ascertain whether 
the observed loss oi decrease of sst2 expression in neu- 
roendocrine rumors is a common, event linked to neo- 
plastic dedifferentiation. In addition, further studies are 
needed to assess the functionality of such receptors, by 
comparing the profile of sst2 expression in tumor tissues 
with binding assays employing labeled somatostatin and 
with the clinical response to diagnostic and therapeutic 
adrninistjanpn of somatostatin analogues. 

To this "purpose, several investigators have demon-' 
strated a correlation between clinical imaging or re- 
sponse to somatostatin analogue treatment, and ssi2 
mRNA content in single . samples of carcinoid rumors 
(15,22). Northern blotting and 1SH were the techniques 
used for ssl2 mRNA identification. This kind of corre- 
lation is useful for selecting patients for somatostatin 
analogue treatment, although the demonstration of recep- 
tor mRNA in a cell does not imply per se thai the recep- 
tor is fully functional. 
The present study relied on a highly sensitive icch- 
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FIG. 8> sample no.- 44- (typical carcinoid). In -sttu- hybrid- 
ization (ISH) for ss12 mRNA shows a weak cytoplasmatlc 
staining (A) in most tumor cells. An ISH performed with an 
unrelated probe was negative in a serial section of the 
same tumor (B). This sample was strongly positfve by 
reverse transcriptase polymerase chain reaction for sst2 
mRNA and by immunohistochemistry. (Nonradioactive 
ISH revealed by peroxidase and diamtnobenzidine, as 
substrate. Nuclei counterstained with hemalum.) Bar: 
75 urn. ' - 

nique,. RT-PCR, to identify aft samples bearing even 
small amounts of ss\2 mRNA. Indeed, in a previous 
study, single samples exhibiting ocrreotide-binding sites 
had no demonstrable sst2 mRNA by means of ISH, pos- 
sibly due to tbe low sensitivity of the ISH procedure (34). 
The RT-PCR bas shown sst2 mRNA transcripts in tbe 
majority of samples here studied. Only four samples 
were negative, all belonging to poorly differentiated higb 
grade tumors, which usually follow an aggressive course. 
A decrease of sst2 mRNA expression in association with 
neuroendocrine rumor dedifferentiation had also been re- 
ported in neuroblastomas (37). In tbe above report, as 
well as in the current study, samples having an unfavor- 
able prognosis were found to contain a relatively low 
amount of ss!2 mRNA, as compajed wjtb well- 
differentiated tumors. 

In the current sample series, eight samples were in- 
vestigated before surgery with radiolabeled octreotide. 
Despite the low figures, all the samples positive at the 
diagnostic procedure had a strong RT-PCR signal for 
sst2 mRNA. Three of eight patients were also responsive 
to octreotide ueatment administered at the time of je- 
!apse or metastatic spread. More extensive correlative 
clinicopathologic studies on the sst starus are needed to 
better define the tissue distribution, of somatostatin bind- 
ing sites and their potential clinical role in the treatment 
of patients. 

Sst2 evaluation by means of ISH (14,32) or RT-PCR 
(26,37) is a highly sensitive and reliable procedure. Un- 
forrunatcly, diese icchniques have limitations because 
hozen tissue is needed for some of them, and radioactive -- 
tnaiejjal or costly and time-consuming methods are nec- 



essary for others. Immunobistocbemical analysis of sst2 
by means of specific antibodies represents an ideal> 
cheap, and rapid alternative, easUy applicable to aicbiva] 
material. For (hese reasons, several investigators have 
. raised polyclonal antibodies specific for sst (8,10,15,17, 
18). In tbe current study, tested tumor fragments adjacent 
to those snap frozen for RT-PCR analysis were tested 
with a polyclonal antibody against a C-terminal portion 
of the sst2A splice variant (35,36). In addition,^ mono, 
clonal antibody was produced in the Department of Pa- 
thology (University of Turin) against an N-tenninal se- 
quence of tbe human sst2. This antibody was tbe fust 
monoclo n al devel oped a&ainst sst2 and was.sbown to be 
highly specific for sst2 in Western blot and immunobis- 
tocbemical analysis. Both tbe monoclonal and tbe poly- 
clonal antibodies specifically reacted with all. samples 
also positive by RT-PCR (with minor discrepancies in 
two samples, likely due to rumor heterogeneity). Tbe 
observed correlation between RT-PCR and unmunobis- 
tocbemistry indicates that the latter may be a reliable 
diagnostic tool and may allow immunom^tocbernical in- 
vestigation for ssl2 even in small biopsy samples. Tliis in 
rum may enable a rapid screening of sst2-poshive rumors 
for medical treatment with somatostatin analogues. 

Having confirmed in a relatively large series that tbe 
vast majority of. neuroendocrine tumors of tbe lung con- 
tain variable amounts of sst2 mRNA, a final comment is 
deserved for sst2 expression- in nonneiuoendocrine lung 
carcinomas. No data have been reported thus far in tbe 
literature concerning normal human lung, although in tbe 
present study some bronchial cells of peritumoral paren- 
chyma were positive for sst2 when immunobistocbemi- 
cal analysis was performed with either antibody. Tbe 
staining was specific because it was abotished using pre- 
absorbed antibodies. Therefore, it is likely that normal 
human lung tissue contains sst2. This might be con- 
firmed by alternative techniques (e.g.,' Western blot, RT- 
PCR). However, in situ morphologic procedures, such as 
those employed here, have definite advantages. In fact, 
the lung is rich in vessels, and in several tissues (either in - 
tumoral or in inflamniatory- re active conditions) the ves- 
sels were recendy shown to contain sst (6). 

A low expression of sst2 was found in 25% of lung 
carcirjomas of nonneuroendocrine type investigated in 
the present study by means of RT-PCR. Therefore, sst 
type" 2, at least; does not appear to be extensively ex- 
pressed in nonneuroendocrine carcinomas of the lung. 
However, because two tumors in the control group (a 
squamous carcinoma and an adenocarcinoma, respec- 
tively) had positive octreotide scintigraphy, but no sst2 
mRNA, it is plausible that a" heterogeneous distribution 
of sst occurs in nonneuroendocrine lung rumors. Other 
receptor types may be expressed in these tumors arid may 
be responsible for the positive results in diagnostic test- 
ing. Because sso. is also known to bind somatostatin 
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analogues, such as octreotide, with high affinity (24), the 
expression of this jeceptor type will be investigated in 
future studies. O 

AcfcnowJedgroeuts: Tbe authors are grateful to Dr. 
Filosso (Turin) for clinical data, to Mrs. M. Cerrato and Miss 
Solero for skDlful technical help, a»d to Mn A. Grua for the 
photographs. 

REFERENCES 

1. Abbona GC, Paponi M. Vibeni L, Macri L, Stella A. Bussolali G. 
' Cbromogranin A gene expression in non- small cell lung carcino- 

dmTJ Pathol 1 998;1 86:1-6- : 

2. Berenger N, Moreni JL, Boaziz C, Vigneron N, Morere IF, Brtao 
JL. Somatostatin receptor imaging in small cell lung cancel. Eur J 
Cancer 1996;32:J429-3L * 

3. Bussolali G, Guglioria P, VpJanttM, Pace M, Paponi M. Retrieved 
endogenous biotro: a novel marker and potential pitfall in diag- 
nostic mununonistCKbemisrjy. Bistopaihology 1 997*3 1;400V7- 

4. Capella C, Heitt PU, Bofler H, Solcia E, JCloppel G. Revised 
classification of neuroendocrine rumors of ihc lung, pancreas and 
gut; VirchowsArch 1995;425:547-oU 

5. Chirgwin JM, ftrzybyla AE, Mac DoDald RJ. Runer W. Isolation of 
biologically active RNA from sources enriched in ribonuclease- 
Biochemistry 1979;18:5294-7, 

6. Denzler B, Reubi JC. Expression of somatostatin receptors in peri- 
rumoral veins of bum an rumors. Cancer 1999;85:189-98. 

7. Fujit* T, Yamnji Y, Soto M, Murao JC Taxahara J, Gene expres- 
sion of somatostatin receptor subtypes, SSTR1 and SSTR2, in 
human lung cancer cell lines. Life Set 1994 ;55: 1797-806. 

8. Gu WZ, Scbonbninn A. CoupKng specificity be r ween somatostatin 
receptor sst2A and G proteins: isolation of the recepror-G protein 
complex with a jeceptor antibody. Mot Endocrinol J 997; J 1:527- 
37. 

9. Gussow D, Rein R, Ginjaar 1, Hocbstcnbach F, Seemann G/Kott- 
man A. Ploegh HI- The human beta 2- microglobulin gene: pri- 
mary structure and definition of the transcriptional unit J Immunol 
1987;J 39:3 132-8. 

10. Hclboe L, Moller M, Nonegaaid L, Schiodt M, Sudsen CE. De- 
velopment of selective antibodies against the buman somatostatin 
receptor subtypes Sstl-sst5. Brain Res Mol Brain Res 1997;49: 
82-8. 

1). Hebnan D, Ahri TG, Levine MA, et al. Molecular cloning and. 
primary structuic of human cbiomogranin A (secretory protein 1) 
cDNA. J Biol Chem 1988,263:1 1559-o3. 

12. Holland LJ, Liu Q, Van Koetsveld-PM, et al. Jmmunohisiocbem- 
ical detection of somatostatin receptor subtypes sstl and sst2A in 
buman somatostatin receptor positive rumors. J Clin Endocrinol 
A^roi 1999;84;775-80. 

13. Hsu SM, Raine L, Fangei H. Use of avi dine -peroxidase complex 
(ABC) in iminunoper oxidase techniques: comparison between 
ABC and unlabeled antibody (PAP) procedures. J Hiswehem Cy- 
tochem 1981;29:577-80. 

14. lanson ET, Gobi A, Kalkne KM, Obero K. A comparison between 
the efficacy of somatostatin receptor scintigraphy and thai of in situ 
hybridization foj somatostatin receptor subrype 2 messenger RNA 
10 predict tbeiapeotic outcome in carcinoid patients. Cancer Bes 
199636:2561-5. 

15. lanson ET, Stridsberg M, Gobi A, Westiin JE, Oberg K. Determi- 
nation of somatostatin receptojs subtype 2 m carcinoid rumoj"s by 
immuDohistochemical investigation ^iib somatostatin teceptor 
subtype 2 antibodies. Cancer Rn 1998;58:23;5-8. 

16. Kerstens HM1, Poddighe PJ. Hanselaai AGJM. A novel in siru 
hybridization signal amplification rjiethod based on the deposition 
of biotinylaied lyr amine. J Hi\io<hcm Cynochem 1995:43:347-52. 

17. JCrisch B, Fejndt J. MenUein R. ) mmoeocjecuonotitioscopic 
analysis of Ujc li^and- induced internab^jtiou of ine soiiiaiosiin 

Oiafn. Mol Pmhol Vol 9. .A'o. K 'J(XO ■ - T*~' 



receptor subtype 2 in cultured buman glioma cells. 7 Histochem 
Cytochem 1998;46:1233-^Z 

18. Kumar U; Laird D, Snlcant CB, Eschcr E, Pattl YC. Expression of 
tbe five somatostatin receptors (SSTR1-5) subtypes in rat pituitary 
somatotrophes: quantitative analysis by double-lflbcl nnrauno/h> 
oresceoct confocal microscopy. Endocrinology 1997;138:4473-6. 

19. Kwok S. Higuctri R. Avoiding false posifivts wdth PCR- Nature 
1989^39:237-8. 

20. Lamberts SWJ, Holland IJ, Koeuveld FH Rwhi JC, Bruining. 
HA, Batker WH, Krcnnin? EP. Parallel in vivo and in vitro de- 
tection of functional soroatosmtin receptors in human endocrine 
pancreatic tumors: consequences with regard to diagnosis, local- 
ization and therapy: J Clin Endocrinol Mctat 1990;7I ^66-.74. 

. 2L Malavasi F, Funaro A, Bellone G, et al. Functional and molecular 
characterization by tbe CB04 monoclonal antibody of a cell surface 
.structure exerting C3-corDplement receptor activity. J Clin frnmu- 
^ dqU5K5^4.12=2£L^ 

22. Nilsson O, Kolby U Wangberg B, el al Comparative studies on tbe 
expression of somatostatin receptor subtypes, outcome of octreo- 
tide scintigrapby and response to octreotide treatment in.paticDts 
with carcinoid rumours. Br J Cancer 1998;77:632-7. 

23. Pagani A, Fomi M, Tonini GP, Paponi M, Bussolali G. Expression 
of members of the cbiomogranin family in primary neuroblasto- 
mas. Dwgn Mol Pathol 1992;1:16-24. 

24. Panel ta R^ Greenwood MT, WarszynsXa A. et al Molecular clon- 
ing, functional characterization and cbjomosomal localization of a 
human somatostatin receptor (somatostatin xecepior type 5) with 
preferential affinity for somatostatin-28. Mol Pharmacol 1994;45: 
417-27. 

25. Papoio* M, Macri L. BussoJao G, Reubi iC ConeUtive study oh 
neuroendocrine differentiation and presence of somatostatin recep- 
tors in breast carcinomas. Jnt 3 Cancer 1989;43:365-9. - 

26. paponi M, Macri L, Pagani A, Aloi F, BussoIbd* G. QuantitaboD of 
somatostatin receptor type 2 in neuroendocrine (Meikel cell) car- 
cinoma of ibe sbn by competitive RT-PCR. Endocr Paihol 1999; 
10:1-10. 

27. Patel YC, Panena R. Escher E, Greenwood M, Srikant CB. Ex- 
pression pi multiple somatostatin receptor genes in AlT-20 cells: 
evidence for a novel somaiostadn-28 selective receptor subrype. J 
Biol Chem 1994;269:1506-9. 

28. Pearson WR, Lipman DJ. Improved tools for biological sequence 
comparison. Proc Natl Acod Sci USA 1988;85:2444-8. 

29. Reisinger 1, Bohuslavizki KM, Brenner W, et al Somatosiatin re- 
ceptor scintigraphy in small -cell lung cancer results of a multi- 

* center study. J Nucl Med 1 99839:224-7. 

30. Reubi JC. Kappeler A, Wsaer B. Laissoe J, Hipiin RW, Scbon- 
bnmn A. Irajnunohistccbemical localization of somaiosiann tecep- 
tors sst2A in human rumors. Am J Pathol 1998;153:233-45. 

31. Reubi JC, Mawer R„ von Wcrder K. Torborst J, KJijin JGM, Lam- 
bens SWJ. Somatostatin receptors in human endocrine tumors. 
Cancer Res i*987;4 7:55 1-8. 

32. Reubi JC, Scbaer JC, Waser B, Mengod G. Expression and local- 
ization of somatostatin receptors SSTR1, SSTR2, SSTR3 messen- 
ger RNA in primary human rumors using in situ hybridisation. 
Cancer Res 1994.^4:3455-9. 

33. Reubi JC, Waser B, Sbeeppard M. Macaulay V. Somatostario re- 
cepiors aie piesent in small-cel) but not in non-imail-cel) primary 
Jung carcinomas: relationship to EGF receptors. Jnt J Cancer 1990; 
45:269-74. 

34. Scbaer JC, Waser B, Mengod C, Reubi JC- Somatostatin receptors 
subtypes sit), sst2, sst3 and sst5 expression in human pituitary, 
gasDoeniciO-pancreatic and mammary -tumors: comparison of 
mRNA analysis with receptor aotoradiograpby. Jnt J Cancer 1997; 
70:530-7. 

35. Schindler M. Hoiloway S, Humphrey PPA. Waldvogel H, FauJI 
RLM, Bereei W, Eibsod PC. Localization of the somatostatin s"?t2a 
receptor in human cerebral con ex, hippocampus and cerebellum. 
Neurorepon 1998;9:521-5. 

36. Schindler M. Sellers LA, Humphrey PPA, Emson PC. Lnimmo- 
bisioc hemic al locsJizaiion of the somatostatin sst2{A) icceptor in 
die rat brain nnd spinal cord. Nenroiciencc 1997;76:225-40. 

37. Sestini R. Ojlando C, Peri A. ei al. Quantification of somatostniin 



SOMATOSTATIN RECEPTOR TYPE 2 JN NEUROENDOCRINE LUNG TUMORS 



57 



receptor type 2 gent expression in neuroblastoma: cell lines and 
primary tumors using, competitive reverse transcriplion-pojy- 
jnexase chain reaction. Clin Cancer Res 1996;2:1757-65. 

38. Spccl EJM, Sareinaslani P, Roth J. Hopman AHN, KomminDih P. 
improved mRNA in situ hybridi22tion on formaldehyde-fixed and 

* - paraffin-embedded tissue using signal amplication with different 
baptenized tyjanrides. Histochem Cttt Biol 1998;420:1-7. 

39. Taylor JE, Tbeveniau MA, Bashirzadch R, Keisine T, Eden PA. 
Detection of somatostatin receptor subtype 2 (SSTR2) in esteb- 

' lisbed rumors and tumor cell lines: evidence for sst2 heterogeneity. 
Pepthkz 1994;15:1229-36. 



40. Travis WD, Gal AA, Colby TV, Kfimstra DS/Faft R, Koss MR 
Reproducibility of neuroendocrine hrog tumor classification. Hum 
PmW 1998;29:272-9. 

4). Yamada Y, Post SK Wang K, Tager H> Bell GI, Seino S. Ctoning 
and functional characterization of a family of human and mouse 
somatostatm receptors expressed in brain, gastrointestinal trad and 
kidney- Biochemistry 1992;89:251-5. 

42. Zhang CY, Yokogosbi Y, Yoshimoto K, Fujinaka Y, Matsumota 
K> Saito S. Point mutation of the somatostatin receptor 2 gene in 
the human small cell lung cancer cell line COR-L103. Biochem 
Biophys Res Commun 1995;210:805-15. 



15 °k Virch0W ? Ardh 2002 M ay; 44 0(5):46l-75. Epub 2002 Mar 23. Related Articles. Links 
M Spr ing erLin k 

Expression of somatostatin receptor types 1-5 in 81 cases of 
gastrointestinal and pancreatic endocriue tumors. A correlative 
immunohistochemical and reverse-transcriptase polymerase chain 
reaction analysis. 

Pa P otti M > BonRiovanni M, Volante M . AHia E , LandoIfiS . Helboe L , Schindler M 
Cole SL , Bussolari G . r\ 

Department of Biomedical Sciences and Oncology, University of Turin, Via Santena 7, 
10126 Turin, Italy, mauro.papotti@unito.it 

Somatostatin receptors (SSTRs) have been extensively mapped in human tumors by • 
means of autoradiography, reverse-transcriptase polymerase chain reaction (RT-PCR), in 
situ hybridization (ISH) and immunohistochemistry (IHC). We analyzed the SSTR type 
. 1-5 expression by means of RT-PCR and/or IHC in a series of 81 functioning and non- 
functioning gastroenteropancreatic (GEP) endocrine rumors and related normal tissues. 
Moreover, we compared the results with clinical, pathological and hormonal features. 
Forty-six cases (13 intestinal and 33 pancreatic) were studied for SSTR 1-5 expression 
using RT-PCR, IHC with antibodies to SSTR types 2, 3, 5 and ISH for SSTR2 mRNA. 
The vast majority of rumors expressed SSTR types 1 , 2, 3 and 5, while SSTR4 was 
detected in a small minority. Due to the good correlation between RT-PCR and IHC data 
on SSTR types 2, 3, and 5, thirty-five additional GEP endocrine rumors were studied with 
IHC alone. Pancreatic insulinomas had an heterogeneous SSTR expression, while 100% 
of somatostatinomas expressed SSTRS and 100% gastrinomas and glucagonomas 
expressed SSTR2. Pre-operative biopsy material showed an overlapping 
immunoreactivity with that of surgical specimens, suggesting that the SSTR status can be 
detected in the diagnostic work-up. It is concluded that SSTRs 1-5 are heterogeneously 
expressed in GEP endocrine tumors and that IHC is a reliable tool to detect SSTR types 
2, 3 and 5 in surgical and biopsy specimens. 
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P-cadherin overexpression is an indicator of clinical outcome in invasive 
breast carcinomas and is associated with CDH3 promoter 
hypomethylation. 

Paredes J , Albergaria A. OHvcira JT. Jcronimo C. Milanczt F. Sclimitt PC . 

Institute of Pathology and Molecular Immunology of Porto University (IPATIMUP) 
Braga, Portugal, jparedes@ipatimup.pt 

PURPOSE: P-cadherin overexpression has been reported in breast carcinomas where it' 
was associated with proliferative high-grade histological tumors. This study aimed to 
analyze P-cadherin expression in invasive breast cancer and to correlate it with tumor 
markers, pathologic features, and patient survival. Another purpose was to evaluate the P- 
cadhenn promoter methylation pattern as the molecular mechanism underlying this gene 
regulation. EXPERIMENTAL DESIGN: Using.a series of invasive breast carcinomas, P- 
cadhenn expression was evaluated and correlated with histologic grade, estrogen 
receptor, MIB-1, and p53 and c-erbB-2 expression. In order to assess whether P-cadherin 
expression was associated with changes in CDH3 promoter methylation, we studied the 
methylation status of a gene 5'-flanking region in these same carcinomas. This analysis 
was also done for normal tissue and for a breast cancer cell line treated with a 
demethylating agent. RESULTS: P-cadherin expression showed a strong correlation with 
high histologic grade, increased proliferation, c-erbB-2 and p53 expression, lack of . 
estrogen receptor, and poor patient survival. This overexpression can be regulated by 
gene promoter methylation because the 5-Aza-2'-deoxycytidine treatment of MCF-7/AZ 
cells increased P-cadherin mRNA and protein levels. Additionally, we found that 71% of 
P-cadhenn-negative cases showed promoter methylation, whereas 65% of positive ones 
were unmethylated (P = 0:005). The nornial P-cadherin-negative breast epithelial cells 
showed consistent CDH3 promoter methylation: CONCLUSIONS: P-cadherin expression 
was strongly associated with tumor aggressiveness, being a good indicator of clinical 
outcome. Moreover, the aberrant expression of P-cadherin in breast cancer might be 
regulated by gene promoter hypomethylation. 
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Mammary-derived growth inhibitor protein and messenger ribonucleic 
acid concentrations in different physiological states of the gland. 

Politis 1, Gorcwit RC , Mutler T, Grossc R . 

Department of Animal Science, Cornell University, Ithaca 14853. 

Expression of mammary-derived growth inhibitor in tissue from Iactating and involuting 
bovine mammary glands was investigated. Seventeen Iactating, pregnant (220 to 272 d in 
gestation) cows were divided in two groups of 8 and 9 cows each. Cows of the first group 
were slaughtered while in lactation. Cows of the second group (9 involuting cows) were 
slaughtered at 13 to 52 d following sudden cessation of milking. High concentrations of 
mammary-derived growth inhibitor (.63% of the total protein) were detected in mammary 
tissue of Iactating cows. Mammary-derived growth inhibitor (less than .10% of the total 
protein) was dramatically reduced during most of the involution period (13 to 45 d 
following cessation of milking). Mammary-derived growth inhibitor was again detected ' 
(.28% of the total protein) during the last stage of the involution (46 to 53 d after 
cessation of milking), which coincided with colostrum formation. When steady state 
concentrations of mammary-derived growth inhibitor mRNA were examined, the results 
obtained mirrored those obtained at the protein concentration. These data suggest that 
regulation of mammary-derived growth inhibitor occurs via modulation of the steady 
state concentration of its mRNA. Furthermore, there is a strong correlation between 
mammary-derived growth inhibitor expression and lactation in dairy cows. 
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T-cell receptor V beta-family usage in primary cutaneous and primary 
nodal T-cell uon-Hodgkin's lymphomas. 

Prcesman AH , HuHZ, TUanus MG, deGeus B, Schuurman HJ. ReitemaR v,„ 
Wichea PR van Vlotcn WA . de Weger RA . ~ ~ " — 

Department of Pathology, University Hospital Utrecht, The Netherlands. 

To evaluate whether the expression of T-cell receptor (TCR) V beta families in eight 
cases of malignant T-cell lymphomas took place in a preferential manner, we analyzed 
four cases of mycosis fongoides (MF), the most common form of primary cutaneous T- 
cell non-Hodgkin's lymphomas (NHL), and four cases Of primary nodal T-cell NHL The 
usage of V beta families in T-cell populations was investigated on mRNA that was 
transcribed to cDNA using a C beta primer and reverse transcriptase. Subsequently the 
specific usage of the families was analyzed by polymerase chain reaction (PCR) using 
combinations of the selected C beta-oligonucleotide primer and one of the family-specific 
V beta primers. Peripheral blood lymphocytes from four healthy volunteers and 1 
reactive" lymph node served as a control and expressed all 20 V beta families tested for. 
in T-cell lines, with restricted V beta expression, and in three patients with advanced MF 
only one.or two V beta families were expressed at the mRNA level. In an early MF lesion 
this monoclonal expression was absent: several V beta families were expressed with a 
weak intensity. This may indicate either a polyclonal origin of MF, or that too few 
monoclonal neoplastic cells were present in the tissue specimen. In the four nodal T-cell 
NHL, only one family could be clearly distinguished, whereas some of the other V beta 
families showed only a weak expression. These latter families represent the reactive T- 
cell component in the nodal T-cell NHL. Both in nodal T-cell NHL and in MF there was 
no preferential expression of a particular V beta family. Ttiere was a good correlation 
between PCR data and the expression of V beta^family protein products observed by 
immunohistochemistry on tissue sections of the T-cell lymphomas. AH T-cell lines three 
cases of MF, and three cases of nodal T-cell NHL showed a rearrangement of the TCR 
beta chain on DNA level. 
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Matiiliu-3 in human articular cartilage: increased expression in 
osteoarthritis. 

P»K*g Q , Wcsetoh G . Klatt AR . Wagener R, Swoboda B . 

Division of Orthopaedic Rheumatology, Department of Orthopaedics, University of 
Erlangen-Nuremberg, Rathsberger Str. 57, D-91054 Erlangen, Germany. 
oliver.pullig@med.uni-erlangen.de 

OBJECTIVE: Matriiin-3 is a member of the recently described matrilin family of 
extracellular matrix proteins containing von Willebrand factor A- like domains. The 
matrilin-3 siibunit can form homo-tetramers as well as hetero-oligomers together with 
subunits of matrilin- 1 (cartilage matrix protein). It has a restricted tissue distribution and 
is strongly expressed in growing skeletal tissues. Detailed information on expression and 
distribution of extracellular matrix proteins is important to understand cartilage function 
. in health and in disease like osteoarthritis (OA). METHODS: Normal and osteoarthritic 
cartilage were systematically analysed for matrilLn-3 expression, using 
immimohistqcherrustry, Western blot analysis, in situ hybridization, and quantitative 
PCR. RESULTS: Our results indicate that matrilin-3 is a mandatory component of 
mature articular cartilage with its expression being restricted to chondrocytes from the 
tangential zone and the upper middle cartilage zone. Osteoarthritic cartilage samples with 
only moderate morphological osteoarthritic degenerations have elevated levels of 
matrilin-3 iriRNA. In parallel, we found an increased deposition of matrilin-3 protein in 
the cartilage matrix. Matrilin-3 staining was diffusely distributed in the cartilage matrix, 
with no cellular staining being detectable. In cartilage samples with minor osteoartliritic 
lesions, matrilin-3 deposition was restricted to the middle zone and to the upper deep 
zone. A strong correlation was found between enhanced matrilin-3 gene and protein 
expression and the extent of tissue damage. Sections with severe osteoartliritic 
degeneration showed the highest amount of matrilin-3 mRN A, strong signals in in situ 
hybridization, and prominent protein deposition in the middle and deep cartilage zone. 
CONCLUSION: We conclude that matrilin-3 is an integral component of human articular 
cartilage matrix and that the enhanced expression of matrilin-3 in OA may be a cellular 
response to the modified microenvironment in the disease. Copyright 2002 Osteo Arthritis 
Research Society International. 
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Up-regulation of mitochondrial peripheral benzodiazepine receptor 
expression by tumor necrosis factor alpha in testicular Icydig cells. 
Possible involvement in cell survival. 

Rey C Maaduif C Naureils O , Benahmcd M , LouisoMP, Gasnier F . 
INSERM U. 189, Faculte de Medecine Lyon-Sud, BP12, 69921 cedex, Oullins, France- 
Porcine Leydig cells in primaiy cultures are resistant to tumor necrosis factor alpha 
(TNFalpha) cytotoxicity. Here we report that these cells can be rendered sensitive to 
TNFalpha killing by treatment with the translational inhibitor cycloheximide, suggesting 
the existence of proteins that can suppress the death stimulus induced by the cytokine. In 
search of these cytoprotective proteins, we focused on the constituents of the 
mitochondrial permeability transition pore (PT pore), whose opening has been shown to 
play a critical role in the TNFalpha-mediated death pathway. We found that TNFalpha 
up-regulated mRNA and protein expression of the mitochondrial peripheral 
benzodiazepine receptor (PBR), an outer membrane-derived constituent of the pore. A 
strong correlation was established between the resistance of the cells to TNFalpha killing 
and the density of PBR-binding sites. Concomitantly, TNFalpha down-regulated Bcl-2 
mRNA and protein expression. As Bcl-2 has been shown to be an endogenous inhibitor 
of the PT pore, we hypothesize that the TNFalpha-induced up-regulation of PBR 
expression may compensate for the decrease in Bcl-2 levels to prevent the opening of the 
PTpore. 
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GLUTl messenger RNA and protein induction relates to the malignant 
transformation of cervical cancer. 

Rudtovvski C , Becker AJ, Schroder W. Rath W, Buttner R . Moser M . 

Dept of Gynecology and Obstetrics, University Hospital Heidelberg, Vossstr 7-9, D- 
691 15 Heidelberg, Germany. 

We studied whether induction of glucose transporters (GLUTs) I to 4 correlates with 
human papillomavirus (HPV)-dependent malignant transformation of cervical epithelium. 
Tissue samples of cervical intraepithelial neoplasia (CtN; grades 1 to 3), invasive 
carcinomas, and lymph node metastasis were examined. HPV typing was performed. 
Tissue sections were immunostained with GLUTl to GLUT4 antibodies. Messenger 
RNA (mRNA) in situ hybridization confirmed GLUTl protein expression. Weak 
expression of GLUTl was found in nondysplastic HPV-positive and HPV-negative 
epithelium; significant expression was observed in preneoplastic lesions, correlating with 
the degree of dysplasia. In CJN 3 high-risk HPV lesions, cervical cancer, and metastasis, 
GLUTl was expressed at highest levels with a strong correlation of GLUTl mRNA and' 
protein expression. Imrriunostains for GLUT2 to GLUT4 were negative. Cervical rumor 
cells respond to enhanced glucose utilization by up-regulation of GLUTl . The strong 
induction of GLUTl mRNA and protein in HPV-positive CIN 3 lesions suggests GLUTl 
overexpression as an early event in cervical neoplasia. GLUTl is potentially relevant as a 
diagnostic tool and glucose metabolism as a therapeutic target in cervical cancer. 
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Expression aad distribution of laminin alphal and alpha2 chains in 
embryonic and adult mouse tissues: an immunochemical approach. 

SasaMX GiUayjt, TaJtsU, Tjnj^TR, TalfsJF. 

Max-Planck-Institute for Biochemistry, Martinsried, D-82 1 52, Germany. 

Protein levels, mRNA expression, and localization of laminin alphal and alpha2 chains in 
development and in adult mice were examined. Recombinant fragments were used to 
obtain high-titer-specific polyclonal antibodies for establishing quantitative 
radioimmuno-inhibition assays. This often demonstrated an abundance of alpha2 chain, 
but also distinct amounts of alpha 1 chain for adult tissues. The highest amounts of alphal 
were found in placenta, kidney, testis, and liver and exceeded those of alpha2. All other 
tissue extracts showed a higher content of alpha2, which was particularly high in heart 

. and muscle when compared to alphal . Content of gamma 1 chain, shared by most 
laminins, was also analyzed. This demonstrated gammal chain levels being equal to or 

. moderately exceeding the sum of alphal and alpha2 chains, indicating that these isoforms 
represent the major known laminin isoforms in most adult mouse tissues so far examined. 
Moreover, we found good correlation between radioimmuno-inhibition data and mRNA 
levels of adult tissues as measured by quantitative real-time reverse transcriptase-PCR. 
Embryonic tissues were also analyzed by radioimmuno-inhibition assays. This 
demonstrated for day 1 1 embryos comparable amounts of alphal and gammal and a 
more than 25-fold lower content of alphaZ This content increased to about .10% of 
alphal in day 13 embryos. The day 18 embryo showed in heart, kidney, and liver, but not 
yet in brain and lung, alphal /alpha2 chain ratios comparable to those in adult tissues. 
Immunostaining demonstrated alphal in Reichert's membrane (day 7.5), while alpha2 
could not be detected before day 11.5. These data were compared with 
immunohistochemical localization results on several more embryonic and adult tissue 
sections. Our results regarding localization are consistent with those of earlier work with 
some notable exceptions. This was in part due to epitope masking for monoclonal 
antibodies commonly used in previous studies in esophagus, intestine, stomach, liver, 
kidney, and spleen. 
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Discordant regulation of granzyme H and granzyinc B expression in 
human lymphocytes. 

Sedelies KA, Savers TJ, Edwards KM , Chen W. Pctlicci DG / Godfrey PI , Trapani 
J A. . 

Cancer Immunology Laboratory, Peter MacCallum Cancer Centre, Locked Bag 1, 
A'Beckett Street, East Melbourne, 8006, Australia. 

We analyzed the expression of granzyme H in human blood leukocytes, using a novel 
monoclonal antibody raised against recombinant granzyme H. 33-kDa granzyme H was 
easily detected in un fractionated peripheral blood mononuclear cells, due to its high 
constitutive expression in CD3(-)CD56(+) natural killer (NK) cells, whereas granzyme B 
was less abundant. The NK lymphoma cell lines, YT and Lopez, also expressed high 
granzyme H levels. Unstimulated CD4(+) and particularly CD8(+) T cells expressed far 
lower levels of granzyme H than NK cells, and various agents that classically induce T 
cell activation, proliferation, and enhanced granzyme B expression failed to induce 
granzyme H expression in T cells. Also, granzyme H was not detected in NK T cells, 
monocytes, or neutrophils. There was a good correlation between mRNA and protein 
expression in cells that synthesize both granzymes B and H, suggesting that gzmH gene 
transcription is regulated similarly to gzmB. Overall, our data indicate that although the 
gzmB and gzmH genes are tightly linked, expression of the proteins is quite discordant in. 
T and NK cells. The finding that granzyme H is frequently more abundant than granzyme 
B in NK cells is consistent with a role for granzyme H in complementing the pro- 
apoptotic function of granzyme B in human NK cells, 
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BCL2 protein expression parallels its mRNA level in normal and 
malignant B cells. 



Shen Y , Iqbal J, Huang JZ , Zhou G , Chan WC . 



Department of Patliology and Microbiology, University of Nebraska Medical Center, 
Omaha, USA. 

The regulation of B-cell lymphoma 2 (BCL2) protein expression in germinal center (GC) 
B ceils has been controversial. Previous reports have indicated posttranscriptional 
regulation plays a dominant role. However, a number of recent studies contradicted these 
reports. Using real-time polymerase chain reaction (PCR) and Standardized Reverse 
Transcriptase-PCR (StaRT-PCR), we measured the level of mRNA expression in GC, 
mantle zone (MNZ), and marginal zone (MGZ) cells from laser capture microdissection. 
Both quantitative RT-PCR measurements of rnicrodissected GC cells from tonsils 
showed that GC cells had low expression of BCL2 transcripts commensurate with the 
low protein expression level. These results are in agreement with microarray studies on 
fluorescence-activated cell sorter (FACS)-sorted cells and rnicrodissected GC cells. We 
also examined BCL2 mRNA and protein expression on a series of 30 cases of diffuse 
large B-cell lymphoma (DLBCL) and found, in general, a good correlation. The results 
suggested that BCL2 protein expression is regulated at the transcriptional level in normal 
B cells and in the neoplastic cells in most B-cell lymphoproliferative disorders. 
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Quantitative determinations of the steady state transcript levels of 
hexokinase isozymes and glucose transporter isoforms in normal rat 
tissues and the malignant tumor cell line AH130! 

Shinohara Y, Yamamoto K. Inoo K . Yamazaki N. Terada H . 

Faculty of Pharmaceutical Sciences, University of Tokushima, Japan. 
yasuo@ph.tolcushima-u.ac.jp 

The steady state transcript levels of the four hexokinase (HK) isozymes and four glucose 
transporter (GLUT) isoforms were determined quantitatively by Northern analysis of 
RNA samples from rat tissues using synthetic fragments of the RNAs encoding the HK 
isozymes and GLUT isoforms. Results showed that the levels of HK isozyme transcripts 
were low in rat tissues, the level of that most highly expressed, the type I isozyme (HKl) 
m the brain being 0.025% of the total poIyfA)* RNA. A good correlation was found ' 
between the reported HK activities and the total amounts of transcripts encoding all HK 
isozymes in various tissues, showing that the HK activities in tissues can be estimated 
from the total amount of transcripts encoding HK isozymes. The proposed associated 
expresses of HK isozymes and GLUT isoforms in particular tissues were confirmed at 
thar transenpt levels. The steady state transcript levels of type II HK and the type 1 
GLUT isoform in the malignant tumor cell line AH1 30 were also determined 
quantitatively. . 

PMID: 9459591 [PubMed - indexed for MEDLINE] 



158: Photochem PhotobioL 2002 Mar;75(3):302-10. Related Articles. Links 

s^Sflcrt Press, IncSe*- 

UVA irradiation-induced activation of activator protein-1 is correlated 
with induced expression of AP-1 family members in the human 
keratinocyte cell line HaCaT. 

Silvers AL, Bowden GT . 

Department of Radiation Oncology, Arizona Cancer Center, The University of Arizona, 
Tucson 85724, USA. 

To determine whether the transcription factor activator protein- 1 ( AP- 1 ) could be 
modulated by ultraviolet A (UVA) exposure, we examined AP-I DNA-binding activity 
and transactivation after exposure to UVA in the human immortalized keratinocyte cell . 
line HaCaT. Maximal AP-I transactivation was observed with 250 kJ/m2 UVA between 
3 and 4 h after irradiation. DNA binding of AP-1 to the target 12-0- 
tetradecanoylphorbol-13-acetate response element sequence was maximally induced 1-3 
h after irradiation. Both de novo transcription and translation contributed to the UVA- 
induced AP-1 DNA binding. c-Fos was implicated as a primary component of the AP-1 
DNA-binding complex. Other components of the complex included Fra-2, c-Jun, JunB 
and JunD. UVA irradiation induced protein expression of c-Fos, c-Jun, Fra-1 and Fra-2. 
Phosphorylated forms of these induced proteins were determined'at specific time points. 
A strong correlation existed between UVA^induced AP-1 activity and accumulation of c- 
Fos, c-Jun and Fra-1 proteins. UVA irradiation also induced c-fos and c-jun mRNA 
expression and transcriptional activation of the c-fos gene promoter. These results 
demonstrate that UVA irradiation activates AP- 1 and that c-fos induction may play a 
critical role in the response of these human keratinocytes to UVA irradiation. 
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Rat kidney glutamyl aminopeptidase (aminopeptidase A): molecular 
identity and cellular localization. 

SongJL, Ye M , Troyanovskava M. WilkE, Wilk S . Healy DP . 

Department of Pharmacology, Mount Sinai School of Medicine, City University of New 
. York, New York 10029. 

Glutamyl aminopeptidase [aminopeptidase A (EAP), EC 3.4. 11 .7] is an ectoenzyme that 
selectively hydrolyzes acidic ami no acid residues from the amino terminus of 
oligopeptides. EAP activity is highest within the kidney and small intestine. The murine 
pre-B cell BP-1/6C3 and the human kidney glycoprotein gpl<50 differentiation antigens 
have been reported to have biochemical properties indistinguishable from EAP. It is not 
known, however, if rat kidney EAP is a homologue of these antigens or molecularly 
distinct Using the reverse transcription-polymerase chain reaction method with 
oligonucleotide primers based on the BP-1/6C3 nucleotide sequence, we isolated a 450- 
bp partial cDNA from rat kidney poly(A)-f UNA The partial cDNA encoded a predicted 
protein that was 92% and 86% identical to the murine BP-1/6C3 and human gpl60 
antigens, respectively; the amino acid sequence within the zinc-binding domain was 
completely conserved. Purification of EAP from rat kidney and microsequence analysis 
of a tryptic digest peptide fragment ( 1 8-mer) indicated that the fragment was highly 
similar to a region within the BP-1/6C3 and gpl60 proteins. Northern blot hybridization 
and immunoblot analyses were also consistent with labeling of products the same size as 
reported for the BP-1/6C3 and gpI60 antigens. There was a good correlation between the 
cellular distribution of EAP mRNA and EAP immunoreactivity, with proximal tubules 
and glomerular mesangial cells having the highest densities. These results indicate that 
rat kidney EAP is a species homologue of the murine BP- 1/6C3 and human gp 1 60 
antigens. Furthermore, on the basis of its cellular localization, rat kidney EAP is likely to 
be involved in degradation of oligopeptides within the glomerulus and the glomerular 
filtrate. Since cells that express EAP also express receptors for angiotensin II, an 
intrarenal vasoactive hormone that is a substrate for EAP, these results further suggest 
that EAP may play a role in modulating the activity of intrarenal angiotensin IL 
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Tumor necrosis factor-alpha uprcgulates the prostaglandin E2 EP1 
receptor subtype and the cycIooxygcnasc-2 isoform in cultured amnion 
WISH cells: 

Spaziani EP , Bcnoit RR , Tsibris JC , Gould SF , O'Driea WF . 

University of South Florida Health Science Center, Department of Obstetrics 8c 
Gynecology, Tampa 33612, USA. espazian@comLmed.usf.edu 

Recent studies have demonstrated a strong correlation between infection and preterm 
labor. Preterm delivery is also associated with high levels of cytokines and prostaglandins 
in amniotic fluid. The purpose of this study was to investigate the effect of tumor necrosis 
factor-alpha (TNF-alpha) on the levels of cyclooxygenase, prostaglandin E2 production 
(PGE2), and expression of the PGE2 receptor subtype EP 1 in amnion WISH cell culture. 

. Amnion WISH cell cultures were incubated in increasing concentrations of TNF-alpha 
(0-50 ng/ml). Changes in cyclooxygenase and EPl receptor proteins were evaluated by 
Western blot analysis. Changes in EPl mRNA were evaluated by Northern blot, and 
culture fluid concentrations of PGE2 were estimated by en2yme immunoassay (ElA). 

-EPl protein (p<0.01), EPl mRNA (p<0.05), cyctooxygenase-2 (COX-2) protein 
(pO.OOl), and PGE2 concentrations (p<0.01) all increased with increasing 
concentrations of TNF-alpha. Changes in COX-1 protein were not observed following 
TNF-alpha-incubation. The results suggest that TNF-alpha may play a role in infection- 
induced preterm labor by its pleiotropic ability to simultaneously stimulate COX-2 
activity, PGE2 concentrations, and PGE2 EPl receptor levels in human amnion; 
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Transcriptional activity of potent glucocorticoids: relevance of 
glucocorticoid receptor isoforms and drug metabolites. 

Sptka 1, Hammer S , Klcuser B , Korting HC Schafer-Korting M . 

; Institut fur Pharmazie, Abteilung for Pharmakologie und Toxikologie, Freie Universitat 
Berlin, Berlin, Germany. 

As compared to standard glucocorticoids (GC), prednicarbate (PC) is favorable in the 
treatment of eczema due to its high benefit/risk ratio. The remarkable anti-inflammatory 
effects of PC are in strong contrast to its reported low glucocorticoid receptor (GR) 
binding affinity, fn transfected COS-7 cells we related the transcriptional potencies of 
PQ its metabolites and conventional GC to their receptor binding properties. Moreover, 
the expression pattern of the human GR isoform hGRalpha and its mutual dominant 
negative inhibitor hGRbeta in skin cells have been investigated as well as the influence of 
hGRbeta on receptor binding and transactivation. hGRalpha mRNA and protein was 
largely overexpressed in skin cells. hGRbeta showed no influence on hGRalpha binding 
and transactivation. Concentration response curves indicated the greater transactivation 
potency of betamethasone 17-valerate followed by dexamethasone and prednisolone 17- 
ethylcarbonate. Native PC appeared almost as potent as dexamethasone. With both a 
strong correlation was observed between transactivation and GR binding. Copyright 2003 
S. FCarger AG, Basel 
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Specific inhibition of AQP1 water channels in isolated rat intrahepatic bile 
duct units by small interfering RNAs. 

Splinter PL , Masyuk AI , LaRusso NF . 

Center for Basic Research in Digestive Diseases, Division of Gastroenterology and 
Hematology, Mayo Medical School, Clinic, and Foundation, Rochester Minnesota 55905 
USA. ' 

Cholangiocytes express water channels (i.e. aquaporins (AQPs)), proteins that are 
increasingly recognized as important in water transport by biliary epithelia. However, 
direct functional studies demonstrating AQP-mediated water transport in cholangiocytes 
are limited, in part because of the lack of specific AQP inhibitors. To address this issue, 
we designed, synthesized, and utilized small interfering RNAs (siRNAs) selective for 
AQP1 and investigated their effectiveness in altering AQP I -mediated water transport in 
intrahepatic bile duct units (IBDUs) isolated from rat liver. Twenty-four hours after 
trans fection of IBDUs with siRNAs targeting two different regions of the AQP 1 
transcript, both AQP1 mRNA and protein expression were inhibited by 76.6-92.0 and 
57.9-79.4%, respectively. siRNAs containing the same percent of base pairs as the 
AQP 1 -siRNAs but in random sequence (i.e. scrambled siRNAs) had no effect. 
Suppression of AQP 1 expression in cholangiocytes resulted in a decrease in water 
transport by IBDUs in response to both an inward osmotic gradient (200 mosm) or a 
secretory agonist (forskolin), the osmotic water permeability coefficient (P(f)) decreasing 
up to 58.8% and net water secretion (J(v)) decreasing up to 87%. A strong correlation 
between AQP I protein expression and water transport in IBDUs transfected with AQP1- 
siRNAs was consistent with the decrease in water transport by IBDUs resulting from 
AQP1 gene silencing by AQP 1 -siRNAs. This study is the first to demonstrate the 
feasibility of utilizing siRNAs to specifically reduce the expression of AQPs in epithelial 
cells and provides direct evidence of the contribution of AQP 1 to water transport by 
biliary epithelia. 
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Type IV collagenase (M(r) 72,000) expression in human prostate: benign and 
malignant tissue. 

Stearns ME , Wang ML 

Department of Pathology, Medical College of Pennsylvania, Philadelphia 19129. 

The expression of type IV collagenase (M(r) 72,000) has been examined in tissues from patients 
with benign prostatic hyperplasia (6 patients) and varying Gleason grades of malignant prostate 
cancer (1 8 patients). Immunoperoxidase labeling indicated that expression of the type IV 
collagenase was weak or nonexistent in benign tissue but consistently strong in the glandular and 
ductal epithelial cells of prostate tumors diagnosed at Gleason grades 1-8. In moderate to 
advanced cancer (i.e., Gleason grades 2 to 8), invasive tumor foci in the stromal tissue produced 
relatively modest amounts of type IV collagenase. The normal stromal tissue (i.e., fibroblasts) 
uniformly failed to produce detectable levels of type IV collagenase in the 24 patients examined. 
Northern and quantitative slot blot hybridization assays demonstrated that collagenase type IV 
mRNA levels were low in benign tissue and high in malignant tumors. In contrast, the stromal 
cells did not express significant amounts of type IV collagenase mRNA. Enzyme-linked 
immunosorbent assays demonstrated that the amounts of type IV collagenase protein correlated 
directly with the mRNA levels in the tumor tissue. The studies suggest that type IV collagenase 
may be selectively overexpressed by malignant, preinvasive prostatic epithelial cells. 
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The decompensated detrusor III: impact of bladder outlet obstruction on 
sarcoplasmic endoplasmic reticulum protein and gene expression. 

Stein R , Gong C. Hutcheson JC . Canning DA . Zderic SA . 

Division of Urology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, 
USA, 

PURPOSE: Regulation of calcium ion homeostasis has a significant role in smooth 
muscle contractility. The sarcoplasmic endoplasmic reticulum, calcium, magnesium, 
adenosine triphosphatase (SERCA) is a regulatory ion pump that may have a role in the 
functional outcome after outlet obstruction. We investigate what correlation if any existed 
between SERCA protein and gene expression, and the contractile properties in the same 
bladder. MATERIALS AND METHODS: Standardized partial bladder outlet 
obstructions were created in adult New Zealand white rabbits, which were divided into, 
control, sham opferated and obstructed groups. Muscle strip studies subcategorized the 
obstructed group into compensated (force greater than 50% of control) and 
decompensated (force less than 50% of control). Microsomal membrane and total RNA 
fractions were prepared from the same bladder tissue. Membrane proteins were used for 
. Western blot analysis using a SERCA specific monoclonal antibody, and total RNA was 
assessed with Northern blot analysis. RESULTS: The relative intensities of signals for 
the Western and Northern blots demonstrated a strong correlation between protein and 
gene expression. Furthermore there was a strong association between the loss of SERCA 
messenger RNA and protein expression and loss of bladder function. CONCLUSIONS: 
Bladder contractility after outlet obstruction is influenced in part by smooth muscle cell 
ability to maintain calcium homeostasis via SERCA. The loss of SERCA protein 
expression is mediated by down-regulation in gene expression in the same bladder. These 
data suggest that smooth muscle ion pump gene expression is in part mechanically 
(pressure work) regulated. 
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TNF-alpha and IL-8 arc upregulated in the epidermis of normal human 
skin after UVB exposure: correlation with neutrophil accumulation and E- 
selectin expression. 

Strickland t , Rhodes LE , Flanagan BF, Friedmann PS . 

Department of Dermatology, University of Liverpool, United Kingdom. 

Hie in vivo response to ultraviolet B (UVB) radiation in skin is characterized by the 
accumulation of both mononuclear and polymorphonuclear cells within the dermis and an 
induction of vascular endothelial adhesion molecules. Epidermal production of cytokines 
(IL-8 and TNF-alpha) has been strongly implicated in the development of UVB-induced 
inflammation. In the current study, we examined the time course of IL-8 and TNF-alpha 
mRNA and protein expression in the epidermis over a 24 -h period after in vivo UVB 
irradiation. Also, the induction of adhesion molecule expression and the accumulation of 
neutrophils within the dermis were followed- We found constitutive expression of both 
cytokines (mRNA and protein) in the epidermis of unirradiated skin. IL-8 was rapidly 
upregulated after irradiation and mRNA and protein increased at 4 h, reaching a 
maximum between 8 and 24 h. TNF-alpha mRNA and protein was minimally increased 
by 8 h after UVB irradiation and reached a maximum by 24 h. No significant alteration in 
ICAM-1 or VCAM-l expression was observed. E-selectin expression, which was absent 
from control samples, was increased from 4 h onward, and also reached a maximum at 24 
h, coinciding with peak neutrophil accumulation. A strong correlation (r ~ 0.96) was 
found between number of E-selectin-positive vessels and numbers of infiltrating 
neutrophils at this time. Moreover, because E-selectin expression was increased before 
any apparent increase in TNF-alpha protein (4 h), TNF-alpha does not appear to be 
involved in the early induction of the adhesion molecule, but cytokines such as TNF- 
alpha and IL-8 may act subsequently to augment the inflammatory response. 
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Basic fibroblast growth factor expression is increased in human renal 
fibrogenesis and may mediate autocrine fibroblast proliferation. 

Strut? F, Zeisberg M , Bemmerlem B , Sattler B, Hummel K Becker V ; MuI)er GA . 

Department of Nephrology and Rheumatology, Georg- August-University Gottingen, 
Germany, fstrutz@gwdg.de 

BACKGROUND: Interstitial fibroblasts play a critical role in renal fibrogenesis, and 
autocrine proliferation of these cells may account for continuous matrix synthesis. Basic 
fibroblast growth factor (FGF-2) is mitogenic for most cells and exerts intracrine, 
autocrine, and paracrine effects on epithelial and mesenchymal cells. The aims of the 
present studies were to localize and quantitate the expression of FGF-2 in normal and 
pathologic human kidneys and to study the in vitro effects of FGF-2 on proliferation, 
differentiation, and matrix production of isolated cortical kidney fibroblasts. METHODS: 
FGF-2 protein expression was localized by immunpfluoresence double labelings in 
normal and fibrotic human kidneys. Subsequently, interstitial FGF-2 labeling was 
determined semiquantitative^ in 8 normal kidneys and 39 kidneys with variable degrees 
of interstitial fibrosis and was correlated with the morphometrically determined 
interstitial cortical volume. In addition, FGF-2 expression was quantitated by immunoblot 
analysis in three normal and six fibrotic kidneys. FGF-2 rnRNA was localized by in situ 
hybridizations. Seven primary cortical fibroblast lines were established, and expression of 
FGF-2 and FGF receptor-1 (FGFR-I) were examined. The effects of FGF-2 on cell 
proliferation were determined by bromodeoxyuridine incorporation and cell counts, those 
on differentiation into myofibroblasts by staining for alpha-smooth muscle actin, and 
those on matrix synthesis by enzyme-linked immunosorbent assay for collagen type I and 
fibronectin. Finally, proliferative activity in vivo was evaluated by expression of M1B-1 
(Ki-67 antigen). RESULTS: In normal kidneys, FGF-2 expression was confined to 
glomerular, vascular, and a few tubular as well as interstitial fibroblast-Iike cells. The 
expression of FGF-2 protein was increased in human kidneys, with rubuIointerstitiaJ 
scarring correlating with the degree of interstitial fibrosis (r = 0.84, P < 0.01 ). 
Immunoblot analyses confirmed a significant increase in FGF-2 protein expression in 
kidneys with interstitial scarring. In situ hybridization studies demonstrated low-level 
detection of FGF-2 rnRNA in normal kidneys. However, FGF-2 rnRNA expression was 
robustly up-regulated in interstitial and tubular cells in end-stage kidneys, indicating that 
these cells are the source of excess FGF-2 protein. Primary cortical fibroblasts express 
FGF-2 and FGFR-1 in vitro. FGF-2 induced a robust growth response in these cells that, 
could be blocked specifically by a neutralizing FGF-2 antibody. Interestingly, the 
addition of the neutralizing antibody alone did reduce basal proliferation up to 31.5%. In 
addition, FGF-2 induced expression of alpha-smooth muscle actin up to ] .6-fold, but no 
significant effect was observed on the synthesis of collagen lype 1 and fibronectin. 
Finally, staining for MJB-J revealed a good correlation of interstitial FGF-2 positivity 



with interstitial and tubular proliferative activity (r = 0.71 , P < 0.01 for interstitial 
proliferation, N = 30). CONCLUSIONS; Interstitial FGF-2 protein and mRNA 
expression correlate with interstitial scarring. FGF-2 is a strong mitogen for cortical 
kidney fibroblasts and may promote autocrine fibroblast growth. Expression of FGF-2 
correlates with interstitial and tubular proliferation in vivo. 
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Adiposity elevates plasma MCP-1 levels leading to the increased CD1 lb- 
positive monocytes in mice, 

TakahashiK Mizuarai S , Araki H , MashikoS, Ishihara A, Kanacani A , Itadani Ij 
Kotani H . 

Banyu Tsukuba Research Institute in collaboration with Merck Research Laboratories, 
Tsukuba, rbaxaki 300-26 1 1 , Japan. 

Obesity is currently considered as an epidemic in the western world, and it represents a 
major risk factor for life-threatening diseases such as heart attack, stroke, diabetes, and 
cancer. Taking advantage of DNA microarray technology, we tried to identify the 
molecules explaining the relationship between obesity and vascular disorders, comparing 
mRNA expression of about 12,000 genes in white ad ipose tissue between normal, high 
fat diet-induced obesity (DIO) and d-Trp34 neuropeptide Y-induced obesity in mice. 
Expression of monocyte chemoattractant protein-1 (MCP-1) mRNA displayed a 7.2-fold 
increase in obese mice as compared with normal mice, leading to substantially elevated 
MCP-1 protein levels in adipocytes. MCP-l levels in plasma were also increased in DIO 
mice, and a strong correlation between plasma MCP-1 levels and body weight was 
identified. We also showed that elevated MCP-1 protein levels in plasma increased the 
CD 1 lb-positive monocyte/macrophage population in DIO mice. Furthermore, infusion of 
MCP-1 into lean mice increased the CD 1 lb-positive monocyte population without 
inducing changes in body weight. Given the importance of MCP-l in activation of 
monocytes and subsequent atherosclerotic development, these results suggest a novel role 
of adiposity in the development of vascular disorders. 
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Augmented expression of neuronal nitric oxide synthase in the atria 
parasympathetically decreases heart rate during acute myocardial 
infarction in rats. 

Takimoto Y, AoyamaT , Tanaka K , Keyamura R , Yui Y , Sasayama S . 

Department of Cardiovascular Medicine, Graduate School of Medicine, Kyoto 
University, Kyoto, Japan. 

BACKGROUND; Nitric oxide (NO) synthesized within sinoatrial cells recently has been 
shown to participate in the autonomic control of heart rate. We hypothesized that NO in 
the neuronal cells in the heart was increased and parasympathetically regulated heart rate 
after myocardial infarction (MI). METHODS AND RESULTS: We examined heart rate 
dynamics and neuronal HQ synthase (nNOS) expression and activities in the atria of rats . 
with MI 1, 3, 7, and 14 days after MI (n=7 to 22 for each group). Both the mRNA levels 
of nNOS in the atria determined by competitive reverse transcriptase-polymerase chain 
reaction and the protein levels determined by Western blotting were significantly 
increased compared with controls 1, 3, and 7 days after ML nNOS activity in the atria 1 
day after infarction was also increased in MI rats. nNOS immunoreacti vity was observed 
in nerve fibers in the atria. After infusion of a specific inhibitor of nNOS and iNOS, l-(2- 
trifluoromethylphenyl) imidazole (TRIM) (50 mg/kg IV), heart rate was significantly 
(P<0.01) increased in MI rats compared with controls I, 3, and 7 days after ML The 
iNOS-specific inhibitor, 1400W (10 mg/kg SC), did not sijgnificantly affect the heart rate 
in rats with ML The effect of TRIM was abolished by pretreatment with L-arginine (25 
mg/kg TV) or by parasympathetic blockade with atropine but not by propranolol. There 
was a strong correlation (r=0.837, PO.0001) between the nNOS protein expression and 
heart rate change after TRIM infusion. CONCLUSIONS: These results indicate that . 
increased nNOS parasympathetically decreased heart rate via the production of NO in 
rats with acute ML 
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Differential upregulation of cellular adhesion molecules at the sites of 
oxidative stvess iu experimental acute pancreatitis. 

Tclek G , Ducroc R , Scoazcc JY ; Pasquicr C Feldrnann G . Roze C . 

INSERM U 410, Universite Paris 7 Denis Diderot, 75870 Paris, France. 

BACKGROUND: Severe acute pancreatitis (AP)(2) is associated with exaggerated 
leukocyte adherence and activation. Endothelial cellular adhesion molecules (CAMs) can 
be induced by cytokines, but also directly by oxygen free radicals (OFRs), mediated by 
nuclear factor kappa-B (NF-kappa B). We investigated the behavior of inducible CAMs 
in relation to pancreatic oxidative stress. Our novel modification of cerium capture 
histochemistry (reaction of OFRs with cerium produces laser reflective Ce perhydroxide 
precipitates) combined with reflectance confocal laser scanning microscopy (CLSM) 
allows the histological codemonstration of in vivo OFR production and immunolabeled 
CAMs, or NF-kappa B. METHODS; Taurocholate AP was induced in rats; sham 
operated and normal animals served as controls. To achieve in situ, in vivo reaction of 
cerium with OFRs, animals were perfused with CeCl(3) solution at different time points 
(1, 2, 8, 24 h) and then sacrificed. E-selectin, P-selectin, ICAM-1, VCAM, and NF-kappa 
B p65 were labeled by immunofluorescence (IF) on frozen sections of cerium perfused 
pancreata. IF and Ce perhydroxide reflectance were simultaneously detected by CLSM. 
Pancreatic gene expression of the same CAMs was quantified by competitive RT-PCR 
(MIMIC internal control). RESULTS: Control pancreata showed negligible reflectance 
and minimal CAM expression. Early (1, 2 h) AP samples were characterized by intense, 
heterogeneous acinar OFR production, strong P-selectin, and increasing ICAM 
expression, with nuclear translocation of p65, histologically all colocalizing with the 
areas of acinar oxidative stress. Adherent polymorphonuclear leukocytes (PMNs) 
displayed weak OFR formation. Later (8, 24 h), a slowly declining P-seleetin, but 
persisting ICAM-1 expression, was paralleled by widespread adherence of PMNs 
producing surprisingly large amounts of OFRs. VCAM and E-selectin showed a mild 
increase at 24 h. CAM gene activation was in good correlation with the protein 
expression. CONCLUSIONS: The early acinar oxidative stress is colocalized with NF- 
kappa B activation, preferential P-selectin, and ICAM upregulation in this AP model. 
Subsequently, adherent, activated PMNs become the major source of OFRs, thereby 
contributing to tissue damage. Copyright 2001 Academic Press. 
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Myotonic dystropliy: an unstable CTG repeat in a protein kinase gene. 
Timchenko L , Monckton DG , Caskey CT . 

Department of Molecular and Human Genetics, Baylor College of Medicine, Texas 
Medical Center, Houston 77030, USA. 

Myotonic dystrophy (DM) is caused by the amplification of CTG repeats in the 3' 
untranslated region of a gene encoding a protein homologous to serine/threonine protein 
kinases. In DM patients the CTG repeats are extremely unstable, varying in length from 
patient to patient and generally increasing in length in successive generations. There is a 
strong correlation between the size of the repeats and the age of onset and severity of the 
disease. The molecular basis of the effect of the CTG expansion on the development of 
the DM phenotype continues to be investigated. The first working hypothesis of the 
molecular mechanism of DM was a reduction in steady-state myotorun-protein kinase 
(Mt-PKL) mRNA and protein levels. However, although the consensus finding is that the 
Mt PKL mRNA and protein levels are decreased in DM patients, it is still not clear if this 
reduction leads directly to the DM phenotype. In this short review we discuss the 
molecular aspects of CTG instability and the expression of the myotonin-protein kinase 
gene in normal and DM populations. 

Publication Types: 

• Review 

• Review, Tutorial 



PMID: 7620 11 7 [PubMed - indexed for MEDLINE] 



480: Chem Biol Interact. 1992 Aug 14;83(2):107-19. 



Related Articles, Links 



Induction of class 3 aldehyde dehydrogenase in the mouse hepatoma cell 
line Hepa-1 by various chemicals. 

Torronen R , Korkalainen M . Karenlampi SO . 

Department of Physiology, University of Kuopio, Finland. 

The mouse hepatoma cell line Hepa-l was shown to express an aldehyde dehydrogenase 
(ALDH) isozyme which was inducible by TCDD and carcinogenic polycyclic aromatic 
hydrocarbons. The induced activity could be detected with benzaldehyde as substrate and 
NADP as cofactor (B/NADP ALDH). As compared with rat liver and hepatoma cell 
lines, the response was moderate (maximally 5-fold). There was an apparent correlation 
between this specific form of ALDH and aryl hydrocarbon hydroxylase (AHH) in the 
Hepa-1 wild-type cell line-in terms of inducibility by several chemicals. However, the 
magnitude of the response was clearly smaller for ALDH than for AHH. Southern blot 
analysis showed that a homologous gene (class 3 ALDH) was present in the rat and 
mouse genome. The gene was also expressed in Hepa-1 and there was a good correlation 
between the increase of class 3 ALDH-specific mRNA and B/NADP ALDH enzyme 
activity after exposure of the Hepa-1 cells to TCDD. It is concluded that class 3 ALDH is 
inducible by certain chemicals in the mouse hepatoma cell line, although the respective 
enzyme is not inducible in mouse iiver in vivo. 

PMED: 1505055 [PubMed - indexed for MEDLINE] 



91: Leuk Res. 2003 Dec;27(12);l 1 15-23. Related Articles. Links 

Relationship between cyclin Dl and p21(Wafl/Cipl) during differentiation 
of human myeloid leukemia cell lines. 

UHmannova V , Stockbauer P , Hradcova M » Soncek J , Haskovec C . 

Department of Molecular Genetics, Institute of Hematology and Blood Transfusion, U 
Nemocnice 1, 128 20 Prague 2, Czech Republic, ullman@ulikt.cz 

Expression of cell cycle-regulating genes was studied in human myeloid leukemia cell 
lines ML-1, ML-2 and ML-3 during induction of differentiation in vitro/ 
Myelomonocytic differentiation was induced by phorbol ester (12-o-Tetradecanoyl- 
phorbol- 13-acetate, TP A), tumor necrosis factor alpha (TNFalpha) or interferon gamma 
(INFgamma), or their combination. Differentiation (with the exception of TNFalpha 
alone) was accompanied by inhibition of DN A synthesis and cell cycle arrest. Inhibition 
of proliferation was associated with a decrease in the expression of cdc25A and cdc25B, 
cdk6 and Ki-67 genes, and with increased p21(Wafl/Cipl) gene expression, as measured 
by comparative RT-PCR. Expression of the following genes was not changed after 
induction of differentiation: cyclin Al, cyclin D3, cyclin El and p27(Kipl). Surprisingly, 
cyclin Dl expression was upregulated after induction by TP A, TNFalpha with 
IFNgamma or BA. Cyclin D2 was upregulated only after induction by BA. The results of 
the expression of the tested genes obtained by comparative RT-PCR were confirmed by 
quantitative real-time (RQ) RT-PCR and Western blotting. Quantitative RT-PCR showed 
as much as a 288-fold increase of cyclin Dl specific mRNA after a 24h induction by % 
TP A. The upregulation of cyclin Dl in differentiating cells seems to be compensated by 
the upregulation of p2l(Wafl/Cipl).These results, besides others, point to a strong 
correlation between the expression of cyclin Dl and p2l(Wafl/Cipl) on the one hand 
and differentiation on the other hand in human myeloid leukemic cells and reflect a rather 
cpmplicated network regulating proliferation and differentiation of leukemic cells. 
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Intestinal carbamoyl phosphate synthase I in human and rat Expression 
during development shows species differences aud mosaic expression in 
duodenum of both species. 

Van Beers EH ; - Rings EH, Posthuma G . Dingemanse MA . Taminiau JA , Heymans 
HS, Eincrhand AW, Butler HA . DekkerJ. 

Pediatric Gastroenterology and Nutrition, Department Pediatrics, Emma Children's 
Hospital, Academic Medical Center, Amsterdam, The Netherlands. 

The clinical importance of carbamoyl phosphate synthase I (CPSI) relates to its capacity 
to metabolize ammonia, because CPSI deficiencies cause lethal serum ammonia levels: 
Although some metabolic parameters concerning liver and intestinal CPSI have been 
reported, the extent to which enterocytes contribute to ammonia conversion remains 
unclear without a detailed description of its developmental and spatial expression 
patterns. Therefore, we determined the patterns of enterocytic CPSI mRNA and protein 
expression in human and rat intestine during embryonic and postnatal development, using 
in situ hybridization and immunohJstochemistry. CPSI protein appeared during human 
embryogenesis.in liver at 31-35 e. d. (embryonic days) before intestine (59 e.d.), whereas 
in rat CPSI detection in intestine (at 16 e.d.) preceded liver (20 e.d.). During all stages of 
development there was a good correlation between the expression of CPSI protein and 
mRNA in the intestinal epithelium. Strikingly, duodenal enterocytes in both species 
exhibited mosaic CPSI protein expression despite uniform CPSI mRNA expression in the 
epithelium and the presence of functional mitochondria in all epithelial cells. Unlike rat, 
CPSI in human embryos was expressed in liver before intestine. Although CPSI was 
primarily regulated at the transcriptional level, CPSI protein appeared mosaic in the 
duodenum of both species, possibly due to post-transcriptional regulation. 
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Expression of deoxycytidine kinase in leukaemic cells compared with solid 
tumour cell lines, liver metastases and normal liver. 

van der Wilt CL V Kroep JR, Loves WJ , Rots MG; Van Groenineen CJ. Kaspcrs GJ, 
Peters GJ . 

Department of Medical Oncology, VU University Medical Center, Amsterdam, The 
Netherlands. 

Deoxycytidine kinase (dCK) is required for the phosphorylation of several 
deoxyribonucleoside analogues that are widely employed as chemotherapeutic agents. 
Examples include cytosine arabinoside (Ara-C) arid 2-chiorodeoxyadenosine (CdA) in 
the treatment of acute myeloid leukaemia (AML) and gemcitabine to treat solid tumours. 
In this study, expression of dCK mRNA was measured by a competitive template reverse 
transcriptase polymerase chain reaction (CT RT-PCR) in seven cell lines of different 
histological origin, 16 childhood and adult AML samples, 10 human liver samples arid 1 1 
human liver metastases of colorectal cancer origin. The enzyme activity and protein 
expression levels of dCK in the cell lines were closely related to the mRNA expression 
. levels (r=0.75, P=0.026 and r=0.86, P=0,007). In AML samples, dCK mRNA expression 
ranged from 1 .16 to 35.25 (x!0(-3)xdCK/beta-actin). In the cell line panel, the range was 
2.97-56.9 (xl0(-3)xdCK/beta-actin) of dCK mRNA expression. The enzyme activity in 
liver metastases was correlated to dCKmRNA expression (r=0.497, P=0.05). In the liver 
samples, these were not correlated. dCK mRNA expression showed only a 36-fold range 
in liver while a 1 50- fold range was observed in the liver metastases. In addition, dCK 
activity and mean mRNA levels were 2.5-fold higher in the metastases than in the liver 
samples. Since dCK is associated with the sensitivity to deoxynucleoside analogues and 
because of the good correlation between the different dCK measurements in rrialignant 
cells and tumours, the CT-RT PCR assay will be useful in the selection of patients that 
can be treated with deoxycytidine analogues. 
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Expression of cytokines and growth factors in human 
glomerulonephritides, 

Watdherr R, Noronha IL . Niemir Z , Kruger C , Stein H , Stumm G . 
Department of Pathology, University of Heidelberg, Germany. 

Numerous experimental studies point to the potential role of cytokines and growth factors 
in the pathogenesis of renal disease. However, from the various autocrine and paracrine 
mediators identified in- vitro and in animal models, so far only a few have been 
demonstrated in selected human glomerulopathies. We examined two types of 
glomerulonephritis (GN): extracapillary GN with anti-neutrophil cytoplasmic 
autoantibodies (ANCA), an example of an acute form of GN, and mesangial IgA GN, 
usually a chronic form of GN, with immunocytochemistry, in situ hybridization and the 
polymerase chain reaction. Normal renal tissue from tumour nephrectomies served as a 
control. In ANCA-positive GN with active renal lesions (crescents, glomerular and 
vascular necrosis), infiltrating mononuclear ceils in glomeruli and in the interstitium 
expressed interleukin (IL)-1 beta, tumour necrosis factor (TNF)-alpha, IL-2, interferon 
(IFN)-gamma, platelet-derived growth factor (PDGF) and transforming growth factor 
(TGF)-beta. Cytokine expression was also observed in activated resident cells, including . 
endothelial cells, capsular epithelial cells, smooth muscle cells of vessel walls,' fibroblasts 
and some tubular epithelial cells. In addition, we noted an increase in the cytokine and 
growth factor receptors TNF-R, IL-1 R type II, IL-2R, IFN-gamma R and PDGF beta-R! . 
In contrast, in mesangial IgA-GN, IL-1 beta, TNF-alpha, IFN-gamma and IL-2 were 
usually absent in glomeruli, Mesangial expansion in this disorder was accompanied by an 
increased expression of PDGF, PDGF beta-R, TGF-beta and IL-6 in mesangial areas. In 
both conditions a good correlation was observed between cytokine expression at the 
mRNA (in situ hybridization) and protein level (imjmunoGytochemishy).(ABSTRACT 
TRUNCATED AT 250 WORDS) 
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Malignant trausformatiou of the human endometrium is associated with 
overexpression of lactoferrin messenger RNA and protein. 

Walmer PIC Padin CJ , Wrona MA- . Heal y BE , Bentley RC Tsao MS . Kolder MF . 
McLachlan JA . Gray KD . 

Department of Obstetrics and Gynecology, Duke University Medical Center, Durham 
North Carolina 27710. 

In the mouse uterus, lactoferrin is a major estrogen-inducible uterine secretory protein, 
and its expression correlates directly with the period of peak epithelial cell proliferation. 
In this study, we examine the expression of lactoferrin mRNA and protein in human 
endometrium, endometrial hyperplasias, and adenocarcinomas using 
irnmunohistochemistry, Western immunoblotting, and Northern and in situ RNA 
hybridization techniques. Our results reveal that lactoferrin is expressed in normal 
cycling endometrium by a restricted number of glandular epithelial cells located deep in 
the zona basalis. Two thirds (8 of 12) of the endometrial adenocarcinomas examined 
overexpress lactoferrin. This tumor-associated increase in lactoferrin expression includes 
an elevation in the mRNA and protein of individual cells and an increase in the number 
of cells expressing the protein. In comparison, only 1 of the 10 endometrial hyperplasia 
specimens examined demonstrates an increase in lactoferrin. We also observe distinct 
cytoplasmic and nuclear immunostaining patterns under different fixation conditions in 
both normal and malignant epithelial cells, similar to those previously reported. in the 
mouse reproductive tract: Serial sections of malignant specimens show a good correlation 
between the localization of lactoferrin mRNA and protein in individual epithelial cells by 
in situ RNA hybridization and irnmunohistochemistry. Although the degree of lactoferrin 
expression in the adenocarcinomas did not correlate with the tumor stage, grade, or depth 
of invasion in these 12 patients, there was a striking inverse correlation between the 
presence of progesterone receptors and lactoferrin in all 8 lactoferrin-positive 
adenocarcinomas. In summary, lactoferrin is expressed in a region of normal 
endometrium known as the zona basalis which is not shed with menstruation and is 
frequently overexpressed by progesterone receptor-negative cells in endometrial 
adenocarcinomas. 
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Malignant Transformation of the Human Endometrium Is Associated with 
Overexpression of Lactoferrin Messenger RNA and Protein 
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ABSTRACT 

• In the mouse uterus, lactoferrin is a major estrogen-mductbte uterine 
secretory protein, and its expression correlates directly with tbc period of 
peak epithelial cell proliferation. In this study, **e examine the expression 
of lactoferrin mRNA and protein In human endometrium, endometrial 
hyperplasias, and adenocarcinomas using immun ©histochemistry, West- 
ern immunobiolting, and NoHhern and in situ RNA hybridization lech- 
- niqoes. Our results reveal that lactoferrin is expressed in normal cycling 
endometrium by a restricted number of glandular epithelial cells located 
deep in the zona basalis. Two thirds (8 of 12) of the endometrial adeno- 
carcinomas examined over express lactoferrin. This tumor- associated in- . 
crease in tactoferrio expression includes an elevation in the rnRNA and 
.protein of individual cells and an increase In the number of cells express- 
ing the protein. In comparison, only 1 of the 10 endometrial hyperplasia 
specimens examined demonstrates an increase in lactoferrin. We also 
observe distinct cytoplasmic and nuclear immunosufmng patterns under 
different fixation conditions in both normal and malignant epithelial cells, 
similar to those previously reported in the mouse reproductive tract 
-Serial sections of malignant specimens show a good correlation between 
the localization of lactoferrin mRNA and protein in individual epithelial 
celts by in situ RNA hybridisation and imrnunohisiochcinistry. Although 
the degree of lactoferrin expression in the adenocarcinomas did not 
correlate with the tumor stage, grade, or depth or invasion in these 12 
patients, there was a striking inverse correlation between the presence of 
progesterone receptors and lactoferrin in all 8 lartoferrin-positive adeno- 
carcinomas. In summary, lactoferrin is expressed in a region of normal 
endometrium known as the zona basalis which b not shed with menstru- 
ation and is frequently over expressed by progesterone receptor-negative 
cells in endometrial adenocarcinomas. 



INTRODUCTION 

The uterus is a sex steroid-responsive organ that plays a major role 
in women's health. Hysterectomies weic the most frequently per- 
formed major surgical procedures in a 20-year srudy interval 
(1965-1984; Ref. 1). Fifty-eight to 80% of Ihese 12.5 million 
procedures weie performed foi cstrogen-r elated disoidcrs of prolifer- 
ation. Chronic unopposed estrogen exposure, roost commonly asso- 
ciated with type 11 ovulatory disoidcrs, eventually leads to the devel- 
opment of complex endometrial hyperplasia and adenocarcinoma. 
Since the sex steroids, estiogcn and progesterone, act on their target 
tissues by regulating the expression of a wide variety of signaling 
molecules, identifying these regulatory factors will provide critical 
information towards undei standing normal repioduciion and repro- 
ductive tract pathology. Our current knowledge of estiogen and pro- 
gesterone acrion on the reproductive tract is based to a gieat extent on 
information collected from lodenis (2). Although differences exist 
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between the. reproductive physiology of rodents and banians, the 
mouse bas been a useful model for studying steroid hormone action in 
the human female reproductive tract (3, 4). One potential regulatory 
molecule shown to be regulated by estrogen in the mouse reproductive 
tract is lactoferrin. Lad of errin is a basic glycoprotein with an extraor- 
dinarily high affinity for iron that was originally discovered in mflk. 
This protein is expressed id a wide variety of tissues, most notably hj 
polymorphonuclear leukocytes and. most mammalian exocrine glan- 
dular secretions. In the mammary gland (5) and the female reproduc- 
tive tract of the mouse (6-8), ladofenin is regulated by endocrine 
hormones. Prolactin stimulates lactoferrin synthesis in the breast; 
whereas in. the uterus and vagina, the ovarian sex steroid, 170- 
estradiol, is the inducer. (6, 7, 9). To date, lactoferrin is one of the few 
genes that have been identified in the rodent that are directly regulated 
by estradiol. The lactoferrin gene contains an ERE 2 that is important 
for. regulating its expression in vivo in the mouse reproductive tract 
Being linked to estradiol, the expression of lactoferrin by the uterine 
epithelium parallels the onset of DRA synthesis. Although sequencing 
information suggests that the human lactoferrin gene also contains a 
functional imperfect ERE in the 5 '-flanking promoter region (10, H\ 
there is very little data regarding lactoferrin expression in the human 
female reproductive tract; 

The purpose of our study was to examine the expression of lacto- 
ferrin in the human endometrium under normal and pathological 
conditions by iinmuDohistochemistry, immunoblotling, and Northern 
and in siiu RNA hybridization techniques. In addition, we looked for 
correlations between lactoferrin expression and several parameters, 
such as the stage of the menstrual cycle, the distribution of estjogen 
and progesterone receptors, HER-2/new expression, markers of cell 
proliferation, and the histopathologicaj grade and extent of myome- 
trial invasion in the adenocarcinomas. Our data demonstrates that 
lactoferrin is expressed in a very restricted number of glands in the 
basal region of normal human endometrium and is markedly over ex- 
pressed in a significant number of the uterine adencK^rcinoroas by 
PR-negative cells. * 



MATERIALS A NO METHODS 

Tissue Preparation ond Hisiolo^ical EvaJ uau'on. Surgical pathology 
Specimens were obtained from Duke University Medical Center (Durham, NC) 
and tbc Department of Pathology ai MonUeal General Hospital (Quebec. 
Montreal, Canada). Cycling endometrium was obtained from 22 women (age* 
31-49). and atrophic endometrium was obtained bom 7 postmenopausal 
women (agts 64 -77). Hysterectomies from cycling women were performed 
for subscjosa! leiomyomas (n = 6), pelvic relaxation (n = 8), pelvic pain 
(" = 4 )y peritoneal endometriosis (n = 2), and cancer of either the esocervw 
in ~ J) or the ovary (n = 3). In addition, 12 adenocarcinomas, 3 * typical 
complex' hyperplasias,, 3 complex hyperplasias without alypia, and 4 simple 
hyperplasias were analyzed. £acb human uterus was bivaJved shortly aftti 



' Tnc abbicWstionj used aic: 
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hysterectomy, and endometrium was removed from the fundal region. A full 
thickness biopsy was placed into either 10% neutral -buffered formalin or 
jjouin's soloiioo overnight at room temperature before dehydration, paraffin 
embedding, and sectioning at 4 jim on silaniied slides. Histological evalua- 
tions of hematoxylin and eos in-stained slides were performed blindly by one 
board-certified pathologist. Normal endometrial samples were dated by the 
criteria of Noyes ct at (J 2). Endometrial hyperplasias and carcinomas were 
classified according to the current recommendations, of the International 
Society of Gynecological Pathologists under the auspices of WHO (13). 
Histological grading of rumors was performed according to Federation Inter- 
nationale des Gynccologistes et Obstciristes criteria (14). Each specimen was 
read a minimum of thiee times, and only specimens that were read consistently 
ihc same way were included in the study. Unstained sections of the same 
tissues were used foi the cy tochetnical analysis of protein and roRNA expres- 
sion using specific reagents. A few endometrial samples were frozen for 
subsequent protein and RNA extraction* which were evaluated by Western and 
Northern blotting, respectively. All human tissues were handled with the 
precautions and the guidelines required by Duke University and National 
Institute of Enviioninenla) Hearth Sciences, 

Immunolocaliiation. Slides chosen for study were deparaffinized and 
rinsed in 20% glacial acetic add at 4°C for 15 s to inhibit endogenous alkaline 
phosphatase. All subsequent incubations and washes were at room tempera- 
ture. Sections were next equilibrated in PBS for 20 min and incubated for 20 
min with 1.5% normal goat serum dDuted in PBS to block nonspecific binding. 
Defection of lactoferrin was performed primarily with a jabbit ajjtt'-bumart 
lactoferrin polyclonal anlisera geneialed in our laboiatory and affinity purified. 
Similar results weie also seen with a non affinity -purified commeicial antisera 
(Biogenex, Ssn Ramon, Ca). Following incubation at room temperature with 
primary antisera foi 60 min, the seciions were washed in PBS twice for 10 min 
each, and lactoferrin was localized using an alkaline phosphatasc-biotm- 
sneptavidin dctcciion system (Vectastain ABC-AP kit: Vector Laboratories, 
Builingamc, CA; oi the Super Sensitive Detection System: Biogenex, San 
Ramon, CA)- To identify nonspecific staining, pxeinnnune rabbit serum was 
used in place of the primary antibody. The immu nor taction was quantitated by 
determining the percentage of glands and the peicentage of cells siaining for 
• bet oft n id in the 2ona basal is and the 2 on a funciionalis, with a minimum of 
300 cells counted in each region. PR antibody was provided by Geoffrey 
Greene (KD66), and a commercial source was also used (Biogenex, San 
Ramon, CA). Identical staining patterns were confirmed with boih prepara- 
tions. Other commercially obtained anlisera include PCNA (Biogenex, San 
Ramon, CA), ER (ERJD5; AMAC, Wesibrook, Ma), MIB-3 (AMAC), and 
HER-2/nfu (Biogenex, Sao Ramon, CA). The primary antiscia incubatioos 
were 2 b for tbe PR, 1 b for PCNA. M1B-1, and ER, and 30 min for 
HER-2//ICU- Antiscia dilutions were 1:100 for M1BO and 1:20 for HER-2/new. 
Antigen jetrieval (Biogenex, San Ramon, CA) was performed before adding 
the pjogestejone primary antiscia. 

Western Blot Analysis. Proteins weit exu acted from endometrial biopsies 
by bomogenization on ice in J % Triibn-X and 20 mM Tris-HO (pH 7.4) with 
protease inhibitors (10 /ig/ml leupeptin, 200 KU/rolaprotinin, and 20 p-g/ml 
phenylmetbyUuJfonyl fluoride) and clarified by cemrifugation at 45,000 rpm 
for 30 min in a BecJunan 70.1 Ti rotor; then the supernatant was analysed foi 
protein concentration by the BCA protein assay (Pierce, Rocfcfojd, IL). Ali 1 
quois of 200 fig were separated by electrophoresis on an 85% SDS polyac- 
rylamide gel, blotted onto nmoceDulose membranes, incubated with poly- 
clonal rabbit aniihuznan lsciolerrin antisera, and localized with an 135 I- labeled 
donkey anfj-rebbit immunoglobulin, 65 described previously (7). 

In Siru Hybridation- All slides were pre treated with 02 N HO for 30 
niin at room lempeiarure, digcsicd wjib 1 /ie/ml pioieinasc-K (Sigma Chem- 
ical Co.. St. Louis. MO) in 0.1 M Tris-HCl (pK 7.4VG.05 w EDTA for 15 min 
at 37"C, and then treated with 0.1 M triethanolamiue-0.25% aceik anhydride 
for 5 min at. room, tempera lure and 0.1 m Tris- glycine (pH 7.4) for 30 min ai 
room lemperalurc. The seciions woe subsequently dehydrated wilh graded 
"hanol, air dried, and prel)ybridi2cd 'ai M)"C for 1 b in 2 X SSC 10 dim DTT, 
5 X Denhardrs solution, 100 pc/ml of botb salmon sperm DNA and yeast 
iRNA, and 50% fonnamide (15). The slides were iben hybridized ovemigbr at 
50*C in ibe same medium wiib 10% dcxiran sulfate and 2 X Iff" cpm/fi! of the 
specific RNA probe. The lacioferrin oligonucleotide pjobe wa* amplified by 
J*CR using primers thai spannid nucleosides 71 £-1654 (10) and cloned into 
pGEM-42. "S-labclcd sense and antisensr RNA probe* were made w»b ibe 
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Promega Riboprobe kit (Promega, Madison, Wi), washed twice id 1 X SSC fox 
10 min at room temperature, digested with RNase (2J >g/ml RNase-A, 03 
Mg/ml RNasc TJ, 10 mM Tris-HQ (pH 7.4). and 15 mM NaOj. and washed 
again with 1 x SSC twice for 20 min each time at 50°C, twice for 20 to 
OJ X SSC at 55°C and once for 20 min at 60*C Tbe sections were then 
dehydrated and dipped- in KodaV autmadiographic emulsion (NTB-2) for 
dejection of specific mRNA expression. Tbe slides were allowed to develop for 
2 weeks. After this period, the slides were developed using Kodak DJ9 
developer and Kodak Kapid Fixer. 

Northern RNA Analysis. Total cellular RNA was purified from froieo 
tissue by the guanidine isothiocyanale-ccsiuro chloride method, and poIytA*> 
RNA was isolated by oligo(dT>ccllulose chromatography using methods 
described previously (15). For Northern blot analysis, poiy(A*>.RNA"was 
resolved by electrophoresis on 1 5% formaldehyde agarose gels, stained with 
eibidium bromide, and transferred to a nylon membrane. The membrane was 
probed with a ^P-labeled lactoferrin cDNA derived from human uterus (nu- 
cleotides 718-1654; accession no. S52659) using PCR techniques, followed 
by cloning into pGEM-42 (Promega, Madison, WI). tp order to insure that the 
quality and quantity of RNA analyzed by Northern blotting was equivalent 
between control and treated groups, the Hot was probed simultaneously for 
glyceraldehyde-3-pbosph»te dehydrogenase. 

. Statistical Analysis. Values are presented as means ± SD. Differences 
between the zona basalis and funciionalis were tested by the two-taDed 
Student's 1 test. * 

RESULTS 

Immunobistochemical Analysis of Lactoferrin. Protein 

Normal Cycling Endometrium. InunuBohistochciDipl studies of 
norma) cycling human endometrium localize lactoferrin protein pre- 
dominantly to tbe glandular epithelium deep in the zona basalts and 
not to the functionalis (Fig. 1A). The association of Jacloferrin protein 
expression with the zona basalis is statistically significant, (p < 0.001; 
Fig. 2). Two to 56% of the glands express lactofenin at any given time 
during the menstrual cycle. Within positive glands, lactofenin protein 
immunoIocaJization is heterogeneous in that positively staining epi- 
thelial cells are interspersed with cells negative for lactoferrin expres- 
sion. No apparent differences in morphology, PCNA, ER, or PR 
expression are seen to account for the heterogeneous pattern of inua r 
and intexgland lactoferrin expression.in normal endometrium. Similar 
to our previous findings in mouse uterine epithelial cells, the positive- 
staining glandular cells of the human endometrium demonstrate two 
distinct irnmunostaining patterns foi lacioferrin, cytoplasmic and nu- 
clear (Fig. 1B\ seen with both formalin and BouiVs fixation. Jn 
evaluating the temporal expression of lactofen-in, tbere is a trend 
toward? more glands expressing lacioferrin during tbe secretory 
phase. Because of the large variance, tbe trend is not statistically 
significant. As expected, the polymorphonuclear leukocytes in the 
endometrium also stain intensely for lactoferrin protein, which is 
stored in their secondary gtanules (Fig. 3> These results demonstrate, 
for the first lime, that lactoferrin protein is expressed in the human 
endomeuium predominately by polymorphonuclear leukocytes and 
epithelial cells of glands located deep in tbe zona basaJis. 

Proliferative Endometrial Disorders: Hyperplasias and Adeno- 
carcinomas- Immunohistocberoistry reveals that the expression of 
jar jofrrrin pxtfrhi is i»ae*sc4 in 66.6% (8 nf 22) of !br. endometrial 
aderxKaicinomas examined. In one- half of these cases (4 of 12), 
Jacioferrio is intensely expressed by malignant epithelial cells 
throughout the entire tumors (Fig. 4). The other four adenocarcinomas 
demonstrate increased staining lor lactofenin in concentrated regions 
of the rumors. In all eight cases where lactofenin expression is 
elevated, the cells expressing lactoferrin have one similarity witti 
normal positive gjands in that they demonstrate heterogeneous stain- 
ing of interspersed positive and negative cells. However, the expres- 
sion of lactofenin by the malignant cells citarly differs from normal 



lACTOFERJUN DYS REGULATION IK ENDOMETRIAL CaKCEH 




Ftj- J- Localization ol Jadofcnin protein in normal "cycling endcnictjiuiD by imroo- 
*■ nohisiocbctnistiy using a specific polyclonal antibody. Our analysis rtvepfe thai lactof- 
emo protein b p/cseni'm » limited number of gUnds located in the 2ona bas*Jis of the 
endometrium MJe X 10. Also, note that lactofcrnn is heterogeneous wilfcin positive 
glands, i.t., cells staining foj lactofenin we interspersed with negative -staining < pit belief 
cells throughout iht glmd. Two immunohj?tochcmicaJ staining ptlteros ire- noted lor 
lactoferrin in normal uterine epithelium. In ont pattern, lactoierrin mote in is imrntmoio- 
calizcd primaiity over tJie cytopfasto (BJ\ and in the other, tbe steining is seen over the 
nucleus (B7)r t X 40. 



positive glands in thai the lactoferrin is n6t limited lo tbc basal regions 
of the tumors, many more ccDs are positive, and lb e relative intensity 
of Ihe staining ovej individuals cells is increased. Aithougb increased 
lactofenin expression is associated with malignant transformation, we 
do not find a correlation between lactoferrin prolein presence and the 
stage, nuclear, grade, Federation Internationale des Gynecologistes el 
Obsictrisres grade, or tbe depth of myometrial invasion in the 12 
tumors studied (Table J). In sharp contrast lo tbe common dysregu- 
1 a lion of lacioferrin expression found in ihe malignant endometrium, 
only 1 of 10 endometrial hyperplasia specimens evaluated contained 
an increased number of cells staining foi lacioferrin. The hyperplastic 
specimen over expressing lacioferrin was read as complex without 
atypia. 

In Si/u and Northern Analyses of Lacioferrin mRNA 
Expression in Normal and Malignant Endometrium 

To fun her our understanding of !hc location cf lacioferrin protein 
synlbesis in the human endometrium, we examined lacioferrin mRNA 
expression by in situ and Nonhem hybridization using specific 33 S- 
labcled probes for human lactoferrin. No delectable RNA hybridiza- 
tion is observed in ihe normal endometrium by in siiu hybridization, 
even in the ptesence of iromunodeteciable piolein, using equivalent 
hybridization conditions and development limes as used for the ade- 
nocarcinomas. Consistent with the results obtained by in situ hybrid- 
ization, long e^posuje lime^ weie. required to demonstrate lacioferrin 
mRNA in noiinai endometrium by Northern blol using poh(A*)- 
mRNA. Tim indicates that lacioferrin mRNA is present in- normal 
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tissue but in very low levels, consistent with tbe very limited pattern 
of protein expression in normal endometrium. Equivalent RNA. load- 
ing and quality for each specimen was demonstrated by etbidiurn 
bromide staining of the RNA gels (data not shown) and by probing for 
the housekeeping gene glyccraldebyde-3-phosphate dehydrogenase 
that does not fluctuate significantly with tbe metabolic state of the 
tissue- A representative Northern blot is shown in Fig. 5. Jn situ 
hybridization with several adenocarcinomas reveals thai there is a 
direct correlation between the localization of lactoferrin mRNA and 
tbe irnmunostaming of expressed lacioferrin protein (Fig. 4). Lactof- 
errin mRNA is not associated with polymorphonuclear leukocytes by 
* in situ hybridization in either norma) or malignant tissue. - 

Western Blot Analysis 

To confirm the specificity of the antisera that we used Tor immn- 
nonJstccneroistry, we performed Western blot ana/ysrs oo proteins 
extracted from both normal and malignant endometrium which were 
separated by 8.5% SDS-polyacrylamide gel electrophoresis (Fig. 6).. 
Iramunoblotting identified a single broad protein band with a molec- 
ular weight between 70,000-80,000 in both normal and neoplastic 
endometrial tissue homogenates, consistent with tbe reported molec- 
ular weight of human lactoferrin. Supporting the imrau neurochemi- 
cal analysis, a representative immunoblot deafly demonstrates that 
tbe proportion of protein that is lactoferrin is markedly increased in 
ibe adenocarcinomas in comparison to the normal endometrium. Tbe 
molecular weight of laaoferrin in the adenocarcinomas appears to 
have a slightly higher molecular weigbi that the predominant form in 
normal tissue. 
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Fig. 3. Polvn»orphonoclc»r leukocytes (flrro>»,j) arr scattered throughout the endomc- 
oiuro and siain intensely foi tacto/crrin. Ladofc/rin is » Jipown component oj tbc 
srcondiry granules in polyroorpboouckir leukocyte*. Tbt presence of a segmented 
nucleus and iKioftrrm pioitbi is «o excellent rottbod foi identifying this group of 
rnflammerotj cells. A, X 20. * SO. 



Correlation of Lactoferrin Expression with the Expression of 
PCNA, Ki-67, HEK-2/»ru, EJR, and PR 

In an attempt 10 characterize the phenorype of endometrial cells 
which exprtss lactoferrin, wc performed imrounohistocbemistry on 
serial sections for (he Ki-67 antigen, PCNA, HER-2/neu, lactoferrin, 
ER, and PR. In normal tissue, Ki-67 and PCNA expression are cell 
cycle -specific markers of eel) proliferation (16, 17). Upon analysis of 
normal cycling endometrium, do relationship between lactoferrin pro- 
tein expression and ER, PR, or Ki-67 expression was observed. 
Similaily, in rnost of the adenocarcinomas evaluated, no relationship 
was noted between lactoferrin and PCNA protein expression. How- 
ever, in one adenocarcinoma (Fig. 7), there was a cleat inverse 
it rations hip seen between Jaciofenin and PCNA localization, which 
was present throughout the entire tumoi. Most dramatic, however, was 
a striking inverse correlation seen between lactoferrin and PR expres- 
sion in 8 of 8 PR-positive uterine adenocarcinomas (rig. 7). Two 
tumors negative for PR also did not express lactoferrin. AJt hough an 
inverse correlation was also suggested with HER-2/neu and PR, Che 
iDveise correlation was more precise with lactoferrin in these tumors. 

DISCUSSION 

In the mouse uterus, lacroferrin is an csirogen-induced uterine 
secretory protein that is present throughout the epithelium (7), and it 



is expressed concomitantly with epithelial cell proliferation. In con- 
trast to lactoferrin*s ubiquitous expression in the estrogeni2ed mouse 
uterine epithelium, Jaciofenin protein is limited to glandular epithelial 
cells in the basal regions of normal burn an endometrium and usually 
to glands thai were directly adjacent to the myometrium (£.£., the 
deepest glands of the zona basalis). This regional localization of 
lactoferrin . expression is not surprising in that other biochemical 
parameters have been reported to show site specificity in primate 
endometrium. These parameters include the proliferative index and 
the expression of the secretory component of IgA (18). Similar to. our 
observations in the endometrium, Jacioferrin is also expressed region- 
ally in the mammary gland. In bovine beast tissue, lactoferrin is 
localized primarily to the basal alveolar cells (19); whereas in human 
breast tissue, the ductal epithelium appears to be the primary source of 
. secreted Jaciofenin during Jactation (20). 

Examining the endometrium on different days of the menstrual 
cycle demonstrates a trend towards increased lactoferrin expression 
during the luteal phase. Although this data is not statistically signif- 
icant, the cyclic variation may be biologically iclevant Kim ti oL (23) 
recently reported thai the basal endometrial epithelial cells arc unique 
because they proliferate during the postovulatory luteal phase. Inter- 
estingly, in the mouse uterus, there is a direct correlation between 
lactoferrin expression and epithelial cell proliferation (6). Therefore, 
lactoferrin may have a similar role in the human and mouse endome- 
trium. If lactoferrin expression is. cyclic, the ERE in the 5'-flanking 
promoter region of the human lactoferrin gene may be activated 
during the luteal phase (30, 13). 

Another similarity between human and mouse uterine lactoferrin 
expression is the observation of two immunobistochemical staining 
patterns. In one pattern, the antiseia binds primarily over the cyto- 
plasm, and in the other, the nucleus is the primary site of localization. 
Although this could represent a fixation artifact, we have now ob- 
served this partem in two species and under different fixation condi- 
tions. It has been demonstrated that signaling peptides, i.t\. platelet- 
derived growth factor (22), Int-2 (23), and probasin (24), can be 
selectively directed into the nucleus, cytoplasm, or secreted. It is 
believed that this differential processing may allow proteins to have 
intracrine, autocrine, and paracrine roles, depending on the physio- 
logical state of the cell. The two localization patterns observed suggest 
thai laciofenin might have signaling sequences that direct the final 
destination of the mature peptide. 

In normal endometrium, lactoferrin roRNA is present but in very 
low levels. Prolonged exposure times are needed to visualize the 
mRNA band with Northern analysis. Although with in situ hybrid-, 
ization the . lactoferrin mRNA signal is easily seen in adenocarcino- 
mas, we failed to localize the lactoferrin mRNA in normal endome- 
trium using an equivalent exposure time. These low levels of mRNA 
in normal endometrium suggest that the synthesis and degradation of 
lactoferrin mRNA is more tightly regulated in normal tissue than in 
the adenocarcinomas. Because of ihe low levels of message in normal 
tissue, we are unable at this time to definitively conclude that lacio- 
fenin mRNA is synthesized by tire same epithelial cells which express 
the protein. 

■. In endometrial adenocarcinomas, malignant transformation of the 
endometrium is associated with the up-regulation of lacloJerrin 
mRNA and piotein biosynthesis. The up- regulation at the RNA level 
is demonstrated by an increase in steady-srate RNA levels using both 
in jinv hybridization and Northern analysis techniques. In these can- 
cers, we slso observe an increase in ihe number of lactoferrin-positivc 
cells, which express both the protein and mRNA. In this study. 8 of 
the 12 adenocarcinomas evaluated ovcrexpress lactoferrin, compared 
with only 1 of 10 hyperplastic specimens. The form of lactoferrin 
protein extracted from endometrial adenocarcinomas appears to have 
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Kg. 4. OOoaliMtwD of Jictoftnin prxxtm (Ufi pcnth) and 
mRNA (right poneb) in a owmal proliferative endometrium «»d 
endometrial adtDOcardDOxna j by pcrf arming jjd cm nohistocb crab- 
try and m Jilw RNA hybridization on serial sections. Dual analysis 
of protein and mRNA expression rcveats thai glands h> normal 
citdoiDClnum do Dot hive detectable mRNA, is mtasw td bymiim 
bybridiiatioB (10^ pantb\ X 40), whereas analysts oi the adeno- 
carcinomas clearly demonstrates a direct correlation bclwctD pro- 
tein and RNA cxprcS5km loi iBaoferTin (middJe and fcevKw pontlr, 
X 10). Note ibet baofuTW protein and roRKA is dtslributed in t 
bctcrofeDcottt panem in ibe epithelial cells of the adcoocaicinomis. 
As is shows in Rg. 1, » htieiogeneops staining pa firm for bet of - 
trrin protein is ilso set» frequently Jn nonnal endometrium. 
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a slightly higher molecular weigh! than ihc protein present in norma) 
tissue. This could be due to alterations in the processing of tbe 
Iactoferrin mRNA, protein, or glycosylatiou by tbe malignant cells. 
Alicrnauvely, there could be minor differences between Iactoferrin 
protein which is present in neutrophils and the form synthesized by 
uterine epithelial cells. "We suggest two hypotheses to explain Jactof- 
crrin overexpressibn in endometrial adenocarcinomas, In the first 
hypothesis, lactofenin biosynlbesis is deregulated by the same pro- 
cesses that lead to tbe malignant transformation of endometrium. H 
Ibis hypothesis, is true then Iactoferrin may be a useful marker for 
endometrial adenocarcinoma investigation, and farther research is 
needed to del ermine whether Iactoferrin plays a contributing role is 
the malignant transformation. A second hypothesis is thai lacioferrin- 
positive human endometrial adenocarcinomas evolve from tbe clonal 
expansion of cells residing in tbe regenerative zone (zona basalis) of 
normal endometrium. It is interesting to speculate that Iactoferrin 
expression in endometrial cancer may be linked to estrogen action in 



some way, since proliferative disorders of human endometrium are 
linked to chronic estrogen exposure ovei several years and sequencing 
data suggests ihat the promoter for the human Iactoferrin gene does 
contain an ERE. 

Although the function of lactofenin is unknown, a variety of 
biological roles have been proposed for Iactoferrin which could link 
this protein to a role in cancer, including the regulation of DNA 
synthesis (25-29), modulation of the immune response (25, 30, 3J)> 
and iron transport (32). Some forms of lactofenin are jeported to have 
RNase activity (33, 34). Secreted RNases arc involved in develop- 
ment, reproductive /unction, neoplasia, angiogenesis, and immune 
suppression. (35, 36) If angiogeDesis and immunosuppression arc 
components of Jaaoferrin RNase activity, these properties could pro- 
mote tumor growth. 

In the endometrial adenocarcinomas, we observed 2 heterogeneous 
expression pallern for lactofenin, PCNA, Her-2//»ew, ER, and PR- 
With regard to prognosis, patient survival is reponedly worse when 
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1 2 3 4 



Fig. i- Northern analysis ol laciofeiim roKNA expression confumj ihal endometrial 
adenocarcinomas (Lanes 3 and si^nificznrty owexpress the 2.5-)uiobase transcript of 
human tacto/emn is comparison io normal endometrium (Lonrs J and 2\ The Northern 
hvbiidaation data supports me in j£m RNA rtsulls and confirms thai bctoftnia RNA 
txprcssion is dysresulatcd io uterine adenocarcinomas. Norma) uterine tissues appear to 
contain, low steady-state RNA levels ol bjctofcmn, reflecting a controlled pahxro of 
protein expression. Equivalent RKA loading and quality lot each specimen was demon- 
strated by ttro'dimn bromide staining of the RNA gel* (data not shown) and by probing for 
e housekeeping gene (j)yc*raldehyde-3-phospbaic dehydrogenase) that docs not fluctuate 
significantly with tie metabolic siate of the tissue. 



LACTOFEftRJN DYSREGUtATlON itf ENDOMTTWAL CANCER 

endometrial adenocarcinomas lose sex steroid receptors (37, 38), have 
a higher proliferative index (39), and demonstrate DNA aneuploidy. 
During the tumor progression of endometrial adenocarcinomas, il 
appears that the loss of steroid hormone receptors occurs earlier than 
either the increase in proliferation rate or the development of DNA 
aneuploidy (40). In our study, we note a striking inverse correlation 
between the expression of lacioferrin and PR in the endometrial 
adenocarcinomas. An inverse relationship also has been described for 
HER-2/mru and PR in endometrial adenocarcinomas that correlates 
with patient prognosis. Furthermore, in cancers of the human endo- 
metrium, ovary and breast Bcx-2/neu expression bas been associated 
with advanced disease and poor survival (41-43). HERrVmu is an 
oncogene that shares sequence homology with the epidermal growth 
factor receptor and is speculated to contribute s aberrant growth- Of 
note is that Jactoferrrn biosynthesis in the mouse uterus is associated 
with the expression of the" epidermal growth factor, like HER-2/nwi, 
the epidermal growth factor receptor is also frequently overexpresscd 
in PR- negative cells of endometrial adencoucinomas (44). The am- 
plification of growth factor receptor expression h» PR-negatjve endo- 
metrial adenocarcinomas may be associated with the acquisition of 
growth autonomy and hormone independence, which may contribute 
to (he poorer prognosis of PR-negative endometrial carcinomas (45). 
Some endometrial adenocarcinomas, including recurrent tumors, can 
be treated successfully with progesterone therapy (46, 47). Although 
the significance of the inverse relationship between lacioferrin and PR 
expression is not known, we speculate that the PR-negative cells do 
not undergo ihe normal growth inhibition and secretory differentiation 
normally associated with progesterone action. 

A survey of human tissues reveals that lacioferrin is expressed by 
most normal mammalian exocrine glands and may be a prognostic 
maiker in tumors (20). Lactofcrrin is found in normal ductule breast 
epithelium and in primary breast carcinomas. In breast tumors, there 
is an inverse correlation between lacioferrin and ERs (20). Notably, 
lactofenin expression in breast cancer may fall into the same category 



A. Immunoblot 



B. Coomassie Blue 
Protein Staining 



Fig. 6. Western blot analysis using an antiserum 
sptcJBc tot human lactofcrrin wis performed on 
proteins extracted from norma!' and malignant en- 
dometrium and sepajattd by SI>S-po)yacrylaniidc 
get electrophoresis. A broad prottin band with a 
molecular weight between 70,000 and 30,000 is 
Delected in both norma) and neoplastic endometrial 
■ tissue boroojenatci, consistent with the reported 
mo) t cub i weight ol human lactoferno (A\ The 
most significant observation fiom the unmiwoblot- 
tinf studies is that tactoferrin protein is markedly 
elevated in ibe adenocarcinomas, in comparison to 
norma) endometrium, vbich supports ibt iromvno* 
Otochctnic*i analysis th*l dcrooDSlralrs a •fester 
number of cc lb positive for laciofenin protein in 
the uterine tumors. S. »be relative amount ol protein 
loaded in each lane by Coomassie blue stajnin*. 
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Tig, 7. Conelaion of laaof em* poiew txprts- 
S3tm with the PCNA, a marker of proltftraiion, awl 
PR as measured by immunohisiocheinis&jr ptr- 
fonDtd on serial secoons; X 20. In most cases, do 
relationship between lactofenin and PCKA expres- 
sion is found in either normal co malignant endo- 
meirivni (data nol shown> However, in let 12 
kdenocarcbomas, an inverse correlation is seen 
between lactoferrm (A) and PCNA {B) localiza- 
doil This striking pattern was consisieni through- 
out the. entire tumor, suggesting the possibiliry of 
cell cyck regulation of lactofcrrii: expression in 
this adenocardnoroa. Tbe bottom pant Is eihibil an 
inverse corrclaboo between lactofcrrin (C) and PR 
expression (D), which was seen m a« eighl endo- 
metrial adenocarcinomas which expressed PR. 



LACTOFERStN DYSRECULATION IN ENDOMETRIAL ONCER 

LTF 



PCNA 




4 J > 



LTF 





~' " PR 



as ibt other maimers for ER-Dcgaiivc Tumors, soch as amplification of 
EGF reccpior, HER-2/neu, and transforming growth factor a expres- 
sion, which arc associated with poor prognosis. Id gastric carcinomas, 
lactofcrrin expression is associated with transformation of specific 
cell types including intestinal -type carcinomas, adenomas, and incom- 
plete intestinal metaplasias (28). Although this is complete specula- 
Don al this time, peibaps lactofenin ovei expression in the vaiious 
malignancies may complement !bc actions of the giowth.factoi path- 
way molecules and contribute to the autonomous giowth of these 
himots. 

In conclusion, oui studies reveal tbal lactofenin is. associated 
with a unique population of epithelial cells in the zona basaJis and 
ibat lactofenin ovcrexpiession may be associated with malignant 
liaDsfoiroation of the human endometrium. Further studies are 
needed to elucidate the role of lactofcrrin in normal and patholog- 
ical endometrial physiology. 
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Cell proliferation in human soft tissue tumors correlates with platelet- 
derived growth factor B chain expression: an immunouistocuemical and in 
situ hybridization study. 

Wang J, Coltrera MP . Gown AM . 

Department of Pathology, University of Washington, Seattle 98 195. 

/^S h ° r f ^ thC h yP 0(hesis mat the B chain of the platelet-deri ved growth factor 
(PDGF), a known connective tissue mitogen and growth factor, could be expressed by 
human soft tissue tumors, and that its expression could play a role in the control of cell 
proliferation in these tumors. Using a set of 56 soft tissue tumors, including benign 
tumors and all three grades of sarcomas, PDGF-B chain protein was localized using 
unmunohistochemistry and PDGF-B rnRNA was localized using in situ hybridization 
The hypothecs that PDGF-B expression was related to cell proliferation was tested by 
simultaneously demonstrating the expression of the proliferating cell nuclear antigen in 
sequential tissue sections of the same tumors. Sixty and 82% of tumors had demonstrable 
PDGF-B rnRNA and protein, respectively, with a strong correlation between their 
degrees of expression (P = 0.0001). Among the sarcomas, a strong correlation between 
on™ "o expresS ! on m * ih «w«ing malignant tumor grade (P = 0.006), and between 
PDGF-B expression and increasing proliferating cell nuclear antigen index (P = 0 01) 
was found. All tumors were also demonstrated to express the beta receptor of PDGF via 
immunohistochemistry. These studies suggest that PDGF-B expression may be an 
important mediator of cell proliferation control, via an autocrine mechanism, in human 
soft tissue tumors and may correlate with clinical outcome in the sarcomas. 

PMID: 790391 1 [PubMed - indexed for MEDLINE] 
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Expression of cadlierins and catenins in paired tumor and non neoplastic 
primary prostate cultures and corresponding prostatectomy specimens. 

tvp-P^ ^rT^ ^SEbensonM, Wang_Q, BiscegUaM, Stoneri XJionyis'A 
DcFlavia P , DhirR, Becich MJ . LLSlJ1 ' 

■ Department of Pathology, University of Pittsburgh Medical Center, PA, USA. ■ 

Cadherins are a family of transmembrane proteins that play a crucial role in cell : 
differentiation, cell migration, and intercellular adhesion. Cadherins are associated- with 
catenins through the,r highly cohsen^ed cytoplasmic domain,Down-regulationofE 
cadhenn protein has been shown in various human cancers. This study examined the 
expression of cadhenns and associated catenins at the mRNA level. Paired tumor and 
nonneoplastic primary prostate cultures were obtained from surgical specimens 

S rTpVT P y flU ^ eSCenCe ^^^Polv^erase chain reacaon 

(QMF RT-PCR) and quantitative analysis were performed and correlated with 

ST H m reSUl ?'. SiX .° f f VOT C3SeS ° f nCOpIaStic CultUres showed moderately-to- 
maricedly decreased levels of E-cadherin and P-cadherin mRNA. Similar losses of alpha- 
catenin and.beta-catenm mRNA were also observed. The results of QMF RT-PCR 
showed good correlation with the results of immunohistochemical studies based on 

STSS 'r t n " fiXed SeCti ° nS - In conc]usion > Paper presents a coordinated 
down-regulation m the expression of E-cadherin and associated catenins at themRNA 
and protem level in most of the cases studied. This down-regulation may play an 
important role mthe pathogenesis of prostate cancer. 

PMID: 1 1 127708 [PubMed - indexed for MEDLINE] 
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,P. dh f rinS a " 3 fami,y of ^"smembrane 
proteins that pJay a crucial role in cell differentiation, 
cell migrahorl and intercellular adhesion. Cadherins are 
assoaated with calenins through their highly conserved 
cytoplasmic domain. Down-regulation of E-cadherjn 
protein has been shown in various human cancers This 
study ecam.ned the expression of cadherins and associ- 
ated calemns at the mRNA level. Paired tumor and non- 
neoplasuc primary prostate cultures were obtained from 
surgical specmens. Quantitative multiplex fluorescence 
^ '™ DS = ri P'ase-polymerase chain reaction (QMF 
r/„„S an ^ uan,i,a,ivc were performed and 

correlated with immunostain results. Six of seven cases 

dLre^H? 0 f Uh r r f ^^.ely-to-markedly 
decreased levels of E-cadherin and P-cadherin mRNA 
■Smihr: losses oT a -catenin and /J-catenin mRNA were 
also observed. The results of QMF RT-PCR showed 
pod correlanon with the results of immunohistochem- 
rcal studies based on corresponding formalin-fixed sec- 
tions. In conclus.on, this paper presents a coordinated 

•^7? " lhe ra P rcssi0 " of E-cadberin and 
associated catemns at the mRNA and protein level in 
most of the cases studied. This down-regulation may 
play an important role in the pathogenesis of prostate 

cancer. 
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Pros ate cancer is the most common malignant tumor 
and rhc second leading, cause of cancer death in men. 
The clmica course of patients with prostate cancer 
vanes- widely, and different factors contribute to this 

™? ^ V t ariabi,il y- inc,udi »g genetic back- 
ground ^orrnonal environment, and the invasive 
potential of the tumor. Invasion and metastasis arc the 
hallmarks of malignancy and have been closely linked to 
al erations in cell-to-cell adhesion, cell migration, and. 
interactions with extracdluiar matrix components 122] 

, Cadhenns arc a family of transmembrane glycopro- 
teins responsible fox maintaining the integrity of tissue 
and are involved in cell differentiation, cell migration, 
and intercellular adhesion through a calcium-dependent 
mechanism characterized by homotypic adhesion [35- 
J/J. I heir highly conserved cytoplasmic domains asso- 
ciate with catemns, a group of intracellular proteins that 
mediate contact between the cadherins and the micro- 
filaments of the cytoskeleton. Each cadherin subclass 
shows a unique tissue distribution: E-cadherin is pre- 
donunanily expressed in epithelial cells and P-cadherin is 
restricted to decidua tissue and the basal or lower layers 
ofslratified epithelium [30]. 

The accumulating evidence suggests a decrease or loss 
ol Tunction in E-cadherin and P-cadherin in several 
human carcinomas [3, 9, 23, 32]. Loss of heterozygosity 
(LOH) at chromosome 16 in the location of the E-cad- 
nerm gene is present in a high percentage of prostate 
cancers |6, 2 1 , 26, 28]. Decreased expression of E-cadherin 
is seen in various human malignant tumor cell lines, and 
■he level of decrease correlates with the invasive potential 
of the tumor cell lines [I, 9, 12, 24, 40, 42]. )n addition, 
many, but not all, immunohistochemical studies using 
formahn-fixed, paraffin-embedded tissue have shown that 
the E-cadher,n protein is decreased in prostate cancer and 
the decrease is correlated with tumor grade |4 8 13 27 
38J. Other studies have shown decreased-to-abseni" 
F-cadherm levels, but variable E-cadherin levels [33] 
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a- and 0-catenins bind the cytoplasmic domain of 
E-cadherin and link it to the cytoskeleton 116, 31 J. Down- 
regulation of expression and deletion of a-catenin genes 
were identified in several human cancer cell lines [19, 24, 
4%). Immunohistochemical studies showed decreased 
ct-catenin staining, which correlated well with the loss of 
E : cadherin staining and patient survival [25, 29, 39). In 
addition, decreased ^catenin protein expression appears 
to be associated with malignant transformation of epi- 
thelial tissue [34]. These results suggest that cadherins 
and catenins may function as tumor invasion-suppressor 
genes. However, most of these results were obtained in 
studies using transformed tumor cell lines and formalin- 
fixed, paraffin-embedded tissue. To our knowledge, 
no studies to date have examined the co-expression of 
cadherins and catenins at the mRN A level using material 
derived from surgical specimens. 

Recently, we have developed the methodology to 
cultivate primary epithelial cells under defined condi- 
tions from surgical prostatectomy specimens [20]. Areas 
of both carcinoma and non neoplastic tissue are identi- 
fied grossly, verified histologically, and then cultured 
separately, resulting in paired primary cultures of both 
non-neoplastic and neoplastic epithelium from the same 
patient The non-neoplastic tissue cultures serve as an 
important control of any person-to-person variability in 
the expression of the genes of interest. 

In this study, we investigated the co-expression 
of cadherin and catenin mRN A, from multiple paired 
primary prostate cultures derived from surgical prosta- 
tectomy specimens using quantitative multiplex fluores- 
cence reverse transcriptase-polymerase chain reaction 
(QMF RT-PCR) [41, 44]. Immun ©histochemistry stud- 
ies of cadherins and catenins were performed on the 
corresponding paraffin-embedded prostate tissue. 



Materials and methods 

Tissue specimen and primary prostate cultures 

We studied patients with adenocarcinoma of the prostate who 
underwent prostatectomy at the University of Pittsburgh Medical 
Center during the lime interval January 1996-January 1998. No 
patient had received treatment prior to surgery. Fresh prostatec- 
tomy specimens were sectioned and grossly examined, represcnia- 
tive samples of neoplastic and non-neoplastic tissue were collected, 
and the diagnoses were confirmed by examination of hematoxylin 
and eosin (H&E) -stained sections. Epithelial cells from rumor and 



non-neoplastic areas of the specimen were isolated and then cul- 
tured with a chemically defined medium (COM) without addition 
of growth factors as previously described. Tissue fragments were 
cut into small pieces and underwent a scries of collagenous diges- 
tions. Following each digestion, the cells were pelleted by ccntn- 
fugation at 1,000 rpm for 4 M1N |20J. To selectively promote 
epithelial cell growth, the pellets were rcsuspended.and maintained 
in serum-free CDM supplemented with epithelial growth factor 
(EGF) [20). The culture's morphology was examined daily, and 
epithelial cells were allowed lo grow until confluence was reached, 
between days 7 and 10. 

Isolation of total RNA and synthesis of cDNA 

Total RNA was extracted from the cultured primary prostate 
epithelial cells at first passage according lo the Trizoi solution 
(Gibco BRL, Rockville, MD) modified method of ChoinczynsJc* 
and Sacchi [7J. The RNA was then quantitated spectrophotomet- 
rically. Two micrograms of total RNA were used for first strand 
cDNA synthesis using oligo-dT primers and MMLV reverse tran- 
scriptase (Gibco BRL, Rockville, MD). 

PCR primers and quantitative multiplex fluorescence PGR 

PCR primers for hepatocyte growth factor (HGF) and c-myc 
were synthesized as previously described 1 1 % PCR primers for 
E-cadherin, P-cadherin, a-calenin, 0-catenin» and 0-acUn cDNA 
were designed according to cDNA sequences provided by the 
GeneBank (WWW2.ncbi.nlm.govygeneb3nk/query). The primers 
were selected to amplify 150-250 bp target genes and the PCR 
products from each target gene were designed to have a different 
size (Table 1). The reverse primers were synthesized with fluores- 
cein molecules covalently altached to the 5' end (BRL, Ro^^Je, 
MD) Twenty-five microliters of PCR reactions for QMF-PCR 
contained primers (20 uM each), cDNA corresponding lo 50 ng or 
total RNA, dNTPs, and reaction buffer. The reactions were am- 
plified for 21 cycles al 94 *C for 1 min, 57 *C Tor 2 mm; and 72 "G 
for 2 min. Five microliters of QMF-PCR reactions were mixed wiib 
an equal volume of sequencing gel loading buffer, denatured, and 
aliquots were elecirophoresed on an AB1 373A automated sequencer 
(Applied Biosysiems, Foster City. CA) using a matrix specific for 
fluorescein (Fig. I A). 

Automated sequencer gels were run for 6 h at 30 W using 
Genescan software (ABI. Foster City, CA). Lane assignments and 
areas of the peaks conespbnding lo fluorescent peaks were ^V*** 
and quantitated by the Genescan software using PhotorouHipbcr 
tube (PMT) voltages (Fig. IB). All experiments were done in 
triplicate and the results presented as means and standard devia- 
tions (SD). 



Antibodies and immunohistochemistry 

Five-micron sections were obtained from formalin-fixed, paraffin- 
embedded tissue blocks. They were deparaffimzed and hyd'atcd 
with graduated cthanols. Slides were then mtcrowaved m 



T»We 1 Summary of PCR 
primer sequences and PCR 
products 



Primer Sequence 



Taigel gene 



Size of PCR product (bp) 



1 Forward cccacactgtECCcaiclacg 

2 Reverse gcllciccttaaiglcacgc 

3 Forward caaagtgggcacagatggigtg 

4 Reverse clgcttggallccagaaacgg 

5 Forward gcaagagccagclctgmagc 

6 Reverse acttgagclgailcagtctctgg 

7 Forward gatggacaaclalgagccagg 

8 Reverse tataccaggcgggaagcalcg 

9 Forward llctggtgccactaccacagc 
10 Reverse tgcatgcccicaictaatgtc 



0-Actin 
E-cadherin 
P-cadherin 
at- Catenin 
0-Catenin 



170 
190 
220 
182 
218 
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Fig- 1 A Genescan image of 
QMF RT-PCR or five paired 
non-neoplastic and neoplastic 
primary prostate cultures. 
cDNA corresponding to 50 ng 
of total RNA was subjected to 
21 cycles of PCR, and all 
reactions were done in tripli- 
cate. KNA of both E-cadhcrin 
and P-cadherin showed moder- ' 
ate- to-marked reduction in pri- 
mary neoplastic cultures in four 
. of five cases (Cases I, 2, 4, and 
. 5). B Representative automated 
sequencer traces (AB1 373A) 
from 21 cycles of QMF 
RT-PCR using cDNA from a 
paired non-neoplastic (top 
panel) and neoplastic (bottom 
panel) primary prostate cultures 
(Case 2), and copy number of 
PCJl products (at-actin, 
E-cadherin, and P-cadherin) 
, was calculated using pealt areas. 
Neoplastic culture showed 
markedly decreased expression 
of both E-cadherin and 
P-cadberin 
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curate buffer (pH 6.0) for antigen leirieval. The avidin-biotin 
complex method for .mmunohistqcbemislry was utilized from 
Vector Laboratories (Burlingamc. Calif., USA). The E-cadherin 
P-cadnenn a-catemn, and ^catenin antibodies, all mouse mono-' 
clonal antibodies, were purchased from Transduction Laboratories 
(Lexington. Ky., USA). 

Evaluation of immunosiaining 

Tbe percentage of positive cells, intensity of the staining, and cel- 
lular localization of the staining were examined by two independent 
observers using normal prostate epithelium fiom the same speci- 
men as an internal control. Tbe intensity of the signal wds graded 
as strong, moderate, weak, and negative. The staining pattern of 
the tumor was compared with that of normal epithelium from the 
same specimens. 



Results 

Expression of E-cadhcrin and P-cadherin 

Since many prostate cancers are known to be histolog- 
ically heterogeneous, adjacenl H&E-stained sections 
of Ibe tissue fragments sent lo culture were reviewed 



loussess tissue homogeneity and lo rule out the presence 
of other diseases. The paired primary cultures for this 
study were selected based on the following histologic 
features: (I) non-neoplastic tissue showed no cancerous 
foci or high-grade prostatic intraepithelial neoplasia 
(PIN); (2) neoplastic sections contained less than 5% 
non-cancerous epithelium. Seven out of 38 pairs of 
cultures met the selection criteria and were included in 
this study. Histologically, all seven cases were moder- 
ately differentiated adenocarcinomas, with GJeason 
scores ranging from 5 to 7 (median = 6). In six of these 
cases, adjacent sections of tumor contained no benign 
prostate epithelium, and in one case (specimen 4), there 
was less than 5% non-cancerous epithelium. Prostate 
stroma) cells express HGF, whereas the epithelial cells 
express c-m'et, the receptor for HGF [17, 20]. The cases 
included, in this study showed no detectable HGF 
expression after 2i cycles of QMF RT-PCR (data not 
shown). This indicates there was do significant stromal 
cell contamination in the current epithelial cultures. 

QMF-PCR is an accurate method of measuring the 
relative levels of mRNA in small tissue samples [41). In 
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: this study, we used this method to quantitale the mRNA 
levels of E-cadherin and P-cadherin t relative to 0-actin. 
All the RNA samples contained no DNA contamina- 
tion, as shown by the absence of automated signals when 
RT was omitted from the RT-PCR reactions. We ob- 
served the expected linear increases in 0-aclin, E-cadh- 
erin, and. P-cadherin signal intensities between cycle 
numbers 18 and 24, with a cDNA input corresponding 
to . 50 ng of total RNA. The ratios of 0-actin to 
E-cadherin and P-cadherin were constant, as was the 
E-cadherin/P-cadherin ratio. These results indicated that 
the target genes were consistently amplified in the reac- 
tions. Twenty-one cycles of PCR were chosen for all 
subsequent experiments. In addition, the raw sequencer 
quantitation of peak areas for 0-actin using the same 
cDNA input was similar among the paired primary 
prostate cultures (data not shown). 

When compared with the paired non-neoplastic pri- 
mary cultures and normalized with the /^actin internal 
controls, four of the neoplastic cultures showed marked 
(>85%) reduction of E-cadherin mRNA levels, with 
one case showing no detectable E-cadherin mRNA. The 
other three neoplastic cultures showed mild-to-moderate 
reductions (Table 2). 

Six of seven neoplastic cuUutcs showed moderately 
to markedly decreased P-cadherin mRNA levels when 
compared with non-neoplastic cultures. Interestingly, 



the four cases showing marked reduction in levels of 
E-cadberin mRNA, and also demonstrated significant 
losses of P-cadherin mRNA. The case with no delectable 
E-cadherin mRNA also demonstrated near total loss 
of P-cadherin mRNA (Case 1). Another case (Case 3), 
with only rmld reduction in E-cadherin mRNA, showed 
no significant change in Ptcadherm mRNA. Genetic 
variations among the patients were evident in trie 
marked differences in the baseline levels of E-cadherin 
and P-cadherin expression seen in the non neoplastic 
primary prostate cultures. 



Expression of cc-catenin and 0-cateniri 

The highly conserved intracytoplasmic domain of 
the cadherins interacts with a- and /^catenins, with the 
catenins serving as a link between the cadherins and the 
microfilaments of the cytoskcletoh. Six cases of neo- 
plastic cultures showed mild-to-moderate reductions in 
a-catenin mRNA levels, ranging from 26 to 62%. These 
cases also demonstrated more severe reductions of 
0-catenin mRNA levels and generally correlated with 
the changes of cadherins in the same specimen (Table 3). 
Interestingly, the case with no significant change of 
P-cadherin mRNA and only a mild Joss of E-cadherin 
mRNA also showed no change in mRNA levels of both 



Table 2 E-cadherin and Pterin expression in paired nonneoplastic and neoplastic primary prostate cultures derived from prosta- 
tecloroy specimens — 



Case 



E-cadherin" 



Percentage 
loss in lumor b 



P-cadherin' 



Percentage 
loss in tumor b 



Non-neoplastic 



Tumor 



Non-neoplastic 



Tumor 



71.0 ± 2.2 

27.4 ± 6.4 
18.0 ± 0.2 

93.5 ± IZO 
41.5 i 3 0 

56.7 ± 1.1 

53.8 ± 6.1 



3.7 ± 0.3 
12.6 ± 1.9 

5.4 ± 0.9 
24.4 ± 4.0 

8.1 ± 10 
22.2 ± 0.7 



100 
86 
30 
94 
41 
86 
59 



65.7 ± 2.0 

53.4 ± 6.5 
88.8* ± 6.5 

87.8 ± 7.0 

30.5 ± 1.4 
71.8 ± 2.9 
62.7 ± 3.5 



0.8 ± 0.1 
17.8 ± 1.3 

86.4 db 9.5 

25.5 ± 2.5 
18.1 ± Z4 
17.1 ± 22 
19.4 ± 22 



99 
67 
3 
71 
41 
76 
69 



/ JJ.a x v. i v.. — ^ . . 

■ Level or E-cadherin and P-cadherin are normalized with fractin from ihe same sample: (E-cadherin or R-cadberinWacm x 100 

b Percentage loss in tumor primary culture: l(N - T)/NJ x 100% 
' ND, not delectable 



Table 3 «-Catcnin and /J-catenin expression in paired non-neoplastic and neo] 
lectomy specimens . - 



plastic primary prostate cultures derived from prosta- 



Case 



Percentage loss 
in tumor* . 



/J-catemn" 



Percentage 
loss in tumor 



Non-neoplastic 



Tumor 



Non-neoplastic 



Tumor 



45. J ± 7.1 
49.2 ±1.6 

48.0 ± 5.9 

61.1 i 2.5 

58.2 ± 1.8 
46.7 i 4.7. 
43.2 ± 2.6 



17.3 dt 0.5 
36.2 i 4.0 
47.8 ± 9.0 
40.8 i 2.6 
27.8 ± 1.6 
33.6 i 4.7 

26.4 i 1.7 



62 
26 
< | 
33 
52 
28 
39 



19.7 ± 3.6 

21.7 ± 1.5 
40.1 ± 5.3 

36.8 ± 18 
29.8 i 2.0 
31.5 i 4.2 
22.3 i 1.5 



ND* 
3.9 ± 0.2 
38.9 ± 6.2 

22.1 ± 19 

15.2 ± I I 

13.5 ± 1.6 

11.6 i 1.4 



100 
82 
3 
40 
49 
57 
48 



•Level ofa-caiemn and pVaienin aic normalized with fi-zc\mhom the same sample: (calenin/actin) x 100 
b Percentage loss in tumor primary cuhure: |(N - T)/N) x 100% 
C ND, not delectable 
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a- and 0-catenins. Moderate interspecimen variation was 
observed in the baseline expression of a-catenin and 
/r-catemn mRNA levels in the non-neoplastic cultures. 

Immunohistochemical studies of cadherins and catenins 

Jn benign prostate tissue, E-cadherin was, in all cases, 
uniformly localized to the membranes of luminal glan- 
dular epithelial cells, predominantly at cell-cell junctions 
(Fig. 3A). One case of prostate cancer showed complete 
negative staining for E-cadherin (Fig. 3B), and the re- 
maining six cases demonstrated reduced immunoslaining 
/? E ^ dhcrm ' wiln 25 ~ 75 % of cancer cells positive 
(Mg- 3Q. The cancerous glands generally showed re- 
duced signal intensity and an altered heterogeneous 
staining pattern, which included focal cytoplasmic stain- 
ing and reduced membranous staining (Fig. 3C Table 4) 
Benign prostate tissue, showed uniform basal cell 
immunoreactivily for P r cadherin, with principally cyto- 
plasmic and focal membranous pattern staining. The 

T»Me4 Immonohistochcroidl expression of cadherins and cate- 
mm ,n prostatectomy specimens corresponding to primary proslale 



benign, glandular non-basal epithelial cells and stromal 
cells were negative for P-cadherin (Fig. 3D); P-cadherin 
immunoreactivily was completely absent in two cases of 
prostate cancer (Fig. 3E), and the remaining five cases 
showed variable focal positivity, which was predomi- 
nantly cytoplasmic (Fig. 3F). This focal P-cadherin 
immunostajning positivity was confirmed by staining 
multiple sections and by using different monoclonal 
antibodies: Jn some cases, serial sections also appeared 
to show immunostaining for both E- and P-cadherins 
with the same neoplastic cells. 

In all cases of benign prostate tissue, a- and ^calenin 
protein expression showed strong homogeneous staining 
of the luminal glandular epithelium and the basal cells 
In a pattern similar to that of normal E-cadherin the a- 
and ^catenins were localized predominately at luminal 
epithelial cell-cell borders (Fig. 2A, C). In all cases of 
prostate cancer, there was a mild-to-moderate reduction 
in staining for both catenihs with 50-7i% of cells 

^iv-r^r 6 ,ended '° * 



Case E-cadherin P-cadherin 



a-catenin 0-catenin 



I 

2 
3 
4 
5 
6 
7 



+ + 
+ + -f 

I + + 
+ + 
+ + 



+ + 

+ 
+ 
+ 



+ + 
+ + 
+ + +• 
+ + 
+ + + 
+ ■* + 
+ + 



+ + 
+ + 
+ + + 
+ + + 
+ + + 
+ -f + * 
+ + + 



- 5 £%? V \ + > <25% P™ 1 ^ + + . ^50% positive; .+ + + 
50-75 /. pos»hve; + + + + > 7J% ^j,^ 

Fig. iA-D lromunohistochemi- 

pl staining of a- and 0-catenm 

in non-neoplastic prostate and 

in prostate adenocarcinoma. 

Qnginat magnification xII5. 

A, C Normal membranous 
expression of a-catenin and 
/f-catenin in non-neoptaslic 
prostatic epithelium (Case I). 
B Decreased immunostaining ' 
for o-calenin in prostate . 
adenocarcinoma (Case I), 
p Decreased and heterogenous 
immunoslaining for 0-caienin 
in prostate adenocarcinoma 
(Case 1) 



Discussion 

In this study, we observed a coordinated down-regula- 
tion of the expression in the genes involved in the 
cadhenn and catenin mediated cc!M;ell pathway at the 
mRNA level. The protein levels, as demonstrated by 
the immunohistochemical studies on the corresponding 
tissue sections, were generally consistent with the 
mRNA data as well as with that reported in the litera- 
ture [18, 33, 34, 38, 39]. 

E-cadherin showed the most consistent loss of ex- 
pression at both the mRNA level and the protein level. 
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Fig. 3 A Iromunohisiochemical 
staining of E- and P-cadherin in 
nonneoplastic prostate and in 
prostate adenocarcinoma. 
Original magnification xl 15- ' 
A Normal membranous ex- 
pression of E-cadherin in non- 
neoplastic prostatic epithelium 
(Case )). B,C Complete 
negative (Case I) and focal 
heterogeneous (Case 3) iirimu- 
nostaihing for E-cadherin in 
prostate adenocarcinoma. 
D Norma) continuous basal 
layer expression or P-cadherm 
in a portion of non-neoplastic 
prostatic epithelium (Case 1). 
E, F Complete negative . 
(Case I) and mild focal hetero- 
geneous (Case 3) irorounostain- 
ing for P-cadherin in prostate 
adenocarcinoma 




LOH at chromosome J 6q, where both E-cadherin and 
P-cadherin are located, occurs in up to 30% of prostate 
cancers. Four of our cases showed a greater than 50% 
reduction in mRNA, ranging from 87 to 100%, sug- 
gesting thai mechanisms in addition to LOH may play a 
role in the reduction of E-cadherin mRNA levels below 
that of the 50% predicted by LOH alone. 

Previous studies have suggested, that P-cadherin 
could serve as a specific marker, for basal cell differen- 
tiation and was not expressed in prostate cancer, 
although a recent study has shown focal P-cadherin 
expression in some prostate tumors [18). In this study, 
we demonstrated P-cadherin mRNA in six of seven 
neoplastic cultures, although it was significantly reduced 
in all six cases. It is, impossible to rule but the possibility 
that some of the P-cadherin mRNA expression may 
have resulted from potential, contamination by small 
numbers of non-neoplastic basal cells admixed with the 
neoplastic cells within the culture material. However, we 
favor the interpretation that low levels of P-cadherins 
are expressed in cultured tumor cells, as well as weakly 
in some tumors in vivo owing to disruption of normal 
gene regulation. This interpretation is favored by the 
presence of focal immunostaining for P-cadherin protein 
in histologically neoplastic cells in five of seven cases. 
The histologic selection criteria (requiring minimal to no 
benign prostate glands) should also have minimized 
major contamination. Despite the presence of mRNA in 
primary tumor cultures and focal positive immuno- 



staining, P-cadherin immunostaining may still serve as a 
useful basal cell marker because the staining pattern was - 
distinctly abnormal in the cases where it was focally 
present. 

In this study, the levels of a- and 0-catenin mRNA 
were also lower than normal in six of seven cases, 
though the reductions were relatively less than those or 
cadberins, especially for the ct-calenins. Calenin protein 
expression was generally moderate to weak by immu- 
nohistochemistry and showed a heterogeneous cyto- 
plasmic and weak membranous staining pattern. The 
immunopositivity was generally similar in most cases, 
with 25-75% cells showing positivily. The results of the 
immunohistochemical studies were generally consistent 
with the mRNA data; although not in all cases (e.g.. 
Case I); perhaps due to tumor heterogeneity. 

In this study, we also observed a coordinated down- 
regulation of E-cadherin and the catenins in most cases; 
this was most observable at the mRNA level. The cad- 
herins are. tightly regulated during embryogenesis and 
appear to serve the need for precise spatio-temporal 
regulation. The promoter sequences of both E-cadhenn 
and P-cadherin have been cloned and functionally ana- 
lyzed |2, 5, 10, 151. Both promoters have similar regu- 
latory elements, such as GC-rich regions and CCAAT 
boxes. Although the two promoters share similar se- 
quences, in vitro binding studies suggest that the two 
promoters are regulated by different transcriptional 
factors | II]. The (issue specificity of these promoters 
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appears to be derived by different combinations of a 
relatively few factors common to many types of tissues 
and does not appear to be derived from transcription 
factors specific for each type of tissue. In our study, the 
degree of loss of expression of a- and 0otenins at the 
mRNA level was correlated with reductions in the levels 
of E-cadherin expression. Although the exact mecha- 
nism of this coordinated down-regulation is not known, 
the coordinated pattern supports the hypothesis that 
loss or alteration of some regulatory factors occurs 
during prostate tumor progression. PossibJe mechanisms 
include transcriptional factor alterations or hyper- 
methylauon of the promoter region |K 43]. Limited 
information is available on the transcriptional regula- 
tion of the catenins, but it is possible that similar 
mechanisms may play a role. 

In summary, this is the first comparative study of the 
expression of the genes, involved in the cadherin-medi- 
ated cell-cell adhesion pathway at the mRNA level using 
paired neoplastic and non-neoplastic primary cultures 
derived from, prostatectomy specimens. Our results 
indicate that (I) there is a marked patienMo-patient 
variation in the normal levels of the cadherins and 
catenms; (2) mRNA levels of E-cadherin as well as 
catenms are significantly reduced in some prostate can- 
cer primary cultures, and the reduction tends to be to the 
same degree m each tumor, suggesting a defect in a 
regulating mechanism common to all of these genes* (3) 
P-cadherin appears to be present at both the mRNA 
level and the protein level in some prostate cancers. This 
coordinated down-regulation of E-cadherin and catenin- 
mediated. adhesion pathways may play a crucial role in 
tumor pathogenesis arid metastasis. 
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Down-regulation of prostate-specific antigen expression by finasteride 
through inhibition of complex formation between androgen receptor and 
steroid receptor-binding consensus in the promoter of the PSA eene in 
LNCaP cells. 

WangLG. LuO^ 

Department of Medicine, New York University, Manhasset I I 030, USA. 

As a specific competitive inhibitor of 5alpha-reductase, an intracellular enzyme that 
converts testosterone to dihydrotestosterone, finasteride is being extensively used for the 
treatment of benign prostatic hyperplasia and in experimental settings for prostate cancer. 
In this study, we showed that finasteride markedly inhibited prostate-specific antigen 
(PSA) secretion and expression. The promoter of the PSA gene contains several well- 
known cis-regulatory elements. Among them, steroid receptor-binding consensus (SRBC) 
has been identified as a functional androgen-responsive element. Our previous study 
showed that PSA was not only present in conditioned medium of the PSA-positive 
LNCaP cells but was also detectable in small amounts in PSA-negative cell lines, PC-3 
and DU-I45 (L. G. Wang et a!:, Oncol. Rep., 3: 91 1-917, 1996). A strong correlation 
between binding of nuclear factors to SRJBC and the level of PSA present in the 
conditioned medium and cell extracts was found in these three cell lines, whereas no such 
correlation with binding was obtained using Spl oligonucleotide as a probe. Binding of 
LNCaP cell nuclear proteins to SRBC was diminished when the cells were exposed to 25 
microM finasteride, at which concentration 50% of both PSA mRNA and protein were 
inhibited. As a major component of DNA-protein complexes, the level of androgen 
receptor was dramatically decreased in the cells treated with finasteride. Our data indicate 
that inhibition of complex formation between SRBC and nuclear proteins due to the 
remarkable decrease in the level of androgen receptor plays a key role in the down- 
regulation of PSA gene expression by finasteride in LNCaP cells. 
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Expression of calcyclin iu human melanocytic lesions. 

Wetcrman MA , van Muijen GN , Blocmcrs HP, Rmter DJ . 

Department of Biochemistry, University of Nijmegen, The Netherlands. 

. When comparing two subsequent stages of melanocytic tumor progression we identified 
calcyclin as a new potential progression marker, the expression of which was correlated 
with metastatic behavior of various human melanoma cell lines in nude mice. In this 
study, we describe a good correlation between RJNA and protein levels in the xenografts 
of these cell lines and extended these experiments to a panel of 120 routinely processed 
human melanocytic cutaneous lesions. Northern blot analysis demonstrated that calcyclin 
RNA expression was elevated in melanoma metastases as compared to several types of 
nevoceltular nevi. Calcyclin staining using a specific polyclonal antiserum showed a 
more complex pattern. A stronger staining in a higher percentage of positive cells was 
observed in thick primary melanoma (> or = 1.5 mm) as compared to thin primary 
melanoma (< 1.5 mm). Calcyclin expression was also present in a higher percentage of 
cells showing a stronger staining in melanomas with higher Clark levels (> II) 
corresponding to the vertical growth phase of primary melanomas. Protein expression in 
nevocellular nevi was confined to the dermal part and was highest in the lower parts of 
the dermis. Remarkably, dysplastic nevi (atypical moles), potential precursors of 
melanoma, did not show any expression at all, either in junctional or dermal parts. 
Confinement of the expression to the dermal part of nondysplastic nevi and primary 
melanomas may reflect interactions with the microenvironment of the reticular dermis 
that occurs with vertical growth! 

PMID; 8261423 [PubMed - indexed for MEDLINE] 
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Estrogen regulation of the cytochrome P450 3 A subfamily in humans. 

Williams ET, Lcyk M Wrighton SA. Davies PJ , Loose PS, Shipley GL. Strobel HW . 

Department of Biochemistry, Medical School, University of Texas Health Science Center 
atHouston, 643 1 Fannin, MSB 6.200, Houston, TX 77030, USA. 

This study examines the possible role of estrogen in regulating the expression of the 
humanCYP3A subfamily: CYP3A4, CYP3A5, CYP3A7, and CYP3A43. To accomplish 
this goal, mRNA was quantified from human livers and endometrial samples, and total 
CYP3A protein levels were evaluated by Western immunoblot analysis of the liver 
samples. The human endometrial samples were from premenopausal and postmenopausal 
women. The premenopausal endometrium was either in the proliferative or secretory 
phase, whereas for the postmenopausal endometrium samples, the women had been 
treated with either a placebo or estropipate, an estrogen substitute. After analyses 
CYP3A4mRNA was shown to have lower hepatic expression in females than in males. 
In the endometrium, CYP3A4 and CYP3 A43 are down-regulated by estrogen, whereas 
CYP3A5 is expressed at higher levels during the secretory phase. CYP3A7 was not 
detected in the endometrium. In addition, the CYP3A subfamily showed increased 
mRNA expression in the liver as age increased. The expression levels of total CYP3A 
protein and total CYP3A mRNA showed good correlation. Despite apparent regulation of 
CYP3 A4 mRNA expression by estrogen, the effects of estrogen may be overshadowed . 
by additional regulators of gene expression. 
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Severely decreased MARCKS expression correlates with ras reversion but 
not with mitogenic responsiveness. 

Wojt aszck PA , Stumpo DJ . Blackshcar PJ . Macara IG . 

Department of Pathology, University of Vermont College of Medicine, Burlington 
05405. 

Phorbol ester-inducible phosphorylation of MARCKS, the '80-kDa' substrate of protein 
kinase C, was undetectable in several phenotypically dominant, non-transformed 
revertants independently derived from the ras-trans formed cell line NIH3T3 DTVras. 
Extremely low expression of MARCKS protein accounted for this apparent lack of 
phosphorylation. MARCKS-encoding mRNAlevels were correspondingly decreased 
relative to normal and ras-transformed cells in all four ras revertant cell lines studied: C- 
1 1 and F-2, derived by 5-azacytidine treatment and selection with ouabain; CHP 9CJ, 
derived by ethylmethahe sulfonate mutagenesis and selection with cis-hydroxy-L-pro'line; 
and 12- V3, derived by transfection with the human Krev-1 gene. However, re-expression' 
of MARCKS after transfection of a cloned MARCKS cDNA into the C-ll ras revertant 
cells was not sufficient to induce retransformation. In fact, no significant difference in 
sensitivity to mitogenic stimulation by phorbol esters was observed among several cell • 
lines expressing widely varying levels of MARCKS. This evidence argues against a 
direct role for MARCKS in mitogenic signaling. However, the strong correlation between 
attenuation of MARCIS expression and phenotypically dominant ras reversion suggests 
that a common negative regulatory mechanism might be responsible for both effects, 
presenting a potentially useful strategy for identifying factors involved in transducing the 
ras signal. 
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Abstract 
Background 

Prostate stem cell antigen (PSCA) is a recently defined homologue of the 
Thy-l/Ly-6 family of glycosylphosphatidylinositol (GPJ)-anchored cell 
surface antigens. The purpose of the present study was to examine the 
expression status of PSCA protein and rnRNA in clinical specimens of human 
prostate cancer (Pea) and to validate it as a potential molecular target for 
diagnosis and treatment of Pea. 

Materials and Methods 

Immunohistochemical (1HC) and in situ hybridization (ISM) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections 
from 20 benign prostatic hyperplasia (BPH), 20 prostatic intraepithelial 
neoplasm (PIN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
independent prostate cancers. The level of PSCA expression was 
semiquantitative^ scored by assessing both the percentage and intensity of 
PSCA- positive staining cells in the specimens. Then compared PSCA 
expression between BPH, PJN and Pea tissues and analysed the correlations 
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of PSCA expression level with pathological grade, clinical stage and 
progression to androgen-independence in Pea. 

Results 

■ In BPH and low grade PIN, PSCA protein and mRNA staining were weak or 
negative and less intense and uniform than that seen in HGP1N and- Pea 
There were moderate to strong PSCA protein and mRNA expression in 8 of 
11 (72.7%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by 
JHC and ISH analyses, with statistical significance compared with BPH 
(20%) and low grade PIN (22.2%) samples (p < 0.05, respectively) The 
expression level of PSCA increased with high Gleason grade, advanced stage 
and progression to androgen-independence (p < 0.05, respectively). In 
addition, IHC and ISH staining showed a high degree of correlation between 
PSCA protein and mRNA overexpression. 

Conclusions 

Our data demonstrate that PSCA as a new cell surface marker is 
overexpressed by a majority of human Pea. PSCA expression correlates 
positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and 
androgen-independence, and speculatively with prostate carcinogenesis. 
PSCA protein overexpression results from upregulated transcription of PSCA 
mRNA. PSCA may have prognostic utility and may be a promising molecular 
target for diagnosis and treatment of Pea. 
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Abstract Estate cancer (Pea) is the second leading cause of cancer-related death in 
Introduction t American men and is becoming a common cancer increasing in China, 
neteriais end ™«\h G te Des P lte recently great progress in the diagnosis and management of 

,ocall2ed disease, there continues to-be a need for new diagnostic markers 
[><cusv,n L h3t IT aCCurate,y ^criminate between indolent and aggressive variants of 
Coimnr™ . n ./-~ "r. u a ' S ° continues to be a need for the identification and 

•^,ny .n . characterization of potential new therapeutic targets on Pea cells Current 

References diagnostic and therapeutic modalities for recurrent and metastatic Pea have 
been limited by a lack of specific target antigens of Pea. 

Although a number of prostate-specific genes have been identified (i e 
prostate specific antigen, prostatic acid phosphatase, glandular kallikrein 2) 
the majority of these are secreted proteins not ideally suited for many 
immunological slrategies. So, the identification of new cell surface antigens 
is critical to the development of new diagnostic and therapeutic approaches 
to the management of Pea. 

Reiter RE et al reported the identification of prostate stem cell antigen 
(PSCA), a ceil surface antigen that is predominantly prostate specific The 
PSCA gene encodes a 1 23 amino acid glycoprotein, with 30% homology to 
stem eel! antigen 2 (Sea 2). Like Sca-2, PSCA also belongs to a member of 
the Thy-i/Ly-6 family and is anchored by a glycosylphosphntidy 'inositol 
(GPJ) linkage. mRNA in situ hybridization (JSH) localized PSCA expression in 
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normal prostate to the basal cell epithelium, the putative stem cell 
compartment of prostatic epithelium, suggesting that PSCA may be a 
marker of prostate stem/progenitor cells. 

In order to examine the status of PSCA protein and mRNA expression in 
human Pea and validate it as a potential diagnostic and therapeutic target • 
for Pea, we used immunohistochemistry (IHC) and in situ hybridization 
(1SH) simultaneously, and conducted PSCA protein and mRNA expression 
analyses in paraffin-embedded tissue specimens of benign prostatic 
hyperplasia (BPH, n = 20), prostate intraepithelial neoplasm (PIN, n = 20) 
and prostate cancer (Pea, n = 48). Furthermore, we evaluated the possible 
correlation of PSCA expression level with Pea tumorigenesis, grade, stage 
and progression to androgen-independence. 
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Tissue samples 

All of the clinical tissue specimens studied herein were obtained from 80 
patients of 57-84 years old by prostatectomy, transurethral resection of 
prostate (TURP) or biopsies. The patients were classified as 20 cases of BPH, 
20 cases of PIN, 40 cases of primary Pea, including 9 patients with recurrent 
Pea and a history of androgen ablation therapy (orchiectomy and/or 
hormonal therapy), who were referred to as androgen-independent prostate 
cancers. Eight specimens were harvested from these androgen-independent 
Pea patients prior to androgen ablation treatment- Each tissue sample was 
cut into two parts, one was fixed in 10% formalin for IHC and the other 
treated with 4% paraformaldehyde/0. 3 M PBS PH 7.4 in 0.1% DEPC for 1 h 
for 1SH analysis, and then embedded in paraffin. All paraffin blocks 
examined were then cut into 5 jim sections and mounted on the glass slides 
specific for IHC and 1SH respectively in the usual fashion. H&E-stained 
section of each Pea was evaluated and assigned a Gleason score by the 
experienced urological pathologist at our institution based on the criteria of 
Gleason score [?.). The Gleason sums are summarized in Table :i . Clinical 
staging was performed according to Jewett-whitmore-prout staging system, 
as shown in Table 2. In the category of PIN, we graded the specimens into 
two groups, i.e. low grade PIN (grade I - II) and high grade PIN (HGPIN, 
grade 111) on the basis of literatures [3,4]. 

Immunohistochemical (IHC) analysis 

Briefly, tissue sections were deparaffinized, dehydrated, and subjected to 
microwaving in 30 mmol/L citrate buffer, PH 6.0 (Boshide, Wuhan, China) in 
a 900 W oven for 5 min to induce epitope retrieval. Slides were allowed to 
cool at room temperature for 30 min. A primary mouse antibody specific to 
human PSCA (Boshide, Wuhan, China) with a 3 :100 dilution was applied to 
incubate with the slides at room temperature for 2 h. Labeling was detected 
by sequentially adding biotinylated secondary antibodies and strepavidin- 
peroxidase, and localized using 3,3'diaminobenzidine reaction. Sections 
were then counterstained with hematoxylin. Substitution of the primary 
antibody with phosphate-buffered-saline (PBS) served as a negative- 
staining control. 
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mRNA in situ hybridization (ISH) 

Five-pm : thick tissue sections were deparaffinized and dehydrated, then 
digested in pepsin solution (4 mg/ml in 3% citric acid) for 20 min at 37.5°C, 
and further processed for ISH. Digoxigenin-labeled sense and antisense 
human P5CA RNA probes (obtained from Boshide, Wuhan, China) were 
hybridized to the sections at 48°C overnight. The posthybridization wash 
with a high stringency was performed sequentially at 37°C in 2 x standard 
saline citrate (SSC) for 10 min, in 0.5 x SSC for 3 5 min and in 0.2 x SSC 
for 30 min. The slides were then incubated to biotinylated mouse anti- 
digoxigenin antibody at 37.5°C for 1 h followed by washing in 1 x PBS for 
20 min at room temperature, and then to strepavidin-peroxidase at 37.5°C 
for 20 min followed by washing in 1 x PBS for 1-5 min at room temperature. 
Subsequently, the slides were developed with diaminobenzidine and then 
counterstained with hematoxylin to localize the hybridization signals. 
Sections hybridized with the sense. control probes routinely did not show any 
. specific hybridization signal above background. All slides were hybridized 
with PBS to substitute for the probes as a negative control. 

Scoring methods 

^ To determine the correlation between the results of PSCA immunostaining 
and mRNA in situ hybridization, the same scoring manners are taken in the 
present study for PSCA protein staining by IHC and PSCA mRNA staining by 
ISH. Each slide was read and scored by two independently experienced 
urological pathologists using Olympus BX-41 light microscopes, the 
evaluation was done in a blinded fashion. For each section, five areas of 
similar.grade were analyzed semiquantitative^ for the fraction of cells 
staining. Fifty percent of specimens were randomly chosen and rescored to 
determine the degree of interobserver and intraobserver concordance. There 
was greater than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopically was graded on a 
scale of 0 to 3+ with 3 being the highest expression observed (0, no 
stainmg; 1 + , mildly intense; 2+, moderately intense; 3-f , severely intense). 
The staining density was quantified as the percentage of cells staining 
positive for PSCA with the primary antibody or hybridization probe, as 
follows: 0 = no staining; 1 = positive staining in <25% of the sample; 2 = 
positive staining in 25%-50% of the sample; 3 = positive staining in >50% 
of the sample, intensity score (0 to 3+) was multiplied by the density score 
(0-3) to give an overall score of 0-9 | .1,5]. Jn this way, we were able to 
differentiate specimens that may have had focal areas of increased staining 
from those that had diffuse areas of increased staining [6]. The overall score 
for each specimen was then categorically assigned to one of the following 
groups: 0 score, negative expression; 1-2 scores, weak expression; 3-6 
scores, moderate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expression in BPH, PIN and 
Pea tissues were compared using the Chi-square and Student's Mest. 
Univariate associations between PSCA expression and Gleason score, clinical • 
stage and progression to androgen- independence were calculated using 
Fisher's Exact Test. For all analyses, p < 0.05 was considered statistically 
significant. 



http:/Av\w.wjso.com/conlent/2/J/I3 



5/2.V2006 



World Journal of Surgical One 



y | Full text | Prostate stem cell antigen (PS 



expression in ... Page 5 of 12 



Outline Results 



Abstract 
Introduction 
f^sterials and methods 
Results 
Discussion 
Competing inierests 
References 

Figures 



>-igure 1 

Representatives of PSCA 
)HC and ISH staining in Pea 
(A. IHC staining, B. ISH 
staining, x200 
magnification) 



PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed weakly in individual 
samples of BPH. Some areas of prostate expressed weak levels (composite 
score 1-2), whereas other areas were completely negative (composite score 
0). Four cases (20%) of BPH had moderate expression of PSCA protein and 
mRNA (composite score 4-6) by IHC and ISH. In 2/20 (10%) BPH 
specimens, PSCA mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 2) in one and 
negative (composite score 0) in the other. PSCA expression was localized to 
the basal and secretory epithelial cells, and prostatic stroma was almost 
negative staining for PSCA protein and mRNA in all cases examined. 

PSCA expression in PIN 

In this study, we detected weak or negative expression of PSCA protein and 
mRNA (^2 scores) in 7 of 9 (77.8%) low grade PIN and in 2 of II (18.2%) 
HGPIN, and moderate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 11 (45.5%) HGPIN. One HGPIN with moderate PSCA mRNA 
expression (6 score) was found weak staining for PSCA protein (2 score) by 
IHC. Strong PSCA protein and mRNA expression (9 score) were detected in 
the remaining 3 of 11 (27.3%) HGPIN. There was a statistically significant 
difference of PSCA protein and mRNA expression levels observed between 
HGPIN and BPH (p < 0.05), but no statistical difference reached between 
low grade PIN and BPH (p > 0.05). 



PSCA expression in Pea 



In order to determine if PSCA protein and mRNA can be detected in prostate 
cancers and if PSCA expression levels are increased in malignant compared 
with benign glands, Forty-eight paraffin-embedded Pea specimens were 
analysed by IHC and ISH. It was shown that 19 of 48 (39.6%) Pea samples 
stained very strongly for PSCA protein and mRNA with a score of 9 and 
another 21 (43.8%) specimens displayed moderate staining with scores of 
4-6 (Figure J). In addition, 4 specimens. with moderate to strong PSCA 
mRNA expression (scores of 4-9) had weak protein staining (a score of 2) 
by IHC analyses. Overall, Pea expressed a significantly higher level of PSCA 
protein and mRNA than any other specimen category in this study (p < 
0.05, compared with BPH and PIN respectively). The result demonstrates 
that PSCA protein and mRNA are overexpressed by a majority of human 
Pea. 



Correlation of PSCA expression with Gleason score in Pea 

Using the semi- quantitative scoring method as described in Materials and 
Methods, we compared the expression level of PSCA protein and mRNA with 
Gleason grade of PC3, as shown in Table .1. Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-differentiation, 5-7 scores = 
moderate-differentiation and 8-10 scores = poor-diflerentialion [/]. 
Seventy-two percent of Gleason scores 8- JO prostate cancers had very 
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strong staining of PSCA compared to 21% with Gleason scores 5-7 and 17% 
with 2-4 respectively, demonstrating that poorly differentiated Pea had 
significantly stronger expression of PSCA protein and mRNA than 
moderately and well differentiated tumors (p < 0.05). As depicted in Figure 
1, IHC and ISH analyses showed that PSCA protein and mRNA expression in 
several cases of poorly differentiated Pea were particularly prominent, with 
more intense and uniform staining. The results indicate that PSCA 
expression increases significantly with higher tumor grade in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 

With regards to PSCA expression in every stage of Pea, we showed the 
results in Table 2. Seventy-five percent of locally advanced and node 
positive cancers (i.e. C-D stages) expressed statistically high levels of PSCA 
versus 32.5% that were organ confined (i.e. A-B stages) (p < 0.05). The 
data demonstrate that PSCA expression increases significantly with . 
. advanced tumor stage in human Pea. 

Correlation of PSCA expression with androgen-, 
independent progression of Pea 

All 9 specimens of androgen-independent prostate cancers stained positive 
for PSCA protein and mRNA. Eight specimens were obtained from patients 
managed prior to androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the strongest staining 
category (score = 9), compared with three out of eight (37.5%) of patients 
with androgen-dependent cancers (p < 0.05). The results demonstrate that 
PSCA expression increases significantly with progression to androgen-' 
independence of human Pea. 

It is evident from the results above that within a majority of human prostate 
cancers the level of PSCA protein and mRNA expression correlates . 
significantly with increasing grade, worsening stage and progression to 
androgen-independence. 

Correlation of PSCA immunostaining and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the results of PSCA IHC 
staining with mRNA ISH analysis. Positive staining areas and its intensity 
and density scores evaluated by IHC were identical to those seen by ISH in 
79 of 88 (89.8%) specimens (18/20 BPH, .19/20 PIN and 42/48 Pea 
respectively). Importantly, 27/27 samples with PSCA mRNA composite 
scores of 0-2, 32/36 samples with scores of 3-6 and 22/24 samples with a 
score of 9 also had PSCA protein expression scores of 0-2, 3-6 and 9 
respectively. However, in 5 samples with PSCA mRNA overall scores of 3-6 
and in 2 with scores of 9 there were less or negative PSCA protein 
expression (i.e. scores of 0-4), suggesting that this may reflect 
posttranscriptional modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data demonstrate that 
the results of PSCA immunostaining were consistent with those of mRNA 
ISH analysis, showing a high degree of correlation between PSCA protein 
and rnRNA expression. 
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PSCA is homologous to a group of cell surface proteins that mark the 
earliest phase of hematopoietic development. PSCA mRNA expression is 
prostate-specific in normal male tissues and is highly up-regulated in both 
androgen- dependent and- independent Pea xenografts (LAPC-4 tumors). We 
hypothesize that. PSCA may play a role in Pea tumorigenesis and 
progression, and may serve as a target for Pea diagnosis and treatment. In 
this study, IHC and ISH showed that in general there were weak or absent 
PSCA protein and mRNA expression in BPH and low grade PIN tissues. 
However, PSCA protein and mRNA are widely expressed in HGPIN, the 
putative precursor of invasive Pea, suggesting that up- regulation of PSCA is 
an early event in prostate carcinogenesis. Recently, Reiter RE et al [1], 
using ISH analysis, reported that 97 of 118 (82%) HGPIN specimens stained 
strongly positive for PSCA mRNA. A very similar finding was seen on mouse 
PSCA (mPSCA) expression in mouse HGPIN tissues by Tran C. P et al [S]. 
These data suggest that PSCA may be a new marker associated with 
transformation of prostate cells and tumorigenesis. 



. We observed that PSCA protein and mRNA are highly expressed in a large 
percentage of human prostate cancers, including advanced, poorly 
differentiated, androgen-independent and metastatic cases. Fluorescence- . 
activated cell sorting and cdnfocal/ immunofluorescent studies 
demonstrated cell surface expression of PSCA protein in Pea cells [9]. Our 
IHC expression analysis of PSCA shows not only cell surface but also 
apparent cytoplasmic staining of PSCA protein in Pea specimens (Figure 1). 
One possible explanation for this is that anti-PSCA antibody can recognize 
PSCA peptide precursors that reside in the cytoplasm. Also, it is possible 
that the positive staining that appears in the cytoplasm is actually from the 
overlying cell membrane [5]. These data seem to indicate that PSCA is a 
novel cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression correlates with high . 
grade (i.e. poor differentiation), increased tumor stage and progression to 
androgen-independence of Pea. These findings support the original IHC 
analyses by Gu Z et al [9], who reported that PSCA protein expressed in 
94% of primary Pea and the intensity of PSCA protein expression increased 
with tumor grade, stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al (10], in which the 
significant association between high PSCA expression and adverse 
prognostic features such as high Gleason score, seminal vesicle invasion and 
capsular involvement in Pea was found. It is suggested that PSCA 
overexpression may be an adverse predictor for recurrence, clinical 
progression or survival of Pea. Hara H et al []]] used RT-PCR detection of 
■ PSA, PSMA and PSCA in ] ml of peripheral blood to evaluate Pea patients 
with poor prognosis. The results showed that among 58 PCa patients, each 
PCR indicated the prognostic value in the hierarchy of PSCA>PSA>PSMA RT- 
PCR, and extraprostatic cases with positive PSCA PCR indicated lower 
disease-progression-free survival than those with negative PSCA PCR, 
demonstrating that PSCA can be used as a prognostic factor. Dubey P et al 
|]2] reported that elevated numbers of PSCA + cells correlate positively 
with. the onset and development of prostate carcinoma over a long time 
span in the pi estates of the TRAMP and PTE N +/- models compared with its 
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normal prostates. Taken together with our present findings, in which P5CA 
is overexpressed from HGP1N to almost frank carcinoma, it is reasonable . 
and possible to use increased PSCA expression level or increased numbers 
of PSCA- positive cells in the prostate samples as a prognostic marker to 
predict the potential onset of this cancer. These data raise the possibility 
that PSCA may have diagnostic utility or clinical prognostic value in human 
Pea. 

The cause of PSCA overexpression in Pea is not known. One possible 
mechanism is that it may result from PSCA gene amplification. In humans, 
PSCA is located on chromosome 8q24.2 [1], which is often amplified in 
metastatic and recurrent Pea and considered to indicate a poor prognosis 
[13-15]. Interestingly, PSCA is in close proximity to the c-myc oncogene, 
which is amplified in >20% of recurrent and metastatic prostate cancers . 
[16,17]. Reiter RE et al [18] reported that PSCA and MYC gene copy 
numbers were co-amplified in 25% of tumors (five out of twenty), 
demonstrating that PSCA overexpression is associated with PSCA and MYC 
coamplification in Pea. Gu Z et al [9] recently reporteted that in 102 
specimens available to compare the results of PSCA immunostaining with - 
their previous mRNA ISH analysis, 92 (90.2%) had identically positive areas 
of PSCA protein and mRNA expression. Taken together with our findings, in 
which we detected moderate to strong expression of PSCA protein and 
mRNA in 34 of 40 (85%) Pea specimens examined simultaneously by. IHC 
and ISH analyses, it is demonstrated that PSCA protein and mRNA 
overexpressed in human Pea, and that the increased protein level of PSCA 
was resulted from the. upregulated transcription of its mRNA. 

At present, the regulation mechanisms of human PSCA expression and. its 
biological function are yet to be elucidated. PSCA expression may be 
regulated by multiple factors [13]. Watabe T et al [19] reported that 
transcriptional control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be regulated or 
mediated through cell-cell contact and protein kinase C (PKC) [20]. 
Homologues of PSCA have diverse activities, and have themselves been 
involved in carcinogenesis. Signalling through SCA-2 has been demonstrated 
to prevent apoptosis in immature thymocytes [2J ]. Thy-1 is involved in T 
cell activation and transducts signals through src-like tyrosine kinases [22].. 
Ly-6 genes have been implicated both in tumorigenesis and in cell-cell 
adhesion [L ; : 25]. Cell-cell or cell-matrix interaction is critical for local tumor 
growth and spread to. distal sites. From its restricted expression in basal 
cells of normal prostate and its homology to SCA-2, PSCA may play a role in 
stem/progenitor cell function, such as self-renewal (i.e. anti-apoptosis) 
and/or proliferation [1]. Taken together with the results in the present 
study, we speculate that PSCA may play a role in tumorigenesis and clinical 
progression of Pea through affecting cell transformation and proliferation. 
From our results, it is also suggested that PSCA as a new cell surface 
antigen may have a number of potential uses in the diagnosis, therapy and 
clinical prognosis of human Pea. PSCA overexpression in prostate biopsies 
could be used to identify patients at high risk to develop recurrent or 
metastatic disease, and to discriminate cancers from normal glands in 
prostatectomy samples. Similarly, the detection of PSCA-overexpressing- 
cells in bone marrow or peripheral blood may identify and predict metastatic 
progression better than current assays, which identify only PSA- positive or 
PSMA- positive prostate cells. 
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In summary, we have shown in this study that PSCA protein and mRNA are 
maintained in expression from HGPIN through all stages of Pea in a majority 
of cases, which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell differentiation), 
advanced stage and androgen-independent progression. PSCA protein 
overexpression is due to the upregulation of its mRNA transcription. The 
results suggest that PSCA may be a promising molecular marker for the 
clinical prognosis of human Pea and a valuable target for diagnosis and 
therapy of this tumor. 
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Expression of superoxide dismutases, catalase, and glutathione peroxidase 
in glioma cells, 

Zhong W, YanJT, j jni R, Qbcrley LW. 

Qty 52242 R uSv Ch Ub ° t!it0ry ' De P artment of Radiology, The University of Iowa, Iowa 

Four primary antioxidant enzymes were measured in both human and rat glioma ceils 
Both manganese-containing superoxide dismutase (MnSOD) and copper-zinc-containing 
superoxide dismutase (CuZnSOD) activities varied greatly among the different glioma 
cell lines. MnSOD was generally higher in human glioma cells than in rat glioma cells 
and relatively higher than in other tumor types. High levels of MnSOD in human glioma 
cells were due to the high levels . of expression of MnSOD mRNA and protein: 
Heterogeneous expression of MnSOD was present in individual glioma cell lines and 
may be due to subpopulations or cells at different differentiation stages. Less difference 
m CuZnSOD, catalase, or glutathione peroxide was found between human and rat glioma 
cells, The human glioma cell lines showed large differences in sensitivity to the 
glutathione modulating drugs 1 ,3-bis (2-chloroethyl)-l -nitrosourea (BCNU) and 
buthionine^ulfoximine (BSO). A good correlation was found between sensitivity to 
BCNU.and the activities of catalase in these cell lines. Only one cell line was sensitive to 
BSO and tlus line had low CuZnSOD activity. 
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[Expression of human telomerase reverse transcriptase in cervix cancer 
and its significance] 

[Article in Chinese] 

Department of Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College, 
Huazhong University of Science and Technology, Wuhan 430030, China. 

OBJECTIVE: To investigate the expression of human telomerase reverse transcriptase 
(hTERT) mRNA and protein in cervix cancer, cervical intraepithelial neoplasia (CIN) 
and normal cervix. METHODS: Expression of hTERT mRNA and the other two subunits 
of telomerase, human telomerase RNA component (hTR), human telomerase-associated 
protein (hTPl) was determined by RT-PCR in 3 cervix cancer cell lines, 2 diploid cell 
lines, 38 cases of cervix cancer, 16 cases of CIN and 20 cases of normal cervix. 
Telomerase activity was also examined by telomeric repeat amplification protocol 
enzyme-linked immunosorbent assay (TRAP-ELISA). Expression of hTERT protein was 
detected in all the cell lines and 101 cases of paraffinized cervix tissue sections. 
RESULTS: hTERT mRNA expression was detected in all of the three cervix cancer cell 
lines, 8 1.6% of cervix cancer, 37.5% of CIN, 5.0% of normal cervix, while in neither of 
the two diploid cell lines. The other two subunits of telomerase were prevalently 
expressed in all of the cell lines and most cervix tissues. There was a strong correlation 
between hTERT mRNA expression and telomerase activity. Immunostaining also 
revealed that hTERT protein was expressed in all three cervix cancer cell lines, 65.5% of. 
cervix cancer, 28.0% of CIN and 4.8% of normal cervix. CONCLUSION: Up-regulation 
of hTERT may play an important role in the development of CIN and cervix cancer, 
hTERT could be used as an early diagnostic biomarker for cervix cancer. 
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ORIGINAL ARTICLE 

Gene expression signatures and biomarkers of noninvasive and invasive 
breast cancer cells: comprehensive profiles by representational difference 
analysis, microarrays and proteomics 

GM Nagaraja 1 , M Othman 2 , BP Fox 1 , R.Alsaber 1 , CM Pellegrino 3 , Y Zeng 2 , R Khanna 2 , 
P Tamburini 3 , A Swaroop 2 and RP Kandpal 1 

1 Department of Biological Sciences, Fordham University, Bronx, NY, USA; 2 Department of Ophthalmology and 
Visual Sciences, University of Michigan, Ann Arbor, Ml, USA and 3 Bayer Corporation, West Haven, CT, USA 



We have characterized comprehensive transcript and 
proteomic profiles of cell tines corresponding to normal 
breast (MCF10A), noninvasive breast cancer (MCF7) and 
invasive breast cancer (MDA-MB-231). The transcript 
profiles were first analysed by a modified protocol for 
representational difference analysis (RDA) of cDNAs 
between MCF7 and MDA-MB-231 cells. The majority of 
genes identified by RDA showed nearly complete con- 
cordance with microarray results, and also led to the 
identification of some differentially expressed genes such 
as lysyl oxidase, copper transporter ATP7A, EphB6, 
RUNX2 and a variant of RUNX2. The altered transcripts 
identified by microarray analysis were involved in cell-cell 
or cell-matrix interaction, Rho signaling, calcium home- 
ostasis and copper-binding/sensitive activities. A set of 
nine genes that included GPCR11, cadherin 11, annexin 
Al, vim en tin, lactate dehydrogenase B (upregulated in 
MDA-MB-231) and GREB1, S100A8, amyloid /? pre- 
cursor protein, claudin 3 and cadherin 1 (downregulated in 
MDA-MB-231) were sufficient to distinguish MDA-MB- 
231 from MCF7 cells. The downregulation of a set of 
transcripts for proteins involved in cell-cell interaction 
indicated these transcripts as potential markers for 
invasiveness that can be detected by methylation-specific 
PCR. The proteomic profiles indicated altered abundance 
of fewer proteins as compared to transcript profiles. 
Antisense knockdown of selected transcripts led to 
inhibition of cell proliferation that was accompanied by 
altered proteomic profiles. The proteomic profiles of 
antisense transfectants suggest the involvement of pepti- 
dyl-prolyl isomerase, Raf kinase inhibitor and 80kDa 
protein kinase C substrate in mediating the inhibition of 
cell proliferation. 
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Introduction 

The transformation of a normal cell into a cancer cell 
has been correlated to altered expression of a variety of 
genes (Perou et al., 2000; Becker et al., 2005). The 
expression of some of these genes is' a direct result of 
sequence mutation, whereas other changes occur due to 
alterations in gene products that participate in specific 
pathways. The changes in gene expression have been 
routinely characterized by classical subtraction hybridi- 
zation and differential display approaches (Cerosaletti 
et al., 1995; Alpan et al., 1996). With the availability 
of the human genome sequence and sequences for a 
number of other model organisms, traditional methods 
have largely been replaced by gene microarrays 
(Khan et al, 2001). These analyses have been used to 
characterize the molecular basis of a variety of diseases 
including cancer. A comprehensive analysis of a large 
number of cancer cell lines allowed clustering of genes 
into groups based on their expression patterns in 
phenotypically related cell lines (Khan et aL, 2001; 
Dan et al, 2002; Rosenwald et al, 2002; van't Veer 
et aL, 2002). The results of profiling experiments 
indicated expression of specific gene clusters in cell lines 
that have the same origin or have arisen from the same 
organ (Ross et aL, 2000). A complementary approach 
that has been used in limited ways is proteomics. 
Proteomics scores for changes in different proteins 
and peptides in cells with characteristic pheno- 
typic differences. However, a comparative analysis of 
transcripts and proteins to establish a relationship 
between transcript changes and protein levels has not 
yet become routine. 

Although expression profiling of tumor tissue and its* 
comparison with normal tissue, in principle, is most 
appropriate to obtain the genetic signatures of a tumor 
type, such comparisons have not been free of attendant 
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complications. These complications arise due to hetero- 
geneity of tumor specimens wherein any cell type- 
specific changes are likely to be masked by other cell 
types that constitute the tumor specimen. For this 
reason, well-characterized cell lines established 
from tumor tissue may prove more informative and 
have been considered useful by cancer researchers. 
Comparing gene profiles between cell lines has the 
potential to reveal genes that could be causative for 
the phenotype and other genes that can serve as tumor 
biomarkers. 

Our investigations are aimed at designating a subset 
of transcripts that could distinguish a normal breast cell 
from a breast cancer cell and help to predict tumorigenic 
or metastatic potential of a transformed cell. We 
describe here transcript and proteomic profiles of a 
normal breast cell line, a tumorigenic but noninvasive 
breast carcinoma cell line and an invasive breast 
carcinoma cell line, and summarize them as a set of 
candidate biomarkers or targets for therapeutic inter- 
vention. The comparison of transcript profiles with 
proteomic profiles demonstrated that altered proteins 
were not always represented in the microarray desig- 
nated profiles and vice versa. Furthermore, we have 
targeted five transcripts that were upregulated in MCF7 
cells for investigating their role in cell proliferation 
pathways. The proteomic profiles have revealed that 
inhibition of cell proliferation by antisense knockdown 
was mediated by a specific set of proteins. 



Results 

Representational difference analysis 
As described in the Materials and methods section, 
RDA was performed by using cDNAs from MCF7 and 
MDA-MB-231 as tester/driver or driver/tester combina- 
tion. The difference product in the first case represents 
the genes that are either upregulated in or specific to 
MCF7. On the other hand, the difference product of 
MDA-MB-231 (tester) and MCF7 (driver) hybridiza- 
tion resulted in the isolation of cDNAs that are either- 
upregulated in or specific to MD A-MB-23 1 . The initial 
linkers used in this protocol had internal Bglll sites. 
One strand of the linker was used to amplify both the 
tester and driver cDNAs after linkers had been ligated 
to cDNAs. After removal of linkers from amplified 
cDNAs by digestion with Bgttl, a dephosphorylated 
BgKl adaptor was ligated to tester DNA. The BglK 
adaptor had an internal EcoRl site. The difference 
product was digested with EcoRl and cloned in pBlue- 
Script vector. The cloning efficiency of the difference 
product was very low (5 x 10* c.f.u.//xg of DNA). The 
low efficiency of cloning is attributed to a substantial 
fraction of amplified DNA product that is refractory to 
restriction digestion. The sequencing of a set of 100 
clones each from the difference libraries revealed 50 
different kinds of clones. The majority of these 
sequences were short fragments and represented either 
3' regions or internal fragments of transcripts. A 
summary of these clones is presented in Table 1. The 



involvement of the majority of these cDNAs is well 
characterized either in tumorigenesis or in metastasis. 
The phenotypic characteristics of MCF-7 and MDA- 
MB-231 ideally match with the biological significance of 
these genes. The alterations in transcripts for Rho 
signaling proteins, Ca 2+ binding/requiring proteins, 
tight junctions/anchoring junctions/gap junctions, cop- 
per binding or sensitive proteins, and RUNX2 are 
particularly noteworthy. 

The differential expression of a representative number 
of RDA clones was validated by semiquantitative PCR. 
As shown in Figure 1, these transcripts were either 
specific to or upregulated in the cell line that was used as 
a tester. Such analyses demonstrated that more than 
90% of the clones were differentially expressed. The 
abundance of transcripts and the results of RT-PCR 
were also confirmed by Northern blotting (Figure 2). 
The pattern of hybridization clearly indicates that all 
these transcripts showed differential expression in 
MGF7 and MDA-MB-231 cells that were used as 
driver/tester combinations for the RDA. 



Gene microarrays 

After obtaining preliminary molecular signatures of 
MCF7 and MDA-MB-231 cells by RDA, we used 
Affymetrix gene micoarrays to expand the above 
analysis to identify a comprehensive set of transcripts 
that is deregulated in invasive breast carcinoma cells. 
The comparisons of cell lines on the basis of transcripts 
that. are either present or absent as shown in Figure. 3 
revealed that a set of 123 genes distinguishes MDA-MB- 
23 1 cells from MCF7 and MCF10A. These genes can be 
classified by their involvement in functional classes such 
as transcription, signal transduction, cell adhesion, cell 
cycle, metabolism, transport, response genes and devel- 
opment (Figure 4). The majority of these genes 
participated in the process of signal transduction 
followed by transcription, cell adhesion and metabo- 
lism, respectively. A few transcripts in these classes were 
tested by real-time RT-PCR to confirm their altered 
abundance. The selected transcripts showed changes 
ranging between two- and 10-fold, 11- and 20-fold and 
greater than 20-fold, and were in close agreement with 
the results of microarray analysis. The qualitative 
pattern of change observed in microarrays analysis 
was readily reproduced by real-time or semiquantitative 
RT-PCR for all transcripts tested. 

The number of altered transcripts was over 1000 
based on a change of twofold or greater, and a majority 
of these genes show changes varying between two- and 
fourfold (Figure 5). Interestingly, with all comparisons 
combined, there were 21 genes downregulated more 
than 50-fold and 55 genes that were upregulated more 
than 18-fold when specific cell line pairs were compared 
(Figure 5). The transcripts that represent the extremes of 
upregulated and downregulated scale can allow distinc- 
tion between MCF7 and MDA-MB-231 cells. These 
transcripts include GPCR11, cadherin 11, annexin 
A'i, vhnentin, lactate dehydrogenase B (upregulated 
in * MDA-MB-231) and GREB1, S100A8, amyloid 



Oncogene 



Molecular signatures of breast carcinoma celts 
GM Nagaraja ef ai 



2330 



Table 1 Differentially expressed transcripts identified by RDA 



Upregulated in MDA-MB-231 



Downregulated in MDA-MB-231 



Extracellular matrix/ matrix-crosslinking proteins 
LOX 

Laminin /? 1 
Collagen VI-oc 1 

Calcium-binding proteins 
Reticulocalbin 1 
S100A8 
Cullin 5 

Transcription factors! promoter-binding proteins 
RUNX2 
c-Jun 
Fra-1 

Cell-cell adhesionj cell-surface receptor proteins; 

Cdhll 
CYR61 

MHC class II antigen y chain 
Protease-activated receptor- 1 
Pro tease- activated receptor-4 



ATPase/GTPase and signal transduction proteins 
ATP7a 
Cave ol in 2 

AXL receptor tyrosine kinase 
Rho GEF 3 
Rho/Rac GEF 18 
P21-Rac2 

Metalloproteases and MMP inhibitor proteins 
TIMP-2 
MT1-MMP 

Stress-response proteins 

Dual specificity phosphatase (DUSP) 

Cytoskeletal component and binding proteins 
Moesin 
Vimentin 
Filamin B 

Cell-cycle regulation and growth! differentiation! apoptosis proteins 
Cyclin Bl 
Cyclin E 
Cyclin A2 
Bcl2-like 1 protein 

Secreted proteins and growth factors 
Milk fat globule protein 
TGF-a 

SMAD-specific E3 ub ligase 2 

Miscellaneous 
Prion protein 



Calcium-binding proteins 
Calgranulin B 



Transcription factorsjpromoter-binding proteins: 
Chromosome 4 ORF 
Estrogen receptor 1 
RUNX2 variant (exon 8 deleted) 

Cell-cell adhesionjcell-surface receptor proteins 
Claudin 3 

Amyloid p precursor protein 
Triose phosphate isomerase 
Plakoglobin 
Cdh 1 
Cdh 3 

Annexin A9 
RAR-a 
Connexin 31 

ATPase/GTPase and signal transduction proteins 

RhoD 

RhoB 

TGF-0 Rl 
Stress-response proteins 

Protein kinase H 1 1 

Cytoskeletal component and binding proteins 
Keratin 18 
Tubulin 6 1 

Microtubule-associated protein t 

Cell-cycle regulation and growth/differentiationfapoptosis proteins 
S100A13 
S100C 

Aurora kinase A1K2 
Nucleosidediphosphate kinase 

Secreted proteins and growth factors 

Trefoil factor 3 (TFF3) 

Trefoil factor 1 (TFF1) 

Four and a half LIM domain 1 

Solute carrier family 16 SLC16A6 
DNA replication 

DNA replication complex GINS-PSF2 
Miscellaneous 

Serine protease inhibitor type 1 (SPINT1) 

Human homolog of Xenopus protein XAG 

Hypothetical protein FLJ22222 

Hypothetical protein 20171 

Hypothetical protein MGC3265 



p precursor protein, claudin 3 and cadherin 1 (down- 
regulated in MDA-MB-231). The distinction between 
MCF7 and MCF10A may be made based on keratin 19, 
serine protease, amyloid J? precursor, neuropeptide Y ; 
receptor Yl (upregulated in MCF7) and caldesmon, 
annexin ;A1, epithelial membrane protein 1, S100A2, 



keratin 15 (downregulated in MCF7). Likewise, 
MDA-MB-231 cells differ from MCF10A in vimentin, 
epithelial membrane protein 3, cadherin 1 1, GPCR 
116, collagen type XIII a 1, Bcl2-associated athanogene 
2 (upregulated in MDA-MB-231) and keratin 15, 
cystatin A, cadherin 1, CD24, calcium-activated chlor- 
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ide channel, SI OOP, GPCR 87 (downregulated in 
MDA-MB-231). Thus, a small subset of transcripts 
may serve as an accurate signature of these cell lines. 
Several of these gene products have been shown to 
participate in tumorigenesis and invasiveness of breast 
carcinoma cell lines. 

The invasiveness phenotype of MDA-MB-231 cells 
specifically relates to changes in the following functional 
classes: (a) cell adhesion molecules, (b) Ca 2+ requiring, 
Ca 2+ binding or Ca 2+ regulatory genes, (c) copper- 
sensitive or copper-transporting proteins and (d) specific 
regulatory proteins of Rho signaling. Among these 
functional groups, 23 transcripts involved in cell-cell 
or cell-matrix interactions are underexpressed in MDA- 



MB-231 cells and 21 transcripts were overexpressed in 
this cell line (Table 2), The comparison of Ca 2+ - 
requiring/binding genes indicated downregulation of 
26 transcripts and upregulation of 26 transcripts in 
invasive cells as compared to noninvasive cells (Table 3). 
While Ca 2+ homeostasis is extensively investigated in 
human cancers, copper homeostasis is an underexplored 
area. The alterations in copper homeostasis in breast 
carcinoma cells were reflected by changes in trans- 
cripts corresponding to a variety of copper-binding or 
copper-sensitive proteins/enzymes (Tables 1 and 4). The 
deregulation of Rho signaling was evident from changes 
in various proteins involved in this pathway (Table 4). 
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Figure 1 Semiquantitative evaluation of selected transcripts, (a) 
RNA was isolated from confluent culture dishes containing 
MCF10A (lanes 1 and 4), MGF7 (laaes 2 and 5) or MDA-MB- 
231 (lanes 3 and 6) cells. The amount of RNA was first determined 
spectrophometrically. Equal amounts of RNA, as determined by 
absorbance at 260 nm, were amplified with primers specific to actin 
gene for 18 cycles (lanes 1-3) or 20 cycles (lanes 4-6). The lane 
containing size markers is labeled as M. (b) A set of primers 
corresponding to caveolin 2 (lanes 1 and 2), TGF-a (lanes 3 and 4), 
Moesin (lanes 5 and 6) ( LOX (lanes 7 and 8), Axl receptor (lanes 9 
and 10), RhoD (lanes U and 12), SI 00A 13 (lanes 13 and 14),TFF3 
(lanes 15 and 16) and Claudin 3 (lanes 17 and 18) were amplified 
for 32 cycles. Lanes 1, 3, 5, 7, 9, 12, 14, 16 and 18 represent 
amplified products from MCF7 and lanes 2, 4, 6, 8, 10, 11, 13, 15 
and 17 represent MDA-MB-231 cells. The lanes containing PCR 
products from MDA-MB-231 cells are marked with an asterisk. 




Figure 2 Analysis of selected transcripts by Northern hybridiza- 
tion. The probes specific to CD74 (a), CYR61 (b), SPINT1 (c) and 
DUSP (d) were labeled with a 32 P nucleotide and hybridized to 
blots containing RNA from MCF10A (lane 1), MCF7 (lane 2) 
and MDA-MB-231 (lane 3). The blots were washed stringently and 
developed as described. The amounts of RNA loaded were 
normalized as in Figure 1 . 
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Figure 3 Comparison of cell lines based on the presence or absence of transcripts. The absence or presence of a transcript in the 
Asymetrix chip was scored by the fluorescence read-out as described in the Materials and methods section. 
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Gene Ontology 



1 Absent in MCF7 - Present in MD A MB 231 & MCF10A ■ Present in MCF7 - Absent in MDAMB231 & MCF10A 
Present in MDAMB231 - Absent in MCF10A & MCF7 D Absent In MDAMB231 - Present in MCF10A & MCF7 



Figure 4 Functional classes of transcripts that differentiate a cell line pair. The transcripts identified as present or absent were 
classified based on their functional importance. 



Gene Expression Levels 




Downregulated 



Upregulated 



No. of Folds 



| □ MCF7 vs 1 0A Q MDAMB231 vs 1 0A ■ MDAMB231 vs MCF7 1 

Figure 5 Distribution of altered number of transcripts as a 
function of fold change. The altered transcripts were categorized in 
groups based on the magnitude of change in their abundance. 



Table 2 Cell adhesion molecules altered in breast carcinoma cells 
Upregulated in MDA- MB- 231 Do wnr emulated in MDA-MB-231 



Cadherin 4 
Cadherin 11 
Catenin 
Integrin ot 6 

Transmembrane anchor protein 
Eph B2 
Dystoum 
Larainin p 1 
Lam in 
Filamin B 
Filamin C 
Tailin 1 
Butyrophilin 
Spectrin-a 
Spectrin-/? 
Thrombospondin 
Plastin 3 
Adducin 3 y 
Lamin B receptor 
Laminin (3 2 
. Lamin A/C 



Claudin3 
Cadherin- 1 
Cadherin- 3 
Cadherin- 18 

Cadherin, LAG seven pass receptor 

Down syndrome cell adhesion 

Cateriin-<52 

Eph A4 

Ephrin A4 

Annexin A9 

Ankyrin 3 

Sarcbglycan 

Keratin 8 

MAP-7 

MAP-t 

Plakoglobin 

Plakophilin 

Discoidin domain receptor 
Zona occludens 3 
PeriplaJtin 
Protocadherin ot 9 
Laminin y 2 
Laminin a 3 



Proteome analysis 

To identify altered abundance of proteins and relate it to 
transcript profiles, we characterized the protein profiles 
of MCF-10A, MCF-7 and MDA-MB-231 cells. Typi- 
cally, > 300 protein spots could be visualized in silver- 
stained gels, and there were far fewer protein spots in 
gels that were stained with Coomassie blue. The 
comparison of MCF7 or MDA-MB-231 proteins with 
MCF-10A revealed that MCF-7 had 11 unique protein 
spots, while MDA-MB-231 had 15 spots that were not 
seen in MCF^IOA. These proteins were either specific to 
or upregulated in these cell lines. The identity of these 
proteins is shown in Table 5. Out of these 26 protein 
spots, only 25 yielded amino-acid sequence. As shown in 
the table, the list includes proteins involved in stress 



response, protein- tagging activities, calcium-binding and 
calcium homeostasis proteins and some regulatory 
proteins. Prominent among these changes were proteins 
involved in calcium homeostasis such as crocalbin, 
calreticulin, calcyclin and reticulocalbin. The changes in 
signaling pathways between the two cell lines were 
indicated by altered levels of Rho GDP dissociation 
inhibitor 1, an apoptosis/differentiation regula- 
ting protein galectin, Myc expression regulator far 
upstream binding protein- 1 and the microtubule reg- 
ulator protein 'stathmin. The translation initiation 
factors 5A and 4H were also selectively upregulated in 
MDA-MB-231 cells. 
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Table 3 Calcium binding or sensitive transcripts 
Upregulated in MDA-MB-231 Downreguhted in MDA-MB-231 



Reticulocalbin 
Dystonin 
Follistatin like 1 
Cullin 
Annexin A5 

EF hand domain containing-2 

Hippocalcin like-2 

LDLR 

Steroid sulfatase 

MT-actin crosslinking factor 

Inositol- 1 ,4,5-triphosphate 

receptor 3 

Sorcin 

Guanine nucleotide-binding 

protein-y 

GAS 6 

SWP-70 protein 
Jagged 1 

EGF-containing fibrulin-like 
ECM 

Transglutaminase 2 

Thrombin receptor-like 1 

Plastin 3 

FYN oncogene 

PKC-a 

Cahnegin 

Calpain 

HEG homo log 

Cyr61 



S100A8 
S100A7 
S100A13 

Tumor associated Ca J + 
signal transducer 
Notch homolog 3 
PKD2 

Adenylate cyclase 
Phospholipase-C 
Chemokine ligand 12 
Ubquitin-speciflc protease 
PKC-rj 

Ret-pro to oncogene 

Signal peptide-CUB domain 

Mannosidase a 

Bradykinin receptor B2 

Solute carrier family 24 

Ca 2+ channel voltage-dependent /?3 

Regulator of G protein signaling 17 

Dystrobevin-a 

Synaptogamin 1 

Matrix gla protein 

EF hand domain family member D 1 

PK-cyclic AMP dependent 

ATPase-Ca 2+ transporting 

Calmodulin 1 

CaM kinase 



Table 5 Proteomic profiles of MCF7 and MDA-MB-231 cells as 
compared to MCF10A cells 



MCF7 cells 



MDA-MB-231 cells 



Cell-cell adhesionj cell-surface 
receptor proteins 
Triose-phosphate isomerase 

Stress-response proteins 
Hsp27 

Superoxide dismutase 
Peroxiredoxin 2 



Cytoskeletal component /binding 
proteins 
Stathmin 

Cell- cycle regulation and growth/ 
differentiation/ apopto sis proteins 

Nucleoside diphosphate kinase A 

S100C 

Secreted proteins and growth factors 
Macrophage migration inhibitory 
protein 

Miscellaneous 
Cyt c oxidase Vlb 
Peptidyl-prolyl cis-trans isomerase 
Ubiquitin 



Calcium-binding proteins 
Calcyclin 
Calreticulin 
Crocalbin 
Reticulocalbin 

Transcription Factors/ 
Promoter-binding proteins 

Far upstream element 

binding protein- 1 

Far upstream element 

binding protein-2 

Cell-cell adhesion/ cell-surface 
receptor proteins 
Galectin 

A TPase/GTPase/ signal trans- 
duction/ trafficing proteins 

Rho GDP dissociation 

inhibitor 1 

Stress-response proteins 
Hsp 70 

Peroxiredoxin 2 

Cytoskeletal component/ 
binding proteins 
Stathmin 



Table 4 Altered transcripts involved in copper homeostasis and Rho 
signaling 



Copper-binding proteins 


Rho signaling proteins 


LOX 


Rho3 . 


LOX-1, LOX-2 


Rho/Rac GEF 18 


SCO cyt oxidase-deficient 


Rho GEF 12 . 


homolog 2 


Ras related- C3 botulinum toxin 


COX 17 homolog 


substrate 2 


MetaJlothionein IE, IF and 2a 


Cdc 42 effector protein 3 


Ring finger protein7 


Rho GEF 3 


Amiloride-binding protein 1 


Rho GDP dissociation inhibitor /? 


Neurotrypsin/motopsin 


RhoD 



Changes in proliferation characteristics and protein 
profiles in response to transection with antisense 
constructs of selected transcripts 

We had observed significant upregulation of transcripts 
for DNA replication complex protein GINS PSF2, 
trefoil factor 3, aurora kinase AIK2, protein kinase HI I 
and secreted protein XAG in MCF7 cells. We reasoned 
that antisense knockdown of the above genes in MCF7 
cells might indicate pathways involved in tumorigenesis 
and invasiveness. 

MCF7 cells were transfected with empty vector 
pCDNA3.1 or antisense constructs of the above genes. 
A semiquantitative amplification of pCDNA marker 
gene by RT-PCR confirmed the presence of the 
transfected construct in a significant proportion of the 
cell population. The transfected cells also showed a 
decrease in the target transcripts as observed by RT- 



Miscellaneous 
Heterogeneous nuclear 

. ribonucleo protein H 
eIF4H (translation) 
eIF5A (translation) 



PCR. The effects of antisense transfections were scored 
by growth characteristics of the transfectants. The cell 
proliferation was reduced between 15 and 40% when 
antisense transfectants were compared to cells trans- 
fected with empty vector. 

In order to relate decreased proliferation of transfec- 
tants to altered proteins, proteomic profiles of transfec- 
tants were compared with vector controls. The 
comparison of protein profiles of cells transfected with 
empty vector or antisense construct revealed alterations 
in several proteins for each transfectant (Table 6). These 
proteins included stress-response proteins, calcium- 
regulating proteins, translation factors, ubiquitin, pro- 
teins of electron transport chain and oxidative phos- 
phorylation, signaling proteins, cytokeratins, actin and 
actin regulating proteins and general regulatory factors. 
The number of altered proteins varied between 5 and 15 
for various transfections. Pep tidy 1 prolyl cis-trans 
isomerase, calcium- regulating proteins, SOD, galectin, 
histidine triad protein and PKC substrate were promi- 
nent among altered proteins. We performed database 
searches to identify interactors for all proteins that were 
altered in transfected cells and identified nearly '350 
proteins (data not shown). A significant number of these 
interacting proteins are involved in transcriptional 
regulation. . 
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Table 6 Altered proteins in MCF7 cells after antisense knockdown of specific transcripts 



After transfeciion with as- trefoil factor 3(TFF3) 



After transfeciion with as-protein kinase HI I 



Calcium-binding proteins 

Calmodulin 
Cell-cell adhesion I cell-surface receptor proteins 

Retinoic acid-binding protein II 
ATPase-GTPase/signal transduction/ trafftcing proteins 

Raf kinase inhibitor 
Stress-response proteins 

Hsp27 

Peroxiredoxin 1 
Cytoskeletal component /binding proteins 

Cofilin-nonmuscle isoform 
Cell-cycle regulation and growth/ differentiation/ apoptosis proteins 

Chromobox protein homolog 1 

Chromobox protein homolog 3 

Translationally controlled tumor protein 
Miscellaneous 

40S ribosomal protein S 12 

ATP synthase D chain 

Cancer-associated Sm-like protein 

Cyt c oxidase polypeptide Va 

eIF5A (translation) 

Ferritin heavy chain 

His triad nucleotide-binding protein 

HistoneH2B.n 

Peptidyl-proiyl isomerase 

Proteosome subunit /?-type 6 
> RNA-binding protein 8A 

Thioredoxin 

Ubiquitin crossreactive protein 

After transfeciion with as-aurora kinase A1K2 

Calcium-binding proteins 

Calgranulin B 

Calgranulin A 
Stress-response proteins 

Hsp27 

Superoxide dismutase 
Miscellaneous 
60S acidic ribosomal protein P2 
Peptidyl-proiyl isomerase 
Ubiquitin-cross re active protein 



Celt-cell adhesion/ cell-surface receptor proteins 
Galectin 

Retinoic acid-binding protein II 
ATPase-GTPase/signal transduction/trafficing proteins 

PKC substrate 
Stress-response proteins 

Hsp27 

Cytoskeletal component /binding proteins 

Actin-a, skeletal muscle 

Cytokeratin 18 
Miscellaneous 

30H-acyl Co A dehydrogenase II 

ATP synthase A chain 

Cyt c oxidase peptide Va 

Enhancer of rudimentary homolog 

Thioredoxin 

Ubiquinol-cyt C reductase 
No match 

After transfeciion with as-ONA replication complex GINS PSF2 
Calcium Binding proteins: 
Calreticulin 

. ATPase-GTPase/signal transductionitrafficing proteins 
14-3-3 if 
PKC substrate 
Cytoskeletal component /binding proteins: 
Cytokeratin 1 
Cytokeratin 18 

Cell-cycle regulation and &. growth/ differentiation/ apoptosis proteins 

Chromobox protein homolog 1 

Chromobox protein homolog 3 

MAP/MT affinity regulator 
Miscellaneous 

ATP synthase D chain 

Peptidyl-proiyl isomerase 

XTbiquinol-cyt c reductase 

After transfeciion with as-human homolog ofXAG 
Calcium-binding proteins 

Calreticulin 
Stress-response proteins 

Hsp27 
Miscellaneous 

ATP synthase A chain 

Peptidyl-proiyl isomerase 

Ubiquitin crossreactive protein 

No match 



Discussion 

The results presented here validate the gene profiles 
obtained from different expression platforms ranging 
from subtractive hybridization to gene microarrays and 
proteomic analysis. The RDA protocol is powerful 
enough to yield important genes that show significant 
alterations in their expression between cell lines, and can 
lead to isolation of full-length cDNAs by using 
appropriate modifications (Baskaran et aL, 1996; Jacob 
et ai, 1997). The detection of RUNX2, variant of 
RUNX2, EphB6, prion protein, lysyl oxidase and a 
copper transporter ATP7A transcripts by RDA warrant 
specific mention. RUNX transcription factors bind 
specific motifs on target gene promoters and regulate 
gene expression leading to cell growth, proliferation and 



differentiation (Pratap et aL, 2003). RUNX2 and its 
variant have differential repression activity toward the 
promoter of the cyclin-dependent kinase inhibitor 
(p21CIPl) (Westendorf et aL, 2002). The loss of EphB6 
expression due to methylation of its promoter is related 
to invasiveness of MDA-MB-231 (Fox and Kandpal, 
2004; unpublished observations). Lysyl oxidase, a 
copper-sensitive enzyme, causes oxidative deamination 
of lysine and hydroxy lysines of collagen to aldehyde 
forms to stabilize collagen fibrils (Siegel, 1976) that are 
found in invasive breast carcinoma cells (Akiri et aL, 

2003) . The activation of LOX is dependent on copper 
that is internalized and then transported to trans golgi 
network by copper transporter ATP7A (Pase etaL, 

2004) , a protein mutated in Menkes disease (Molfer 
et aL, 2005). Prion protein has also been reported as a 
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chaperone for copper (Jones et aL, 2004). Thus, EphB6 
can serve as a marker for invasiveness, and LOX and 
ATP7A may be exploited as relevant targets for 
therapeutic intervention. 

The downregulation of junctional proteins along with 
inactivation of TTMPs as shown here is in agreement 
with other reports describing their relationship with 
invasiveness of carcinoma cells (Johnson, 1991; Kousi- 
dou et al, 2004; Shao et aL, 2005) and promoter 
methylation (Costa et aL, 2004). As several transcripts 
coding for junctional proteins are downregulated in 
invasive cells, we postulate that methylation-specific 
PCR can be exploited to use these transcripts as 
biomarkers of tumor cells in general and invasiveness 
in particular. The changes in cell-cell interaction 
correlate to cell phenotypes because such interactions 
influence Rho/Ras signal transduction pathways and 
vice versa (Malliri and Collard, 2003; Nagaraja and 
Kandpal, 2004; Ridley, 2004), and lend credence to the 
significance of altered transcripts for Rho and Rho 
GEFs as presented here. 

Early changes in calcium homeostasis as measured by 
calcium excretion have been reported in breast cancer 
(Campbell et aL, 1 983), and altered calcium signaling has 
been shown in invasive lung carcinoma cells (Amuthan 
et aL, 2002). Prominent among calcium-binding proteins 
are S-100 protein, a group of intracellular messengers 
that respond to transient changes in calcium concentra- 
tion by binding to specific receptors (Marenholz et aL, 
2004) and regulate cell growth, differentiation and 
motility, transcription and cell cycle. The S-100 proteins 
detected in the present study map to chromosome lq21, 
a region of genome that is frequently altered in human 
breast cancer cells (Bieche et aL, 1995). Calcium ions act 
as a second messenger in specific signaling pathways in a 
variety of cancers (Missiaen et aL, 2000) and are known 
to alter calcineurin to activate transcription factors such 
as NFATc (Luo et aL, 1996). 

As dictated by post-transcriptional regulation, protein 
profiles showed far fewer changes as compared to 
transcript profiles, and the knockdown of five selected 
genes in MCF7 cells produced interesting changes in 
protein profiles. These genes, namely, XAG (secretory 
Xenopus laevis protein), trefoil factors 3, human aurora2 
kinase AIK2, protein kinase Hll and DNA replication 
complex GINS PSF2, have been shown to be estrogen 
responsive, oncogenic or involved in tumorigenesis (Yu 
et aL, 2001; Fletcher et aL, 2003; Katoh, 2003; Warner 
et aL, 2003; Takayama et aL, 2003). The antisense 
constructs of these genes appeared to work as siRNAs 
as suggested by the reduction in the transcript detected 
in RT-PCR of RNA isolated from the transfected cells. 
The involvement of the above transcripts in invasive 
potential is apparent from the observed upregulation of 
calcium-binding proteins in transfected MCF7 cells, 
which is comparable to their levels in MDA-MB-231 
cells. The proteins that appear to mediate inhibition of 
proliferation in antisense-transfected cells include PKC 
substrate, Raf kinase inhibitor, histidine triad nucleo- 
tide-binding protein and peptidy -prolyl isomerase 
(Pinl). We believe histidine triad protein effects are 



most likely mediated via its interaction with ATM 
protein. Raf kinase inhibitor (Keller et aL, 2004) and 
ATM (Hall, 2005) have been conclusively linked to 
transformation of cells, and the activity of Pinl has been 
related to p53-mediated signaling pathways (Mantovani 
et aL, 2004; Berger et aL, 2005). In this context, p53 
activation has also been hypothesized by Cu-SOD 
prion-like enzyme (Wiseman, 2005). Thus, alterations 
in copper homeostasis and p53-mediated signaling may 
be considered as a significant regulatory mechanism in 
tumorigenesis. 

In summary, we have presented here a set of 
candidate genes that, can serve as biomarkers for 
tumorigenesis and invasiveness, and some of these 
markers may be used to develop DNA-based diagnostic 
tests. The alterations in transcripts for copper home- 
ostasis genes suggest copper chelation or inhibition of 
copper transporter ATP7A as potential targets for 
therapeutic application. The modulation of RUNX2 
splicing variants by chemicals that affect splicing 
machinery may also be explored as a therapeutic 
modality. The changes in EphB6 expression, if con- 
firmed in tumor specimens, may have prognostic 
significance. 



Materials and methods 

Breast cancer cell lines 

We used MCF-10A, a cell line established from normal breast, 
and two breast carcinoma cell lines MCF-7 and MDA-MB : 
231 that vary in their in vitro and in vivo invasiveness. All ceils 
were cultured at 37°C/7% C0 2 . MCF-10A cells were grown in 
1:1 DMEM:F12 media (Gibco) with 5% horse serum (Gibco), 
20mM HEPES, lOng/ml EGF (Invitrogen), 10ml/l PenStrep- 
Glutamine (lOOOOU/ml penicillin, 10000^g/ml streptomycin 
and 29.2mg/ml L-glutamine), 10^g/ml insulin (Invitrogen), 
0.1^g/ml Cholera Toxin (Sigma) and 500ng/ml hydrocorti- 
sone (Sigma). MCF-7 and MDA-MB-231 cells were grown in 
DMEM (Gibco) supplemented with 2mM L-glutamine (Gib- 
co), 1 mM sodium pyruvate (Gibco), 5 ml/1 penstrep (5000 U/ 
ml penicillin and 5000^g/ml streptomycin), and 10% fetal 
bovine serum (Hyclone). 

Total RNA isolation 

RNA was isolated from 85 to 95% confluent 10 cm tissue 
culture dishes using TRI reagent (Molecular Research Center 
Inc.) with slight modifications to the recommended protocol. 
Approximately 10 million cells were mixed with 1.0 ml Tri 
Reagent, the mixture was extracted with chloroform and the 
aqueous phase containing RNA was separated. The RNA was 
precipitated with isopropanol, the pellet washed sequentially 
with 80 and 100% ethanol, then dried and resuspended in 
DE PC- treated water. RNA was stored in aliquots at -70°Q 
The quality of RNA was visualized by running on a 
formaldehyde gel. The appearance of ribosomal RNA bands 
indicated that RNA was not degraded during the procedure. 
The amount of RNA was determined by its absorbance at 
260 nm. . . 

BNAase treatment of total RNA 

To remove DNA contamination, 20 fig of RNA (quantified 
speciiophotometrically) was treated with 500 ng DNAase I, 
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80 U RNasin (Promega) and 1 mM MgCl 2 in Tris buffer in a 
total volume of 50 /d. The reaction was carried out at 37°C for 
1 h and the DNAase inactivated by heating to 65°C for 30 min. 

Representational difference analysis (RDA) 
RDA of cDNAs is a modification of genomic RDA (Lisitsyn 
et ai, 1993). We performed RDA in the following two ways. In 
one experiment, MCF-7 cDNA was used as a driver and 
MDA-MB-231 cDNA as a tester. In the second experiment, 
MCF-7 cDNA was used as a tester and MDA-MB-231 cDNA 
as a driver. The protocol has been describer previously (Jacob 
et ai, 1997). Briefly, first-strand synthesis was carried out using 
a commercial cDNA synthesis kit as per the manufacturer's 
protocol. A linker with a BgUl site was ligated to the tester as 
well as the driver cDNA. A primer specific to one of the linker 
strands was used to PCR amplify the linker-ligated cDNAs. 
The linkers were then removed by digesting the cDNA with 
BgH\ and the digested cDNA was gel purified. A second set of 
unphosphorylated BgUl adaptors was ligated to the tester 
cDNA only. The tester and driver DNAs were hybridized in a 
5 fi\ reaction volume at a ratio of 1:40. After hybridization, the 
ends of the tester homoduplexes were repaired with Klenow 
polymerase and 1 fi\ of the reaction mixture was diluted to 
100 /il. The difference product was obtained by amplifying 1 /d 
of the diluted mixture using the top strand of the ligated 
adaptor as a primer. The amplified difference product was 
digested with EcoRl and cloned in a pBlueScript vector. 
Individual clones were picked up and sequenced by Sanger* s 
dideoxy chain termination method. Representative clones were 
validated by Northern analysis and semiquantitaive RT-PCR. 

Microarray analysis 

The GeneChips, Human Genome U133A 2.0, (Affymetrix, 
Santa Clara, CA, USA) used in this study contained 
approximately 22000 probe sets corresponding to 1840O 
transcripts and variants, including 14 500 well-characterized 
human genes. 

Total RNA was converted into double-stranded cDNA by 
using Superscript II (Invitrogen, Carlsbad, CA, USA) and an 
oligo-dT primer containing a heel of the T7 RNA polymerase 
promoter sequence. The reaction mixture containing double- 
stranded cDNA was extracted with phenol-chloroform, 
precipitated with ethanol, and dissolved in 12^1 RNase-free 
water. The cDNA was transcribed in vitro by using a RNA 
transcription labeling kit (Enzo Biochem, Famingdale, NY, 
USA) with 6 y\ of double-stranded cDNA in the presence of 
ATP, CTP, GTP, UTP, bio-ll-CTP and bio-16-UTP. The 
biotinylated RNA was purified by using an affinity column 
(Qiagen, Valencia, CA, USA) and fragmented randomly, by 
heating to 95°C in the presence of fragmentation buffer, . 
between sizes of 35 and 200 bases. The GeneChips were : 
hybridized overnight at 45°C in hybridization oven in a. 
solution containing fragmented cRNA, control oligonucleo- / 
tide B2, 20 x eucaryotic hybridization controls, herring sperm 
DNA, acetylated BSA and 2x hybridization buffer. The 
GeneChips were washed and stained with streptavidin- 
phycoerythrin and the antibody in 2 x MES stain buffer, 
acetylated BSA, and optically read at a resolution of 6 jmi with 
a Affymetrix GeneChip scanner 3000. Affymetrix MICRO- 
ARRAY SUITE ' was used for initial data preparation 
(generation of .CHP files). Normalization (quantile method) 
and calculation ^of signal intensities was performed with the 
software package RMA from the R project (http://www.r- 
project.org/). For every cell line, three replicates were ' 
performed with Affymetrix Gene Chips. The Gene Chip data 
were used for further calculations after the raw image and 



MASS analysis revealed a positive quality report. Ratios of 
average signal intensity, (log 2) were calculated for the probe 
sets between pairs of cell lines and then converted to an 
average fold change (AFC). Statistical validation was per- 
formed on probe sets as described (Yoshida et al, 2004). The 
statistical method used to assign P- values to the fold changes 
of gene responses is described by Yoshida et al. (2004) and is a 
two-step procedure based on the Benjamini and Yekutieli 
construction of false discovery rate confidence intervals 
(FDRCI) (Reiner et al, 2003). Functional annotation of 
proteins was assigned through Gene Ontology (http://www. 
geneontology.org) or Locuslink (http://www.ncbi.nlm.nih.gov/ 
LocusLink) classifications obtained through appropriate pub- 
lic databases. 

Quantitative RT-PCR 

RNA was reverse transcribed with Superscript EI (Invitrogen, 
Carlsabad, CA, USA) RT by priming with oligodT. The 
primers specific to validated genes were synthesized from the 3' 
untranslated region using Primer 3 software. PCR reactions 
were then performed in triplicates in an I-cycler Thermocycler 
with optical module (BioRad, Hercules, CA, USA). The 
amplified products were quantified by reading fluorescence of 
SybrGreen I (Molecular Probes, Eugene, OR, USA). Average 
fold changes were calculated by differences in threshold cycles 
(CJ between pairs of samples to be compared. HPRT gene was 
used as a control. 

Semiquantitative RT-PCR 

The spectrophotometrically determined concentration of RNA 
was confirmed by amplifying actin message at different cycles. 
The cycling conditions that yielded proportional increment of 
amplified product was used to normalize the RNA concentra- 
tion- The normalized RNA was used as template to determine 
relative abundance of transcripts corresponding to clones 
identified by RDA experiments. The conditions were standar- 
dized in the range of cycles that yielded a PCR product for at 
least one of the pairs of compared RNAs. Such experiments 
dictated cycles between 30 and 35 to be appropriate to 
compare abundance of selected transcripts in MCF7 and 
MDA-MB-231 cells. 

Northern analysis' 

The expression pattern of selected transcripts in cell lines was 
also analysed in Northern blots. RNA (20 /an), as determined 
by spectrophotometer and confirmed by actin amplification, 
was electrophoresed on a formaldehyde agarose gel. A RNA 
ladder was used as a size marker. The RNAs were transferred 
from the gel to a Hybond nylon membrane by capillary 
transfer. The RNA was fixed onto the membrane by 
irradiation in a Stratalinker. The blot was hybridized at 65°C 
for 12-1 5 h with a radioactive probe and the blot was 
subsequently washed with 0.1 x SSC and 1% SDS at 65°C. 
The hybridized probe was detected by autoradiography. 

Transfection of MCF-7 cells with antisznse constructs 
The genes . selected on the basis of their upregulation were 
cloned in antisense orientation in pCDN A3. 1 vector (Inviro- 
gen). MCF-7 cells were grown to 70-80% confluence. 
Approximately 4/*g of DNA was transfected into MCF-7 
cells by using Lipofectamine 2000. The transfected cells were 
grown in the presence of G418 (400^g/ml). The transfectants 
were processed for protein isolation. A control set of cells was 
transfected with an empty pcDNA3.1. The proteins were 
analysed by two-dimensional electrophoresis, and altered 
bands were excised for mass spectrometry. 
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Protein isolation 

The cultured cells were harvested by trypsinization and 
centrifuged at 220 g for 5min at 4°C. The cell pellet was 
washed once with ice-cold 1 x PBS. . The proteins were isolated 
by using a commercial kit (Bio Rad, Hercules, CA, USA). 
Briefly, pelleted cells (0.05 ml) were mixed with 0.5 ml ice-cold 
CPEB solution containing protease inhibitors cocktail 
(Roche), vortexed and stored on ice for 30min. The cell 
suspension was passed through a syringe needle (20 gauge) for 
10-20 strokes to ensure complete cell lysis. The cytoplasmic 
protein fraction was collected by centrifugation at 100 g for 
lOmin at 4°C. The nuclear pellet was washed once again with 
0.5 ml CPEB solution. The nuclear pellet was resuspended in 
0.75 ml PSB buffer, vortexed briefly and centrifuged at 1000 g 
for lOmin at 4°C, and the supernatant containing nuclear 
protein was collected into a new tube. The samples were 
quantified using 2D Quant kit (Amersham Biosciences), 
aliquoted and stored at -80°C to prevent protein degradation. 
To reduce streaking, background staining and the other gel 
artefacts associated with substances contaminating 2D/IEF 
samples, the samples were cleaned with 2D Clean up kit (Bio 
Rad, Hercules, CA, USA) before running on the gel. 

Two-dimensional gel electrophoresis 

The protein mixtures were separated based on isoelectric 
points by using commercial pre-cast pH gradient gel strips 
according to the manufacturer's instructions. The protein 
sample (175 /ig) in 185 /il of sample buffer (8M urea, 2% 
CHAPS, 0.2% bioiytes, 3/10 ampholytes, 65 mM DTT and 
0.002% bromo phenol blue) was loaded in the sample loading 
trays at the end of 1 1 cm immobilized rehydrated strips (pH 3- 
10) (Bio Rad, Hercules, CA, USA). Following isoelectric 
focusing, proteins were reduced and alkylated by successive 
15min treatments with equilibration buffer (6M urea, 0.375 M 
Tris-HQ pH 8.8, 2% SDS, 20% glycerol, 2% DTT) and 2.5% 
(W/V) iodoacetamide, respectively. Proteins were then re- 
solved in the second dimension on 8-16% gradient SDS- 
PAGE gel (Bio Rad, Hercules, CA, USA). The protein spots 
were visualized by staining with either silver stain or 



Coomassie blue stain. The gel images were compared and 
bands showing significant (greater than twofold) alterations in 
intensity were excised and processed for mass spectrometry. 
Comparisons were made between protein lysates from MCF- 
10A, MCF-7 and MDA-MB-231 cell lines or between MCF-7 
and MCF-7 cells transfected with specific antisense constructs. 

Protein identification by enzymatic digestion followed by mass 
spectrometry 

Prior to performing trypsin digestion, the gel pieces containing 
protein spots were washed sequentially once with water and 
twice with acetonitrile. The gel pieces were then allowed to 
swell in lOOmM ammonium bicarbonate and finally washed 
with acetonitrile. The washed slices were dried in a Speed Vac 
concentrator, and subsequently incubated with 20 /il of 
Promega's autocatalysis-resistant trypsin (12.5ng/fd in 50 mM 
ammonium bicarbonate and 5 mM CaCl 2( pH 8.0) overnight at 
37°C. The supernatant (5 fi\) from tryptic digest was injected 
for peptide sequence analysis using on-line capillary liquid 
chromatography-electro spray ionization- tandem mass spectro- 
metry (LC-MS/MS). The front end HPLC utilized a Dionex 
(San Francisco, CA, USA) Vydac 300 fxm inner diameter 
x 15 mm C18 column. The linear acetonitrile gradient (3%/ 
min, containing 0.02% TFA) was developed using a Hewlett- 
Packard 1 100 pump operating at 0.1 ml/min, and the flow was 
split before the injector such that the flow rate through the 
column was 3 /xl/min. Peptides were detected at 215nM The in- 
line mass spectrometer was a ThermoElectron LCQ-DECA 
instrument operated in data-dependent MS/MS mode, and 
proteins were identified by searching a non redundant protein 
database using the Sequest program. 
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Identification of androgen-regulated genes in the 
prostate cancer cell line LNCaP by serial analysis 
of gene expression and proteomic analysis 

A common therapy for nonorgan-confined prostate cancer involves androgen depriva- 
. tion. To develop a better understanding of the effect of androgen on prostatic cells, we 
have analyzed gene expression changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer line LNCaP, at both RNA and protein levels. 
Changes at the RNA level induced by DHT were determined by means of serial analysis 
of gene expression (SAGE), and protein profiling was done by means of quantitative 
two-dimensional polyacrylamide gel electrophoresis. Among 123371 transcripts anal- 
yzed, a total of 28844 distinct SAGE tags were identified representing 16570 genes. 
Some 351 genes were significantly affected by DHT treatment at the RNA level 
ip < 0.05), of which 147 were induced and 204 repressed by androgen. In two indepen- 
dent experiments, the integrated intensity of 32 protein spots increased and 12 
decreased at least two-fold in response to androgen, out of a total of 1031 protein 
spots analyzed. The change in intensity for most of the affected proteins identified 
could not be predicted based on the level of their corresponding RNA. Our study pro- 
vides a global assessment of genes regulated by DHT and suggests a need for profiling 
at both RNA and protein levels for a comprehensive evaluation of patterns of gene 
expression. 
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1 Introduction 

Androgens affect numerous aspects of prostate biology 
including development, growth, and maintenance. They 
also play a critical role in tumorigenesis and progression 
of prostate cancer. Androgen deprivation is an estab- 
lished treatment modality for prostate cancer. However, 
the disease eventually progresses into a hormone refrac- 
tory cancer. Several mechanisms have been identified 
which may contribute to androgen independence: (1) 
Mutations in the androgen receptor (AR) lead to ligand- 
independent activation or promiscuity of the receptor 
[1 , 2]. The ability of the receptor to activate or repress 
downstream genes can also be affected by mutation. AR 
mutation is associated with advanced phases of prostate 
cancer [3, 4]; (2) AR can be activated in a ligand-indepen- 
dent manner by specific growth factors and cytokines [5]; 
(3) AR gene amplification has been found to occur in 
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28-30% of tumors that recurred post androgen -ablation 
therapy [6, 7], Wallen et al. [8] have recently shown that 
one -third of locally recurrent hormone refractory prostate 
cancer contain AR gene amplification; (4) Coactivator 
amplification and corepressor down-regulation have 
been shown to increase receptor transactivation [9, 10]. 
Regardless of which pathway(s) is taken by the tumor 
cells, androgen-regulated genes may ultimately be the 
key players in the development of hormone refractory 
cancer. As a step towards a better understanding of the 
effect of androgen on gene expression, we have under- 
taken a comprehensive assessment of gene expression 
changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer cell line LNCaP. An 
important feature of our study is the parallel assessment 
of expression changes at both RNA and protein levels. 



2 Materials and methods 

2.1 Prostate cell line and DHT treatment 

LNCaP human prostate cancer cell line (American Type Cul- 
ture Collection, Rockville, MD f USA) was cultured for 3 d in 
phenol-free RPMI 1640 supplemented with 5% charcoal- 
stripped fetal bovine serum (FBS) at 37°C. Half of the cul- 
tures were then treated with 10" 9 m DHT for 24 h. Total RNA 
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and protein were extracted from untreated and DHT treated 
cells using TRIzol (Invitrogen, Carlsbad, CA, USA) reagent 
for RNA and a solubilization cocktail for proteins, consisting 
of 9 m urea, p-mercaptoethanol and 1 % NP-40. For kinetics 
experiments 1 x 10 6 cells were plated in phenol-free RPMI 
medium with 5% charcoal-stripped FBS. On day three, 
cells were treated with 5 mL of media containing 10~° m 
DHT and total RNA was extracted at different time points 
from 0-96 h. To determine the direct/indirect effect of andro- 
gen on RNA levels, cells were plated as above and treated 
with 5 ng/mL of cyclohexamide in the presence or absence 
of 1 0" 9 m DHT for 24 h and total R NA was extracted. 

2.2 Serial analysis of gene expression 

Serial analysis of gene expression (SAGE) was performed 
as described previously [11] with the following modifi- 
cations: ditags were PCR amplified using biotinylated pri- 
mers and digested with /vValll enzyme [12]. Concatemers 
were heated for 15 min at 65°C and chilled on ice for 
10 min before separation on an 8% polyacrylamide gel 
[13]. The concatemers were then cloned into the Sphl 
site of the pZero vector (Invitrogen). Concatenated tags 
were screened by PCR using M13 forward and M13 
reverse primers. PCR products with inserts greater than 
500 bp were isolated and sequenced with M13 forward 
primer on an automated 3700 DNA sequencer (Perkin- 
Elmer, Norwalk, CT, USA). For microSAGE, 1 \ig of total 
RNA per tube was used for cDNA synthesis in two tubes, 
with the rnRNA Capture Kit (Boehringer Mannheim, 
. Indianapolis, IN, USA. cDNAs were cleaved with N!a\\\, 
ligated to linkers and then digested with Bsm Fl enzyme. 
The released tags were ligated, and processed for the 
rest of steps as with the standard SAGE protocol. 

2.3 SAGE data analysis 

SAGE tags were extracted using the SAGE software 
V 4.12 [11]. Tags were matched to the SAGE reliable 
map (release 10-26-2000) (http://www.ncbi.nlm.nih.gov/ 
SAGE/). Due to the fact that some tags map to multiple 
genes and some genes have multiple tags, SAGE data 
were analyzed in two different ways: (1) exclusion 
method: tags that match to multiple genes were dis- 
carded. Only tags that match to a single gene were tabu- 
lated and composite counts analyzed for their signifi- 
cance; (2) inclusion method: tags that match to multiple 
genes were counted at 100% toward each gene. All tags 
were tabulated and composite counts analyzed for their 
statistical significance. Lists of differentially expressed 
genes (p<0.05) obtained from the exclusion and inclu- 
sion methods were compared, and finally only genes that 
have a p value <0.05 in both lists were considered statis- 



tically significant. The total number of genes identified 
was estimated by N m + (N um - 0.1 N um )/3.5, where N m is 
the number of genes matched to SAGE tags, N um is the 
number of SAGE tags that do not match to known genes 
or ESTs, with 10% representing the estimated sequencing 
error per SAGE tag and 3.5 r the average number of tags 
per gene in the SAGE reliable map (release 10-26-2000). 

2.4 RT-PCR and real-time PCR quantification 

For reverse transcription-PCR (RT-PCR), 1 ng of total RNA 
was reverse transcribed into cDNA. One fortieth of cDNA 
was used for PCR reaction. Samples were collected at 
different cycles and separated on a 2% agarose gel with 
ethidium bromide. Image was captured and quantified 
using NucleoVtsion 760 Image Workstation (Nucteotech, 
CA, USA). Amplification curves were generated. Sub- 
sequently, RT-PCR was done at cycles within the log 
phase of amplification. 

Real-time quantification was performed in the (Cycler (Bio- 
Rad, Hercules, CA, USA). Briefly, one fortieth of cDNA was 
used in each reaction. Six reactions were carried out for 
each gene and three independent experiments were per- 
formed. PCR mix comprised of 1X PCR buffer, 1.5 mM 
MgCI 2 , 0.1 mM dNTP, 200 nM primers (listed below), 0.05 U 
platinum Taq polymerase (Invitrogen) and 0.1 x SYBR green 
(Molecular Probes, Eugene, OR, USA). PCR was carried out 
at 94°C for 2 min, and 40 cycles of 94°C for 30 s, 55°C for 
30 s, and 72°C for 30 s. Primers used for real-time PCR 
qua nt i fic a ti on: PSA (prostate specific antigen) (Hs.1 71 995, 
5 ' -GGAAATGACC AGGCC AAG AC-3\ S'-CAACCCTG GAC 
C TCACACCTA-3'), SCMH1 (Hs.57475, S'-GCCTTGACC 
ACATCACTCCAT-3', 5'-AGGCCTAGGGCTGCAAAAG-3'), 
and clusterin (Hs.75106, 5'-GCAGGAATACCGCAAAAA 
GC-3\ 5"-GACTCAAGATGCCCCCGTAAG-3'). Standard 
samples (50, 25, 1 0, 5, 2.5, 1 and 0:5 cDNA) and experi- 
mental samples were used in real-time quantification PCR. 
Each sample was run in quadruple reactions. Standard 
curve (Ct = mX + B) was obtained, where Ct is threshold 
cycle number, X is log quantity of target molecules, m is 
curve slope and B is Y-axis intercept value. Number of 
fold induction or repression for a given target molecule 
was calculated by Qa/Qb, where Qa is the target quantity 
in experimental sample A and Qb is the target quantity 
in experimental sample B. Cta = mlogQa +■ B or Qa = 
10 (cta-B)An ( therefore, Qa/Qb = 10^*°^. 

2.5 2-D PAGE 

The procedure followed was as previously described 
[14]. Cells were solubilized in 200 of lysis buffer con- 
taining 9.5 m urea .(Bio-Rad), 2% NP-40, 2% carrier 
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ampholytes pH 4-8 (Gallard/Schlessinger, Carle Place, 
NY, USA), 2% p-mercaptoethanol and 10 mw PMSR 
Aliquots containing approximately 5x10 6 cells, were 
applied onto isofocusing gels. IEF was conducted using 
pH 4 to 8 carrier ampholytes at 700 V for 16 h, followed 
by 1000 V for an additional 2 h. The 1-D gel was loaded 
onto the 2-D gel, after equilibration in 1 25 mM Tris, pH 6,8, 
10% glycerol, 2% SDS, 1% DTTand bromophenol blue. 
For the second dimension separation, a gradient of 
11-14% of acrylamide (Serva, Hauppauge, NY, USA) 
was used. Following 2-D PAGE, proteins were visualized 
by silver staining of the gels or transferred to an Immobi- 
lon-P PVDF membrane (Millipore, Bedford, MA). 



2.6 Protein identification by mass spectrometry 

The 2-D gels were silver stained by successive incuba- 
tions in 0.02% sodium thiosulfate for 2 min, 0.1 % silver 
nitrate for 40 min and 0.014% formaldehyde plus 2% 
sodium carbonate. The proteins of interest were excised 
from the 2-D gets and destained for 5 min in 15 rnw 
potassium ferricyanide and 50 mM sodium thiosulfate 
as described [15]. Following three washes with water, 
the gel pieces were dehydrated in 100% acetonitrile for 
5 min and dried for 30 min in a vacuum centrifuge. 
Digestion was performed by addition of 100 ng of trypsin 
(Promega, Madison, Wl, USA) in 200 mM ammonium 
bicarbonate or by addition of 100 ng of the endopro- 
teinase Glu-C (Promega) in 100 mM ammonium bicarbo- 
nate. The Lys-C digestion was performed with 500 ng of 
the endoproteinase Lys-C (Roche, Mannheim, Germany) 
in 100 mM Tris-HCI, pH 9. Following enzymatic digestion 
overnight at 37°C, the peptides were extracted twice 
with 50 \iL of 60% acetonitrile/1 % TFA. After removal of 
acetonitrile by centrifugation in a vacuum centrifuge, the 
peptides were concentrated by using pipette tips C18 
(Millipore). 

Analyses were performed primarily using a PerSeptive 
Biosystem MALDI-TOF Voyager-DE mass spectrometer 
(Framingham, MA, USA), operated in delayed extraction 
mode. Peptide mixtures were analyzed using a saturated 
solution of cyano-4-hydroxycinnamic acid (CHA) (Sigma, 
St. Louis, MO, USA) in acetone containing 1% TPA 
(Sigma). Peptides were selected in the mass range of 
800-4000 Da. Spectra were calibrated using calibration 
mixture 2 of the Sequazyme peptide mass standards kit 
(PerSeptive Biosystems). The search program MS-Fit f 
developed by the University of California at San Francisco 
(http://prospector.ucsf.edu), was used for searches in the 
NCBI database. Search parameters were as follows: 
maximum allowed peptide mass error of 400 ppm, con- 
sideration of one incomplete cleavage per peptide and 



pH range between 4 and 8. MALDI-TOF mass spectro- 
metry was also used for molecular weight determina- 
tion as described [16]. In some cases, the amino acid 
sequence of some peptides of interest was determined 
by ESI MS analysis. 

3 Results 

3.1 SAGE analysis of the effect of androgen 
on gene expression 

SAGE libraries were generated from LNCaP cells cul- 
tured in the presence or absence of DHT. A total of 
123371 tags were generated of which 62 878 were from 
the LNCaP cell line and 60 493 from the LNCaP cells 
treated with DHT for 24 h (Table 1). Sequence analysis 
identified a total of 28844 distinct tags representing 
16570 genes, 1 1 243 from LNCaP and 12 203 from DHT 
treated cells. A total of 351 transcripts were differentially 
expressed at a significant level (p<0.05). Eighty-seven 
percent of transcripts matched GenBank entries; 79% 
corresponded to known sequences and 8% to ESTs. 
RNA levels for 147 genes were increased and 204 genes 
were decreased after DHT treatment (Table 2). Therefore, 
at the RNA level more genes were repressed than 
activated by androgens. Of these androgen-regulated 
genes, 149 were changed >: five-fold by DHT treatment 
(Table 3). 



Table 1. Summary of SAGE analysis in LNCaP cells 



Sample 


Total 


Distinct 


Number 


transcripts 


tags 


of genes 


-DHT 


62 878 


17 050 


11 243 


+ DHT 


60 493 


18510 


12 203 


Total 


123 371 


28 844 


16 570 


- DHT: LNCaP cells without dihydrotestosterone; + DHT: 


LNCaP cells treated with 10^ m dihydrotestosterone for 


24 h 








Table 2. DHT regulated genes in LNCaP cells 




Genes 




LNCaP vs LNCaP + DHT 






(gene #) 








p 0.05 


p 0.01 


Known genes 




277 


147 


ESTs 




29 


8 


No match 




45 


13 


Total 




351 


168 


Up-regulated by DHT (24 h) 


147 


65 


Down-regulated by DHT (24 h) 


204 


103 
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Table 3. List of candidate genes that are regulated by DHT (> 5 fold) 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 




\ 


82 


Induced 


0 


immunoglobulin heavy contant alpha 1 


I OO f Oil 


n 

u 


42 




0 


microseminoprotein, beta- 


140 


1 


AA 




n 


immunoglobulin heavy constant gamma 3 (Gm marker) 




1ft 


3J 


InHi iopH 


0 


beta-2-microglobulin 


OA OOO 

o4 *:yo 


u 


1 7 


InHi ic*af\ 


1 nnF-05 


CD74 antigen (invariant polypeptide of major histocomp ability 






\ 






complex, class II antigen -associated) 


yo i o 


u 


16 


Induced 

II IUUWVI 


1 .O0E-05 


myosin regulatory light chain 2, smooth muscle isoform 


77/1/1*3 


n 


I o 


11 IUULvU 


1 OuE-05 


actin, gamma 2, smooth muscle, enteric 


1 1 1 Q 


n 


■ j 


InHi irprl 


2 33E-05 


nuclear receptor subfamily 4, group A, member 1 


04 / OO 




o 
c 


ntjjji ooocu 


3 OOE-05 

WaW^ Uw 


KIAA0246 protein 


777 

f D f f 1 


o 


14 


Induced 


3.66E-05 


transgelin 




U 


1 4 






immunoglobulin J polypeptide, linker protein for immuno- 


76325 










alobulin aloha and mu DolVDetides 




1 
I 


17 
I / 


InHi ifpHi 

II lUUbCU 




acid phosphatase, prostate 




A 
U 


lo 




S.UUC. UvJ 


v-jun avian sarcoma virus 1 7 oncogene homolog 


78344 


-4 
I 


1 ft 

id 


muuceu 




myosin, heavy polypeptide 1 1 , smooth muscle 


83 00b 


i Q 

iy 


o 


Danrac eorl 
nc|Jioov>cU 


1 .50E-04 


nrotein 

^/^l 1 1 w \J Ul V Iwll 1 


OCO OHO 


1 o 


1 


Danroccorl 


3.19E-04 


nuclear distribution gene C (A. nidulans) homolog 




u 


1 1 


InHi if*oH 
II lUUtcU 


4.20E-04 


Novel 


CjOO 1 abb 1 vjo 


1 h 

1 1 


n 
u 


Ronroc coH 
nc|JicoooU 


6.63E-04 


Novel 


GACToAUAO 1 


lb 






8.40E-04 


Novel 


284 296 


0 


10 


Induced 


O.OUE-UH 


Hnmo Qnnian<; SI JRF-4 mRNA COITIDlete cdS 


75105 


13 


1 


Repressed 


^ iflC AO 


emopamii-Dinaing proiein ^oiorui lauiiioiaotj/ 


128075 


10 


0 


Repressed 


1 .30E-03 


bo is 


154162 


15 


2 


Repressed 


1 .43E-03 


ADP-ribosylation factor-like 2 


6895 


1 


12 


Induced 


1 .49E-03 


actin reiaieo protein cjs complex, sudumu o \c i r<uj 


143 240 


17 


3 


Repressed 


1.63E-03 


ESTs 


211582 


0 


9 


Induced 


1 .66E-03 


myosin, light polypeptide kinase 


180266 


0 


9 


Induced 


1 .66E-03 


tropomyosin 2 (beta) 


93002 


14 


2 


Repressed 


2.53E-03 


ubiquitin carrier protein E2-C 


17883 


14 


2 


Repressed 


2.53E-03 


protein phosphatase 1(3 (Tormeny magnesium- 










HonDnHont namms icrtform 

(JcLJUnUcjl 1 L t UCllfilllCI loUIUlMI 


126023 


14 


2 


Repressed 




F^T<s Hinhlx/ similar to NTC1 HUMAN NEUROGENIC LOCUS 

CO 1 O f niUlllyOlll Midi IU 1 1| 1 \m/ 1 1 1 \J If 1 /^l ^ 1 t W WI IWV< ^1 ^ 1 V WW -V W V 








NOTCH PROTEIN HOMOLOG i PRECURSOR 


69 469 


o 
c 




inauceu 


2.92E-03 


dflndritir cell Drotein 


( i oyy 


9 


IO 


InHi ippH 
II iuulcu 


2.92E-03 


tropomyosin 1 (alpha) 


ooc ecu 


o 

c. 




InHi icorl 


3 O7E-03 


Human rearranged immunoglobulin lambda light chain mRNA 


119209 


0 


Q 

o 


Induced 


O.U/ C"UO 


inci ilin-MUA nrnwth fartor bindina Drotein 7 

II loUlll 1 lll\0 m 1 w VV U 1 1 uulul fcyll lull ly fJ> 11 1 1 


1 73 043 


0 


a 
O 


tnaucea 




miata<itflQiQ-a«i^npiatfld 1 -like 1 


75866 


ft 


Q 
O 


inuucea 


O.Uf U*UO 


Himpthvlarninine dimethvlaminohvdrolase 1 

LIU 1 ICll Iy lul y I" *" Villi 1 IwU IJ It** HKIvi • J t-ii wiuww i 


171 695 


0 


8 


Induced 


O.U / C"UO 


Hi ial erkorifir'itv nhn^nhataQA 1 
UUal oLftJC<l IIOI Ly |Jl lUo^Jl idicioc i 


TACGbbbAI 1 


U 


Q 
O 


InHi ir^aH 




Novel 


172 791 


13 


2 


Repressed 


4.19E-0i 


ubiquitously-expressed transcript 


13561 


13 


2 


Repressed 


4.19E-03 


ESTs, Weakly similar to dJ37E16.5 [H. sapiens) 


98260 


8 


0 


Repressed 


4.34E-03 • 


.ESTs 


CATAAGACTT 


8 


0 


Repressed 


4.64E-03 


> Novel 


TACGGGGACA 


2 


12 


Induced 


5.76E-03 


Novel 


180034 


12 


2 


Repressed 


6.91 E-03 


cleavage stimulation factor, 3* pre-RNA, subunit3, 77kD 


5753 


12 


2 


Repressed 


6.91 E-03 


inosito(myo)-1 (4)-monophosphatase 2 


278941 


10 


1 


Repressed 


6.94E-03 


PRO0628 protein 


19500 


0 


7 


Induced 


7.25E-03 


nuclear localization signal deleted in velocardiofacal syndrome 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


9006 


0 


7 


Induced 


7.25E-03 


VAMP (vesicle-associated membrane protein) -associated 
protein A (33kD) 


26471 


0 


7 


Induced 


7.25E-03 


Homo sapiens clone HQ0692 


AACTGCTGGC 


12 


2 


Repressed 


7.63E-03 


Novel 


GCTGACCGTC 


7 


0 


Repressed 


8.59E-03 


Novel 


CCCCCCTGTC 


7 


0 


Repressed 


8.59E-03 


Novel 


243901 


2 


11 


Induced 


8.82E-03 


Homo sapiens cDNA FU20738 fis f clone HEP08257 


119000 


1 


9 


Induced 


9.12E-03 


actinin, alpha 1 


78596 


1 


9 


Induced 


9.12E-03 


proteasome (prosome, macro pain) subunit, beta type, 5 


171 955 


7 


0 


Repressed 


9.61 E-03 


trophinin associated protein (tastin) 


180545 


7 


0 


Repressed 


9.61 E-03 


Homo sapiens mRNA for hypothetical protein (TR2/D1 5 gene) 


77 71 Q 


7 


u 


PonraccoH 


Q 61 F-03 


gamma-glutamyl carboxylase 


AA77 
Hot f 


7 

I 


n 


nu|J i ooocu 


9 61 E-03 


CGI-51 protein 


OO 7QK 


7 


n 


Donroccorl 




ESTs 


7Q QOC 


7 
1 


n 

LI 


nCUl COOCU 


9 61 E-03 


Homo sapiens SWI/SNF-related, matrix-associated, actin- 










dependent regulator of chromatin D1 (SMARCD1) mRNA 


1 1Q 177 


g 


1 


Repressed 


1 .26E-02 


ADP-ribosylatin factor 3 


pcv^fi Af^p Ann 


g 




Repressed 


1 .29 E-02 


Novel 


182 217 


0 


6 


Induced 


1 .32E-02 


succinate-CoA ligase, ADP-forming, beta subunit 


7£l 1 nR 

/ O l UD 


n 


w 


Inriurpd 


1 .32E-02 


clusterin (complement lysis inhibitor, SP-40, 40, sulfated gly- 
coprotein 2, testosterone-repressed prostate message 2) 


25631 1 


o 


6 


Induced 


1 .32E-02 


granin-like neuroendocrine peptide precursor 


1 03 1 80 


o 


6 


Induced 


132E-02 


DC2 protein 


TAOfifiGGATG 


1 


8 


Induced 


1 .70E-02 


Novel 


A A A^AAATPA 


o 

c. 


I u 


Inrli tf art 


1 .70E-02 


Novel 




o 


10 


Inriurfid 

II lUUUVU 


1.71E-02 


CS box-containing WD protein 




O 
£. 


1 u 


InHt jppH 

If IUUuvU 


1 .71E-02 


glioblastoma overexpressed 




O 


10 


Inrii ippH 

11 luUvvU 


1.71E-02 


ESTs, Weakly similar to Similar to S, cerevisiae hypothetical 
protein L3111 


ATGG HTG ATP 


5 


o 


Repressed 


1 .75E-02 


Novel 




5 


o 


Repressed 


1 .75E-02 


Novel 


ACATCATCAG 


6 


0 


Repressed 


1.75E-02 


Novel 




c 

D 


n 
u 




1 .75E-02 


Novel 






o 
o 


1 1 lUUlrCU 


1 .78E-02 


ESTs 






Q 
O 


In/Hi i/*oH 


1 .78E-02 


ATPase, Ca++ transporting, cardiac muscle, slow twitch 2 


7Q1 i 




Q 
O 


II lUUlrCU 


1 .78E-02 


KIAA0323 protein 


007 /inn 
CC ( 4UU 




Q 
O 


inuuucvj 




mitogen -activated protein kinase kinase kinase kinase 3 


77269 


1 


8 


Induced 


1.78E-02 


guanine nucleotide binding protein (G protein), alpha inhibiting 

duuviiy puiyjjcpuuo c 


794o 


c 
D 


u 


DcnroccaH 
ncpicaocu 




RPRS-mpdiatina Drotein 


^/ob44 


e 
D 


u 


Qanrae cqH 
ncpicoocu 


I.OI C. *Ut 


acetyl -Coenzyme A acetyltransf erase 2 (acetoacetyl 
Coenzyme A thiolase) 


23111 


6 


0 


Repressed 


1.81E-02 


phenylalanine-tRNA synthetase-like 


31442 


6 


0 


Repressed 


1.81E-02 


RecQ protein-like 4 


38628 


6 


0 


Repressed 


1.81 E-02 


hypothetical protein 


77422 


6 


0 


Repressed 


1.81E-02 


proteolipid protein 2 (colonic epithelium-enriched) 


211973 


6 


0 


Repressed 


1.81 E-02 


homolog of Yeast RRP4 (ribosomal RNA processing 4), 
3'-5'-exoribonuclease 


171 075 


6 


0 


Repressed 


1.81 E-02 


replication factor C (activator 1) 5 (36.5kD) 


89781 


6 


0 


Repressed 


1.81 E-02 


upstream binding transcription factor, RNA polymerase I 


154149 


6 


0 


Repressed 


1.81 E-02 


apurinic/apyrimidinic endonuclease(APEX nuclease)-like 2 



protein 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


279 772 


8 


1 


Repressed 


2.28E-02 


brain specific protein 


CUJ \Jj i 


8 




Repressed 


2.28E-02 


ESTs, Weakly similar to WW domain binding protein 1 1 
(M. muscu/us) 


75 658 


8 


1 


Repressed 


2.28E-02 


phosphorylase, glycogen; brain 


GGGCAGCTGT 


8 


1 


Repressed 


2.37E-02 


Novel 


1 05440 


0 


5 


Induced 


2.43E-02 


hepatocyte nuclear factor 3, alpha 


1 1 8244 


0 


5 


Induced 


2.43E-02 


protein phosphatase 2 t regulatory subunit B (B56), delta iso- 
form 


82389 


0 


5 


Induced 


2.43E-02 


CGI-1 18 protein 


31638 


0 


5 


Induced 


2.43E-02 


restin (Reed-Steinberg cell-expressed intermediate filament- 
associated protein) 


12013 


0 


5 


Induced 


2.43E-02 


ATP-binding cassette, sub-family E (OABP), member 1 


12797 


0 


5 


Induced 


2.43E-02 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 16 


7736 


0 


5 


Induced 


2.43E-02 


hypothetical protein 


274479 


0 


5 


Induced 


2.43E-02 


NME7 


153138 


0 


5 


Induced 


2.43E-02 


origin recognition complex, subunit 5 (yeast homoiog)-like 


16034 


0 


5 . 


Induced 


2.43E-02 


ESTs 


270072 


0 


5 


Induced 


2.43E-02 


ESTs 


187035 


0 


5 


Induced 


2.43E-02 


ESTs 


140452 


0 


5 


Induced 


2.43E-02 


cargo selection protein (mannose 6 phosphate receptor bin- 
ding protein) 


182265 


0 


5 


Induced 


2.43E-02 


keratin 19 


19762 


0 


5 


Induced 


2.43E-02 


ESTs, Weakly similar to unknown (D. melanogastet) 


317 


0 


5 


Induced 


2.43E-02 


topoisomerase (DNA) 


6236 


0 


5 


Induced 


2.43E-02 


ESTs 


GCTGGAGCCT 


5 


0 


Repressed 


3.03E-02 


Novel 


CCAGTGCT.CA 


5 


0 


Repressed 


3.03E-02 


Novel 


ACCCTACATA 


5 


0 


Repressed 


3.03E-02 


Novel 


GGGGAAATCT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACTGGTACTG 


5 


0 


Repressed 


3.03E-02 


Novel 


GCTCCGGTGT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACAGTGGTGA 


5 


0 


Repressed 


3.03E-02 


Novel 


7869 




7 


Induced 


3.04E-02 


lysophosphatidic acid acyltransferase-delta 


12101 


1 


7 


Induced 


3.04E-02 


hypothetical protein 


266940 




7 


Induced 


3.04E-02 


t-complex-associated-testis-expressed 1-like 1 






7 

f 




3.04E-02 


integrin-linked kinase 






7 


Induced 


3.04E-02 


6-pyruvoyltetrahydropterin synthase 


173611 


1 


7 


Induced 


3.04E-02 


NADH dehydrogenase (ubiquinone) Fe-S protein 2 (49kD) 
(NADH-ceonzyme Q reductase) 


102469 


1 


7 


Induced 


3.04E-02 


putative nuclear protein 


78825 




7 


Induced 


3.04E-02 


matrin 3 


284465 


; 


7 


Induced 


3.04E-02 


ESTs 


30738 


1 


7 


Induced 


3.04E-02 


hypothetical protein FLJ10407 


78687 




7 


Induced 


3.04E-02 


neutral sphingomyelinase (N-SMase) activation assoviated 
factor 


92381 


1 


7 


Induced 


3.04E-02 


nudix (nucleoside diphosphate linked moiety X)-type motif 4 


242039 


5 


0 


Repressed 


3.26E-02 


EST 


4766 


5 


0 


Repressed 


3.26E-02 


DKFZP586O0120 protein 


283109 


5 


0 


Repressed 


3.26E-02 


hypothetical protein DKFZp762L1710 


192853 


5 


0 


Repressed 


3.26E-02 


ubiquitin-conjugating entyme E2G 2 (homologous to yeast 
UBC7) 


153678 


5 


0 


Repressed 


3.26E-02 


reproduction 8 
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Table 3. Continued 



UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


251317 


5 


o 


Repressed 


3.26E-02 


EST 




5 


o 


Repressed 


3.26E-02 


selenoprotein X 




5 


o 


Repressed 


3.26E-02 


ESTs 


102456 


5-. , 


o 


Repressed 


3.26E-02 


survival of motor protein interacting protein 1 


284250 


5 


o 


Re Dressed 


3.26E-02 


AD-003 protein 


251871 


5 


o 


Repressed 


3.26E-02 


CTP synthase 


270480 


7 


1 


Repressed 


3.92E-02 


ESTs, Weakly similar to ALU5JHUMAN ALU SUBFAMILY SC 
SEQUENCE 


26655 


7 


1 


Repressed 


3.92E-02 
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CAP: LNCaP cells without dihydrotestosterone 

DHT: LNCaP cells treated with 10 -8 dihydrotestosterone for 24 h 

UGD: unigene ID 

Numbers in CAP and DHT columns are the number of tags observed in respective samples 



3.2 Confirmation of SAGE data 

To confirm the differential expression pattern at the RNA 
level, semiquantitative RT-PCR assay was performed on a 
group of selected genes. One \ig of total RNA was reverse 
transcribed into cDNAwith oligo dT. cDNA was amplified by 
PCR for various cycles. PCR products were separated on 
agarose gels and quantified by densitometry. PCR ampli- 
fication curves were plotted for each gene, and data within 
the logarithmic phase of amplification were used for quanti- 
fication (Fig. 1A, B and C). The reproducibility of gene regu- 
lation by androgen was confirmed in three independent 
experiments by monitoring the induction of PSA in the pres- 
ence or absence of cyclohexamide (Fig. 1 B). The kinetics 
of gene induction or repression was determined for PSA 
(Hs.'171995), clusterin (Hs.75106) and SCMH1 (Hs.57475) 
genes. Typical examples of kinetics are shown in Figure 
1C. PSA was induced at 4-6 h, peaked between 6-20 h, 
and gradually declined after 20 h post-treatment of DHT. 
Clusterin was induced within 0.5-1 h and gradually declined 
after 6-1 2 h. SCMH1 was repressed 2-4 h post-treatment. 
Expression of PSA, clusterin, and SCMH1 was quantified by 
real-time PCR (Fig. 1 D), 

3.3 Changes in protein expression induced 
by DHT 

Protein lysates of LNCaP cells cultured in the presence of 
DHT for 72 h and cells cultured in parallel in the absence 
of DHT were subjected to 2-D PAGE (Fig. 2). Following sil- 



ver staining, gels were digitized prior to computer-based 
matching and quantitative analysis, as previously des- 
cribed [17]. Of a total of 1031 protein spots matched 
between 2-D patterns of DHT treated and untreated cells, 
a set of 32 protein spots increased in intensity by at least 
two-fold in two independent experiments. Likewise, a set 
of 1 2 protein spots decreased in intensity by at least two- 
fold in these two independent experiments. A two-fold 
change in integrated intensity by silver staining represents 
on average a three-fold change in amount of protein, 
based on prior quantitative studies [17]. 

The 44 protein spots that changed in intensity as a result 
of DHT treatment were excised from the gels, digested 
with trypsin, and subsequently analyzed by MALDI-TOF 
MS. The resulting spectra were used to identify the pro- 
teins using the MS -Fit search program. Of the 44 spots 
excised from the gels, 29 were identified without ambi- 
guity and consisted of 21 up-regulated and eight down- 
regulated proteins (Table 4). The identified proteins repres- 
ented a heterogeneous group that included cytoskeletal 
proteins (e.g. tropomyosin, a tubulin), metabolic enzymes 
(e.g. adenine phosphoribosyl transferase, p 1 ,4 galactosyl 
transferase, galactocerebrosidase), and the products of 
previously described androgen responsive genes (e.g. 
SRY, selenium binding protein) [18, 19]. Specific anti- 
bodies confirmed the identification based on MS for all 
proteins analyzed by Western blotting, which included a 
tubulin, myosin light chain isoforms, nucleoside diphos- 
phate kinase A, glyceraldehyde 3-phosphate dehydro- 
genase (G3PD), and tropomyosin (data not shown). 
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Rgure 1. Kinetics and quanti- 
fication of androgen-regulated 
genes. Total RNA from un- 
treated and DHT treated. 
LNCaP cells was reverse 
transcribed. cDNA was PCR 
amplified for various cycles. 
PCR amplification curves were 
plotted for each gene. A cycle 
number within the logarithmic 
phase of amplification was 
selected for semi -quantitative 
analysis. cDNA was also used 
for real-time quantitative PCR. 
(A) An example of RT-PCR 
amplification of PSA at cycles 
between 20-30. (B) RT-PCR 
of PSA in three independently 
treated LNCaP samples, h: 
hours; 0: no DHT; 24: treated 
with 1CT 9 M DHT for 24 h; -:' 



without cycloheximide; +: with 5 u^/mL of cycloheximide for 24 h. (C) An example of three independent kinetics experi- 
ments for PSA, clusterin and SCMH1. (D) Comparison of SAGE results with that of real-time quantitative PCR for PSA, 
clusterin and SCMH1 . f: induction; |: repression. 



Corresponding SAGE data were available for most of the 
proteins affected by DHT treatment that were identified. It 
was therefore of interest to determine if the changes 
observed at the protein levels were matched by concor- 
dant changes at the mRNA level. Remarkably, for most of 
the proteins identified, there was no appreciable concor-. 
dant change at the RNA level (Table 4). 



4 Discussion 

We have compared the gene expression profile of 
LNCaP, an androgen responsive prostate cancer cell 
line, with or without androgen (DHT) treatment. Approxi- 
mately 350 of 16570 expressed genes detected at the 
RNA level were affected by dihydrotestosterone at the 
p<0.05 level. The DHT responsive genes included 
known genes, ESTs, and novel genes. As expected, we 
saw an induction of genes that are well known to be 
regulated by androgens. For instance, we found a 1.7- 
fold induction in kallikrein 3/PSA, a 7.6-fold induction in 
prostatic kallikrein 2, and a 15.7-fold induction in pro- 
static acid phosphatase by DHT in LNCaP cells. We 
also saw a five-fold induction in NKX3.1/NKX3A and 
three-fold in fatty acid synthase; two previously identified 
androgen-regulated genes [20, 21]. Interestingly p-micro- 
seminoprotein, reported to be reduced or lost in prostate 
tumor [22], was up-regulated more than 40-fold in 



LNCaP cells by DHT. More significantly, our data indicate 
that genes involved in a variety of tumor cell functions 
such as growth, apoptosis, and metastasis, are directly 
or indirectly regulated by androgens. In addition, it is 
noteworthy that both p-actin and G3PD, the two most 
frequently used loading controls, were up-regulated 
approximately two-fold by DHT (p<0-01). Thus, these 
two genes may not be appropriate controls in some 
experiments. 

Serial analysis of androgen-regulated gene expression 
provides us with a list of candidate genes. However, 
many factors such as the dose of DHT and the time of 
cell exposure to DHT and other unknown experimental 
variations will affect the level of gene expression. There- 
fore, it is important to confirm the SAGE results with alter- 
native methods. We performed semiquantitative RT-PCR 
(Fig. 1 B) on approximately 20 genes and real-time quanti- 
tative PCR on 10 genes in at least three independently 
DHT treated LNCaP samples. We noticed experimental 
variations in every gene determined. Some genes such 
as PSA (Fig. 1B) have smaller variations and some have 
larger variations. In addition, the fold of alteration in 
expression identified by SAGE is different for some genes 
compared to that identified by real-time quantitative PCR. 
This is probably due to the technical limitation of SAGE. 
Serial analysis of gene expression is highly quantitative 
for genes whose tags are detected at large numbers. 
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Figure 2. 2-D PAGE patterns of 
untreated (A) and LNCaP cells 
treated with DHT for 72 h (B). 
Identified proteins that were 
reduced in levels following DHT 
treatments are shown in (A) and 
induced proteins are shown in 
(B). Numbered spots represent 
either proteins that were chan- 
ged in level that have not been 
identified or isoforms of identi- 
fied proteins. 
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Numbers in protein are the intensity of protein spots 

Numbers in mRNA columns are the number of SAGE tags detected 

ND: not detected 

f: increase; J: decrease; =: equal 
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When the tag number detected for a given gene is close 
to zero, the quantitative nature of SAGE is compromised. 
For instance, in approximately 60 000 transcripts of each 
sample, the PSA tag was detected 42 times in LNCaP and 
70 times in LNCaP treated with DHT. However, the tag for 
clusterin was detected 0 times and 6 times, and the tag 
for SCMH1 5 times and 0 times in the corresponding sam- 
ples. Therefore, the SAGE data of PSA is more reliable 
than that of clusterin and SCMH1 . Indeed, SAGE data of 
PSA is virtually identical to that of real-time quantitative 
PCR, whereas the SAGE data of clusterin and SCMH1 dif- 
fer from that of quantitative PCR (Fig. 1 D). 

PSA is probably the best-known androgen-regulated 
gene. We were surprised to see only 1.7-fold induction 
by DHT. However, kinetics experiments indicate that PSA 
is induced at 4-6 h, peaked between 6-20 h, and gradu- 
ally declined 20 h post-treatment of DHT. Since the SAGE 
experiment was done in samples treated for 24 h with 
DHT, PSA mRNA level was likely to have declined from 
its peak level. Indeed, we could detect 5-1 0 fold induction 
of PSA at 6-8 h post-treatment. Clusterin was reported as 
an androgen-repressed gene in the rat prostate [23]. 
Recent evidence indicates that clusterin may not be 
directly androgen-repressed, but regulated by apoptotic 
stimuli 124]. Our results suggest that clusterin was 
induced within 0.5-1 h, gradually declined after 6-12 h, 
and after 24 h reduced to a lower level than that of 
untreated cells (Fig. 1C). 

Another important question is how many of the andro- 
gen-regulated genes Identified are directly Induced or 
repressed by androgens. In order to address this ques- 
tion, detailed analyses must be done for each gene. As 
the first step, we will divide the androgen-regulated 
genes identified in this study into two groups; the cyclo- 
hexamide-sensitive group whose induction or reduction 
of expression by DHT is blocked by the protein synthesis 
inhibitor and the cyclohexamide-insensitive group whose 
induction or reduction is not affected by cyclohexamide. 
We consider that genes in the latter group are directly 
regulated by DHT. Preliminary results suggest that 
approximately 20% of the 20 genes studied by RT-PCR 
are cyclohexamide-insensitive. Experiments are under 
way to determine the cyclohexamide sensitivity of all of 
the 149 genes listed in Table 3. Further characterization 
of androgen-regulated genes may provide some ctues 
on the transition from hormone sensitive to hormone 
refractory prostate cancer. 

A relatively small set of genes could be analyzed at the 
protein level, largely due to the limited sensitivity of 2-D 
PAGE Nevertheless, a substantial number of detected 
proteins (44 of 1031 proteins analyzed, 4.3%) were 
affected by DHT treatment. Some of the proteins in this 



subset that were identified included the products of 
genes that were previously shown to be affected by 
androgens, namely selenium binding protein, brain speci- 
fic creatine kinase, SRY protein, B23 and G3PD. Using 
a subtractive approach, the human selenium-binding 
protein gene was shown to be differentially expressed 
in LNCaP and reversibly down-regulated by exogenous 
androgen in a concentration-dependent manner, in con- 
cordance with our findings at the protein level [19]. An 
increase in levels of the brain specific creatine kinase B 
chain in response to androgen has also been described 
[25], Likewise the SRY gene has been found to be respon- 
sive to androgen stimulation in LNCaP cells [18]. Also, a 
number of glycolytic enzymes including G3PD which 
were affected at the protein level have been found to be 
responsive to androgen stimulation [26]. Androgenic reg- 
ulation of the amount and phosphorylation of the protein 
B23, included in our list of affected proteins, has been 
previously described and found to be related to the early 
changes associated with androgen mediated growth of 
the prostate [27]. 

Corresponding SAGE data was available for most of the 
proteins affected by DHT treatment that were identified. 
Interestingly, for most of the proteins identified, there 
was no appreciable concordant change at the RNA level. 
There are several potential explanations for this lack of 
concordance. A particular gene may be represented by 
more than one protein isoform in 2-D gels. For example, 
in the identified group, three proteins (myosin light chain, 
G3PD and seJenium binding protein) were represented 
by more than one isoform. Thus, a source of discordance 
between RNA and protein data may be that the protein 
change is limited to a particular isoform of a protein and 
not to overall protein products of a particular gene. 
Nevertheless, a change in a particular isoform is inform- 
ative and biologically meaningful and may not be pre- 
dictable from RNA data. A lack of concordance between 
RNA and protein data may also reflect either translational 
control, post-translational modifications, or changes in 
protein turnover due to DHT treatment. Yet another 
explanation for a lack of concordance could be a lag 
time for changes at the RNA level to be reflected in a 
protein change. It follows from the above considerations 
that monitoring gene expression at both RNA and protein 
levels may provide complementary information that could 
not be ascertained by solely measuring RNA or protein. 

We would like to thank Dr. Ken Kinzler (Johns Hopkins 
University) for providing SAGE software. This study is par- 
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Based on biomedical literature databases, we tried a first step for constructing a gene expression 
"data warehouse" specific to human colorectal cancer (CRC). Results of genome-wide tran- 
scriptomic research were available from 12 studies, using various technologies, namely, SAGE, 
cDNA and oligonucleotide arrays, and adaptor-tagged amplification. Three studies analyzed CRC 
cell lines and nine studies of human samples. The total number of patients was 144. Out of 982 
up- or down-regulated genes, 863 (88%) were found to be differentially expressed in a single 
study, 88 in two studies, 22 in three studies, 7 in four studies, and only 2 genes in six studies. 
Eight large-scale proteomics studies were published in CRC, using 2-D-, SDS- or free-flow elec- 
trophoresis, involving only 11 patients. Out of 408 differentially expressed proteins, 339 (83%) 
were found to be differentially expressed only in a single study, 16 in three studies, 10 in four 
studies, 3 in five, and 1 in eight studies. Confirmation at proteome level of results obtained with 
large-scale transcriptomics studies was possible in 25%. This proportion was higher (67%) for 
reproducing proteome results using transcriptomics technologies. Obviously, reproducibility and 
overlapping between published gene expression results at proteome and transcriptome level are 
low in human CRC. Thus, the development of standardized processes for collecting samples, 
storing, retrieving, and querying gene expression data obtained with different technologies is of 
central importance in translational research. 
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1 Introduction 

Translational research in human colorectal cancer (CRC) is 
applying a large spectrum of molecular biology, cellular bi- 
ology, and advanced validation tools. In particular, genome - 
wide techniques are now applied to decipher modifications 
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in gene expression. In recent years, transcriptomics and 
proteomics tools have been broadly applied in CRC. The 
hope is that the new data obtained will now allow a classifi- 
cation of disease on molecular basis, deep insights into the 
pathophysiology of CRC, prognostic statements, and finally a 
systematic search for diagnostic and therapeutic targets. 

Because of the complexity of the biological system under 
investigation, the most significant contribution of transla- 
tional research in CRC is expected to derive not from the 
analysis of single experiments but from libraries of experi- 
ments. In other words, the results obtained so far by trans- 
lational research tools in different clinical and experimental 
settings need to be compared, contrasted, and if possible 
synthesized. Thus, based on the biomedical literature data- 
bases, we tried a first step for constructing such "data ware- 
house" specific to human CRC. 
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The specific aims of the present study were to screen 
transcriptomics and proteomics studies published in CRC, 
and to elaborate a kind of "meta-analysis of gene expression" 
in CRC. Our hypothesis was that we would be able to deter- 
mine a common set of genes and gene products that are up- 
or down-regulated in human CRC by overlapping results 
obtained by different authors. 



2 Materials and methods 

This is a literature survey of gene expression data published 
in human CRC. A Medline search of reports published in 
English from 1990 to 2004 using the terms "colon cancer" 
and "gene expression" with the limit "human" was performed 
and yielded 1979 articles. All abstracts were reviewed and a 
related article search was performed on appropriate abstracts. 
Articles were selected by a consensus of two reviewers (E.S., 
M.R.) that satisfied these predetermined criteria: sample ori- 
gin (human and/or cell lines) and preparation detailed, tech- 
nology for gene expression studies defined (transcriptomics 
and/or proteomics studies), and quantitative results on over- 
expression or underexpression available. Studies concerning 
^ single genes or arbitrarily selected genes were discarded 
Results obtained in animal models were not considered. 

Data were entered into an Excel working sheet (Micro- 
soft, Seattle, USA). Since gene expression data were not 
always obtained with quantitative, but in most cases semi- 
quantitative gene expression analysis technologies, no 
expression values or ratios were entered into the database. 
Only genes reported as being over- or underexpressed were 
entered. No threshold was defined so that some genes 
defined as differentially expressed might have shown only 



marginal differences. Only descriptive statistics are provided, 
which were obtained with the in-built tools of the Excel soft- 
ware. 

Various methods were used for large-scale translational 
research in CRC, including transcriptomics and proteomics 
technologies. Results of genome-wide transcriptomic re- 
search are available from 12 studies, using various technolo- 
gies, namely, SAGE (five studies), complementary DNA 
(cDNA) arrays (five studies), and oligonucleotide array (one 
study), as well as adaptor-tagged amplification (one study). 
Three studies analyzed CRC cell lines and nine other studies 
analyzed human samples. The total number of patients pro- 
filed in all studies was 144. Out of the nine studies with hu- 
man samples, only two studies with a total of only 22 patients 
were performed using purified epithelial cells (one of them 
using cDNA arrays, the other one using SAGE technology). 
An overview of these, studies is provided in Table 1. 

Eight large-scale proteomics studies were published in 
CRC using various technologies, namely, 2-D PAGE (six 
studies), SDS-PAGE (one study), and free-flow electrophore- 
sis (one study). In total, all proteome studies in CRC involved 
only 11 patients. An overview of these studies is provided in 
Table 2. 



3 Results 

One thousand two-hundred and forty genes have been 
reported to be dysregulated (up* and/or down-regulated) in 
human CRC, representing about 5% of the 20000-25 000 
human genes. 



* The complete dataset can be consulted: 
http://www.chirurgiebethel.de 



Table 1. Large-scale or genome-wide transcriptomics studies in human CRC, using cell lines or human tissues. All studies together 
involved only 144 patients 

Author Year Sample Preparation Method Number 

of patients 



Zhang etal. [181 


1997 


Normal mucosa and primary tumors 


Whole tissue 


SAGE 


2 


SAGE-NETI19] 


1997 


Primary tumors and cell lines 


Cell lines 


SAGE 




Zhang etal. [18] 


1997 


Cell lines 


Cell tines 


SAGE 




Parle-McDermott et al. [20| 


2000 


Cell lines 


Cell lines 


SAGE 




Yanagawa etal. [211 


2001 


Paired norma) and cancer 


Whole tissue 


cDNA 


20 


Takemasa etal. [22] 


2001 


Paired normal and primary tumors 


Whole tissue 


cDNA 


20 


Nottermann etal. [231 


2001 


Paired normal mucosa and primary 


Whole tissue 


Oligonuc- 


22 






tumors, or adenoma 




leotide array 




Buckhaultseta/. [241 


2001 


Normal mucosa, adenoma and 


Purified epithelial 


SAGE 


6 






primary tumors 


cells 






Birkenkamp etal. [25] 


2002 


Paired normal and primary tumors 


Whole tissue 


cDNA 


27 


Lin etal. [26] 


2002 


Paired normal mucosa and primary 


Purified epithelial * 


cDNA 


16 






tumors, or adenoma 


cells 






Muro etal.[27) 


2003 


Normal and cancer 


Whole tissue 


Adaptor-tagged 


11 


Williams etal [28] 


2003 


Paired normal mucosa and primary 


Whole tissue 


cDNA . 


20 



tumors, normal and adenoma 
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Table 2. Large-scale or proteomics studies in human CRC, using cell lines or human tissues. All studies together 
involved only nine patients 



Author 


Publication 


Sample 


Method 


Number 




year 






of patients 


Reymond etal. [91 


1997 


Purified epithelial cells 


2-D PAGE 


1 


Simpson etal. [291 


2000 . 


Cell lines 


SDS-PAGE 




Lawrie etal. [301 


2001 


Purified epithelial cells 


2-D PAGE 


4 


Simpson etal. [311 


2001 


Cell lines 


Free-flow electrophoresis 




Medjahed etal. [321 


2003 


In silico 


2-D PAGE 




Demalte et at. [33] 


2003 


Cell lines 


2-D PAGE 




Stierum etal. [34] 


2003 


Cell lines 


2-D PAGE 




Friedman etal. [35] 


2004 


Human whole tissue 


2-D PAGE 


6 



The vast majority of dys regulated genes in human CRC 
was found using transcriptomics tools: a total of 982 genes 
was found to be differentially expressed in at least one of 
these 12 transcriptomics studies. Out of these 982 genes, 863 
(88%) were found to be differentially expressed only in a 
single study, in other words these results have not been 
reproduced so far. The other findings could be reproduced in 
two or more transcriptomics studies: 88 genes were found to 
be differentially expressed in two studies, 22 in three studies, 
7 in four studies, no gene in five studies, and 2 genes in six 
studies. The most cited genes are listed in Tables 3 (up-reg- 
ulation) and 4 (down-regulation). 

A total of 408 proteins were found to be differentially 
expressed in human CRC in at least one study. Out of these 408 
molecules, 339 (83%) were mentioned in a single study, in 
otherwords these results could not be reproduced. Differential 
expression of the remaining 70 proteins were mentioned with 
following frequenc: 40 were mentioned in two proteomics 
studies, 16 in three studies, 1 0 in four studies, 3 in five studies, 
and a single protein in eight studies. The most cited proteins 
are listed in Table 5. It has to be noted that only a single study 
[1] provided differential display protein expression data 
obtained in the human patient using whole tissue biopsy. 

It is also difficult to reproduce transcriptomics results 
with proteomics tools. Out of 982 genes found to be differ- 
entially expressed in human CRC by t genome-wide tran- 
scriptomics technologies (Table 6a), only 177 (18%) have 
been confirmed using proteome technologies. When the 
genes reported to be differentially expressed in three and 
more transcriptomics studies (n = 31) are compared with 
those reported to be differentially expressed in three and 
more proteomics studies (n=30), only two genes (actin: 
ACTB_HUMAN and creatin-kinase: KCRB.HUMAN) can 
be matched. Thus, the probability of reproducing a gene 
expression result obtained at transcriptome level is low when 
using proteomics technologies. 

In contrast, there is a better reproducibility when prote- 
omics results are verified using transcriptomics tools 
(Table 6b). In fact, when the subset of 30 proteins that are 
consistently (in three studies or more) reported to be dysreg- 
ulated in CRC is compared with transcriptomics results, 20 

j 
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genes can be matched, representing about two-thirds. Thus, 
the probability of being able to reproduce a proteomics result 
using transcriptomics tools is more than three times higher 
than the opposite way (67 vs. 18%). However, it has to be 
noted that in 16% of cases, results remain unclear or even 
contradictory. 

Both in the transcriptomics and proteomics studies, 
many genes and factors were found to be differentially regu- 
lated that obviously do not play a causal role in CRC carci- 
nogenesis. 

4 Discussion 

This study was aimed at screening transcriptomics and pro- 
teomics studies published in CRC, and to elaborate a land of 
meta-analysis of gene expression in CRC in order to generate 
a data warehouse that would be useful in translation^ 
research. A significant number of translational research 
studies have been published in CRC, starting in 1997 with 
SAGE technology, followed by an increasing number of array- 
based transcriptomics studies since 2001, and more recently 
by several proteomics studies. This has created a significant 
amount of data which we were then able to compile. 

Our starting hypothesis was that we would be able to 
determine a common set of genes and gene products that are 
up- or down-regulated in human CRC by comparing results 
obtained by different authors. This endeavor was not very 
successful; obviously, overlapping between published gene 
expression results both at proteome and transcriptome level 
is low in human CRC. In fact, more than 80% of results 
could not be reproduced. Several explanations can be pro- 
vided to explain this lack of reproducibility. 

First, the number of patients included in the studies is 
low (alltogether 144 patients in transcriptomics, 11 in prote- 
omics studies); some high impact publications having been 
conducted on samples of only two patients [18]. This is a 
problem because interindividual genetic variability is high in 
human CRC [2]. Thus, it is allowed to hypothesize that some 
results attributed to gene dysregulation might be caused by 
genetic diversity rather than by cancer-specific traits. 
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Table 3. Results of transcriptomics research in human CRC. Most cited up-regulated genes out of 12 genome-wide or large-scale studies 

TRANSCRIPTOMICS 







i Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Takemasa et al 2001 [22] 


I 

"23 

c 
12 


I 

o 

1 

i 

i 

o 
Q 

O 

£ 

i 

(L 


Zhang et al 1997 [18] 


57 

i 

re 
« 

Z 
uu 
< 


Zhang et al 1997 [16] 


Williams et al 2003 [28] 


§ 

CM 

% 
c 

S 

o 
Z 


Accession 

Number Swiss-Prot Entry Name Name 


Paired normal and cancer 


Nonnal and cancer 


E 

J2 

a. 

T3 
C 
C3 

C 
« 

<0 
O 

zs 
E 
z 


i Paired n. and prtumors 

i 


Paired n. and pr.tumors 


Paired n.muc and pr.tum or adenoma 


Cell lines 


Norm.muc and Prim.tum 


Primary tumors and cell lines 


Cell line 


Paired n.muc and pr.tum, n and adenoma 


Paired n.muc and pr.tum or adenoma 


3 

a 

« 


| Adaptor-tagged 


SAGE 


5 

O 

o 


< 
z 
o 

o 


cONA 


| SAGE 


SAGE 


SAGE 


SAGE 


{ cDNA 


Oligonucleotide Array 


AF041260 


075363 


IFM2.HUMAN 


Interferon-inducible protein 1-Bd 


1 




1 




1 






1 










AA232508 


Q9Y397 


SAHH.HUMAN 


S-Adenosylhomocysteine hydrolase AHCY) 






1 


1 










1 


1 






AA0O7218 


Q96QY8 


BGH3.HUMAN 


Transforming growth factor, beta-induzed, 66 kDa 


1 




1 


1 










1 








G01119 




TCPD_HUMAN 


Chaperonin containing TCP-1, subunit4 (delta) 












1 








1 


1 




X75821 


Q04984 


CA11_HUMAN 


C0L1AI 










1 


1 




1 










U22055 


Q96AG0 


TF3A_HUMAN 


General transcription factor II IA 








1 




1 










1 




U14631 


P80365 


GROJWMAN 


GR01 oncogene (melanoma growth stimulating 
activity, alpha} 








1 












1 


1 




M92439 


P42704 


MDP1_HUMAN 


MP0P4, M0P7 microsomal dipeptidase 






1 


1 








1 










AF029082 


P31947 


MPI2.HUMAN 


M-phase inducer phosphatase 2 






t 




















M86400 


P29312 


PR0C_HUMAN 


Pyrroline 5-carboxilate reductase mRNA 






t 




















L76465 


PI 5428 


SPRC.HUMAN 


Secreted protein, acidic„cysteine-rich 
(osteonectin) \ 






1 




















X57352 


Q01628 


Q96AG0 


100 kDa coactivator mRNA 


























X03205 


P62888 


RL30_HUMAN 


50S Ribosomal protein L30' , 


























M95787 


Q01995 


RL23_HUMAN 


60s ribosomal protein L23 ' 


























Z25821 


P42126 


HS9B_HUMAN 


90-kDa HSP 


























U96132 


Q99714 


RLA2_HUMAN 


Acidic ribosomal phosphoprotein P2 mRNA 


























AB028893 


P04643 


CA14__HUMAN 


Alpha-1 chain of collagen IV 


























X79239 


Q02546 


APA1_HUMAN 


Apolipoprotein A-l 


























Ml 3934 


P06366 


BMP4_HUMAN 


Bone morphogenetic protein-26 (BMP-2B) 


























M60854 


PI 7008, 


CSE1.HUMAN 


CAS chromosome segregation gene homologue 


























Ml 3932 


P08708 


CD81_HUMAN 


CD81 antigen 


























X69150 


P25232 


TCPZ_HUMAN 


Chaperonin containing TCP1 f subunit 6A(zeta1) 


























L06498 


P17075 


CA21_HUMAN 


Collagen alpha-2 type 1 
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Table 3. Continued 



TRANSCRIPTOMICS 



Accession 

Number Swiss-Prot Entry Name Name 


Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Takemasa et al 2001 [22] 


T5 
% 
e 


I 

I 

« 
% 

s 

i 


Zhang et al 1997 [18] 


SAGE-NET etal 1997 [19] 


Zhang et al 1997 [18] 


Williams et al 2003 [28] 


Notterman et al 2001 [23] 


cDNA 


Adaptor-tagged 


SAGE 


cDNA 


cDNA 


cDNA 


SAGE 


1 SAGE 


SAGE 


SAGE 


cDNA 


Oligonucleotide Array 


104483 


P35265 


:02_HUMAN 


Complement component C2 


1 


1 






















M64716 


P25111 


R10A_HUMAN 


Csa-19mRNA 


1 


1 






















X77770 


P02383 


ROAI.HUMAN 


DNA binding protein UPI, liver mRNA fragment 


























W52460 


P14798 


TP2A_HUMAN 


DNAtopoisomerase II (top2) 


1 
























X55715 


P23396 


IF39_HUMAN 


Eukaryotic translation initialion factor(EIF3)mRNA 


1 


1 






















M77234 


P49241 


IF2B_HUMAN 


Eukaryotic translation initiation factor 2, subunit 2 


























M58458 


P12750 


GGH_HUMAN 


Gamma-glytamyl hydrolase 


























016992 


P10660 


G3P2.HUMAN 


Glyceraldehyde 3 phosphate dehydrogenase 


























M77233 


P23821 


SYGJWMAN 


Glycyl-tRNA synthetase 


























F16294 


P09058 


RS8.HUMAN 


H19RNA 


























X61156 


P08865 


HS9A HUMAN 


Heat shock Protein HSP 90-Alpha 


























136055 




Hmgl mRNA for high mobility group protein 1 


























AA316619 


P04645 


RS23_HUMAN 


Human homolog of yeast ribosomal protein 523 
(D14530) 


























LI 4599 


Q15233 


IMD2_HUMAN 


IMP (inosine monophosphate) dehydrogenase 2 


























X79234 


P39026 


IFM1_HUMAN 


Interferon induced transmembrane protein 1(9- 
27) 


























L06505 


P30050 


LDHB.HUMAN 


Lactate dehydrogenase B(LOH-B) 


























X64707 


P26373 


RS2_HUMAN 


LLRep 3 


























X56932 


P40429 


4F2_HUMAN 


Lymphocyte activation antigen 4F2 large subunit 


























L25899 


P39030 


NPL1_HUMAN 


Nucleosome assembly protein 1-like 1 


























X63527 


P14118 


GSHH_HUMAN 


Phospholipid hydroperoxide glutathione per- 
oxidase 


























X89401 


P4677B 


GDFFJWMAN 


Prostate drfferentation factor 


























H59771 


P23131 


RL6JWMAN 


Ribosomal protein L 6 


























M94314 


P38663 


RL18_HUMAN 


Ribosomal protein L18(RPL18) mRNA 


























L19527 


P08526 


RL1X_HUMAN 


Ribosomal protein L18a mRNA, complete cds 


























U 14968 


P46776 


RL7_HUMAN 


Ribosomal protein L7 


























U 14969 


P46779 


RL8JWMAN 


Ribosomal protein L8 


























L38941 


P49207 


RS5.HUMAN 


Ribosomal protein S5 


























U12465 


P42766 


RS19_HUMAN 


S 19 ribosomal protein 


























F19234 


PI 8077 


MYC.HUMAN 


V-myc myelocytomatosis viral oncogene 
homolog (avain) 


























X66699 


P12751 


Q99497 


ESKPARK7; DJ1) 


1 


1 




I 

















i 
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Table 4. Results of transcriptomics resea rch in human CRC. Most cited down-regulated genes out of 12 genome-wide or large-scale studies 



TRANSCRIPTOMICS 







Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhauhs et at 2001 [24] 


Birkenkamp et al 2002 [25] 


ll 

i ! 
i \ 

A3 
M 
<D 

e 
1 


Un et al 2002 [26] 


I 

o 

g 

o 

Q 

u 
£ 

i 

aj 
CL 


Zhang et al 1997 [18] 


SAGE-NET et all 997 [19] 


oo 

5 

*»» 

To 

*J 

CD 

o 
c 
re 

f5 


WilUams et al 2003 [28] 


Notterman et al 2001 [23] j 






Paired normal and cancer 


Normal and cancer 


CD 

| 
« 

ta 
o 

E 
Z 


Paired n. and pr.tumors 


Paired n. and pr.tumors 


Paired n.muc and pr.tum or adenoma 


Cell lines 


Norm.muc and Prim .turn 


Primary tumors and cell lines 


Cell line 


Paired n.muc and pr.tuntn and adenoma 


Paired n.muc and pr.tum or adenoma 


Entry Name 


Name 


< 
z 
a 

o 


Adaptor-tagged 


LU 

| 

CO 


o 

u 


< 

Z 

o 

o 


< 
z 

O 


SAGE 


Ui 

< 


SAGE 


SAGE 


cDNA 


Oligonucleotide Array 


CAH2 HUMAN 


Carbonic anhydrase II 


1 




1 




1 . 












1 


1 


prAi til IMAM 

CEA1_HUMAN 


cct fen iadv r;i vpnonnTPiMi 
tol (blUArlY bLYLUrnU 1 tllM| 






1 


1 


1 












1 


1 


BENE_HUMAN 


□ CMC 

bbNc 


1 








1 






1 








1 


PALM Lll IMAM 

LAm_nUIVIAN 


Carbonic anhydrase 1 


1 




1 


















1 


ni IAII IHItAAkl 

GUAU_HUIvlAI\l 


uliAr*ll guanyiate cyclase activator ld, uroguanyiin, uum 


1 






















1 . 


rini Ml III A M 

FAB L_HU MAIM 


L-FABP liver fatty acid-binding protein 1 


1 








1 
















ACDS_HUMAN 


Acyl-CoenzymeA dehydrogenase C-2 to C-3 short chain 






















1 


1 


ADHG_HUMAi\l 


Aun gama / stiDunrt \aa-i— j/oi 










1 














1 


T4S3_HUMAN 


Co-029; transmembrane 4 superfamily member 3 






1 




















DRA_HUMAN 


Colon mucosa-associated (DRA) 
























1 


KLnD_nU(vlAN 


Creatine kinase 


1 
























PI it m ill Ik* AM 

GUAN_HUMAN 


fl 1 P A 1 Q nii!>niil nfo n\m 1 a to a rtn/atn r 1 R/mmnvlin 

bULAiu guanuiate cyclase acuvaiui iD/yudityim 
























1 


0 95784 


IgG Fc*binding protein 


1 








1 
















ITMC_HUMAN 


Integral membrane protein 2C 










1 


1 














MKfTC HUMAN 
ivi i\uo_nuiviMiii 


MAPK3 






















1 




FXY3_HUMAN 


MAT8 protein 










1 
















ACTB_HUMAN 


mRNA fragment encoding cytoplasmic actin 


1 












1 


1 










SELP_HUMAN 


Selenoprotein P 










1 










1 






ESTl.HUMAN 


Liver carboxylesterase 1 [Precursor] 


















1 


1 




1 


CFAD_HUMAN 


Adipsin, complement factor DCDA (EST) 
























1 


ATPB_HUMAN 


ATP5B ATP synthase 










1 
















CAHC_HUMAN 


CA12 










1 
















CANS.HUMAN 


Calcium dependent protease (small subunit) 
















1 










CALM_HUMAN 


Calmodulin-I (CALM1 ) mRNA, 3' UTR, partial sequence 
















1 






1 




CAH4_HUMAN 


Carbonic anhydrase IV i 








1 
















1 



] 
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TRANSCRIPTOMICS 



Entrv Name Name 


Yanagawa et al 2001 [21] 


s 

w 

o> 
o 

s 


Buckhautts et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Takemasa et al 2001 [22] 


Lin et al 2002 [26] 


Parle-McDermott 2000 [20] 


55* 

% 

2* 
<5 
j= 
rsi 


SAGE-NET et al 1997 [19] 


Zhang et al 1997 [18] 


Williams et al 2003 [28] 


Notterman et al 2001 [23] 


cDNA 


Adaptor-tagged 


SAGE 


< 

Q 

o 


< 
z 
a 

o 


cDNA * 


SAGE 


SAGE 


SAGE 


Ul 

< 

V3 


< 
z 
o 

o 


Oligonucleotide Array 


MT1E_HUMAN 


cDNA similar to gb: M10942_cds1 human 


















1 






1 


000748 


CES 2 carboxylesterase 2 








1 


1 
















CMGA_HUMAN 


CgA 
























1 


CLUS_HUMAN 


clusterin 
























1 


K1CT_HUMAN 


Cytokeratin 20 


















1 






1 


KDGA.HUMAN 


Diacylglycerol kinase 








} 
















1 


DTD_HUMAN 


DTD sulfate transporter 








1 


1 
















PLA8_HUMAN 


EST 122594 5. 


















1 






1 


ATPA.HUMAN 


FI-ATPase alpha subunit 










1 
















LEG3_HUMAN 


Galectin*3 (Galactose-specific lectin 3) (MAC-2 antigen) 








1 


1 
















ABP_HUMAN 


HP-DA01 (diamine oxidase) 




















1 




1 


K1CS.HU MAN 


Keratin 19 












1 








1 






K2C8 HUMAN 


Keratin, type 11 cytoskeleta! 8 (cytokeratin 8){K8) (CK 8) 














1 






1 






DHB2_HUMAN 


L1 1708 Estradiol-17 beta- dehydrogenase 2 












1 












1 


MT1H_HUMAN 


Metallothionein 1 H 
























1 


MUC2.HUMAN 


Mucin 2, intestinal/tracheal 














1 


1 










MEPA.HUMAN 


PPH alpha gene 




















1 




1 


PA2A HUMAN 


RASF-A PLA2 gene 












1 










1 




MT1F_HUMAN 


RNAhelicase-related protein 
























1 


SBP1.HUMAN 


SBP selenium-binding protein 












1 


1 












MT1LJWMAN 


Serine theonine kinase 39 (STE20/SPS1 homolog.yeast) 












1 












1 


MYL6.HUMAN 


Myosin light polypeptide 6 


















1 




1 




TETN_HUMAN 


TNA tetranectin 












1 












1 


VIPR.HUMAN 


VIPRt vasoactive intestinal polyoeptide receptor 1 












1 












1 


PLA8_HUMAN 


Placenta-specific gene 8 protein 




















1 




1 


Q9NXM9 


Hypothetical protein FU20151 












1 












1 


T4S1„HUMAN 


Transmembrane 4 superfamily member 1 








1 












1 






ALU2.HUMAN 


Alu subfamily SB sequence contamination warning entry 






1 






1 














S116JWMAN 


S100 calcium-binding protein A16 


















1 


1 






C14A.HUMAN 


Dual specificity protein phosphatase CDC14A 




















1 


1 





Second, results have been obtained using heterogeneous 
samples in particular cell lines, whole tissue biopsies, and 
epithelial cells purified from surgical specimens. Results 
obtained in cell lines do not allow accurate comparison be- 
tween normal and cancer cells, and the presence/absence of 
proteins of interest has to be confirmed in biopsies. For 
example, when 2-D PAGE protein patterns of normal human 



colonic crypts were compared with the CRC cell line L1M 
1863, the proteins spots from normal crypts matched only 
75-80% of the cell line spots and the relative expression 
levels of a large number of proteins differed [3]. When clin- 
ical biopsies are examined, results of phenotypic compar- 
isons depend on the type of samples examined {e.g., hetero- 
geneous whole tissue biopsies with inflammatory cells, 
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Table 5. Results of proteomics research in human CRC. Most cited up-regulated (orange), down-regulated (green), or mentioned (yellow) 
proteins out of 12 large-scale proteomics studies 



Accession 

Number Swiss-Prot Entry Name Name 


I 

§ 

to 

« 

c 
o 

vt 

CL 

E 
</5 


Simpson et al 2001 [31] 


Demalte et al 2003 [33] 


s 

cn 

To 
« 

c 
o 
E 
>■ 

oc 


1 

"<5 

OJ 
-a 
01 

-c 
re 

« 
2 


I 

§ 

.2 
'E 

5 
3 


Friedman et al 2004(35) 


i 

z 

OJ 
1/3 


SDS-PAGE 


Free-flow electrophoresis 


2-D PAGE 


UJ 

< 

CL 
Q 


1 2-D PAGE 


2-D PAGE 


| 2-D PAGE 


2-D PAGE 


AF041260 


075363 


/\CTB_HUMAN 


mRNA fragment encoding cytoplasmic actin 


: ;;lV 










iMi- 




AA232508 


Q9Y397 


ACTG_HUMAN 


Actin, Cytapiasmatic 2 (Gam ma -Actin) 
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Table 6. Reproducibility of transcriptome results using proteomics tools (a) and vice versa (b). Proteomics results have been reproduced in 
67% of cases in transcriptomics studies. Results from transcriptomics studies are more difficult to reproduce using proteomics 
tools (25%). Only two gene products can be retrieved in three and more proteomics and transcriptomics studies (actin and creatin- 
kinase) 

a. Transcriptomics > Proteomics 
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necrosis, blood, stool, etc.), so that standardized sample 
preparation procedures are critical for obtaining reproduc- 
ible results. Several sample preparation methods have been 
described, in particular fluorescence-activated cell-sorting 
(FACS) [4], laser capture microdissection (LCM) [5-8], 
immunomagnetic beads separation [9], and cellular frac- 
tionation [10). Unfortunately, these sample preparation 
procedures were barely applied in CRC. In our experience, 
the beads-based method is characterized by several advan- 
tages when compared with other cell purification proce- 
dures [11]. 

Third, gene expression patterns depend on the arrays 
technology platform. In transcriptomics studies any factors 
may affect the outcome of a microarray experiment, in par- 
ticular technical, instrumental, computational, and inter- 
pretative factors. In fact, lack of reproducibility and accuracy 
is a major concern in microarray studies [12]. When cross- 
platform comparison was performed, reproducibility was 
insufficient: only four genes from a set of 185 common genes 
selected behaved consistently on three array platforms, and 
agreement of about 30% was found between two brands [13]. 

Fourth, in proteomics studies, 2-D PAGE or 2-D DIGE 
have well-known technological limitations. In CRC, even 
after epithelial cell enrichment using magnetic beads, the 
mean CVof repeated 2-D PAGE analysis with silver staining 
was found to lie between 20 and 28%. Only 47% (interrun) to 
76% (intrarun) of spots could be matched within a triplicate 
experiment. Interindividual phenotypic variability was high. 
Thus, even under well-defined experimental conditions, 2-D 
PAGE parallel analysis of paired CRC samples is hampered 
by a significant variability [2]. 

Fifth, the methods applied for generating, formatting, 
storing, retrieving, and querying data are of outmost impor- 
tance to assess methodological and biological variation in 
gene expression analysis. Unfortunately, due to the small 
sample size (number of patients), large number of variables 
examined at once, and absence of double or triple experi- 
ments (arrays and gels are expensive and samples are rare) 
statistical analysis is often not valid. In particular, assessing 
the reproducibility of a variable is necessary (e.g., using the 
intraclass correlation coefficient) for comparing multiple 
samples at once. The use of median values instead of mean 
values has been shown to improve data correction [14]. It has 
also been proposed to use housekeeping genes as endoge- 
nous controls [1 5]. A dedicated society, The Microarray Gene 
Expression Data (MGED) Society, has been formed to facil- 
itate the sharing of gene expression data generated by func- 
tional genomics and proteomics experiments [16]. 

Finally, correlation between results of transcriptomics 
versus proteomics results is low. For CRC, there is no pub- 
lication comparing mRNA and protein expression for a 
cohort of genes. However, extrapolation is reasonable from 
another epithelial cancer (lung adenocarcinoma), where 
such comparison has been performed. Only a subset of the 
proteins (17%) exhibited a significant correlation with 
mRNA abundance [17]. 
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Obviously, many genes and factors found to be differen- 
tially regulated (both in transcriptomics and proteomics 
studies) do not play a causal role in CRC carcinogenesis. For 
example in the studies under investigation, at least 17 
mRNAs encoding ribosomal proteins were identified to be 
dysregulated using cDNA arrays and 39 ESTs using SAGE 
technology. This broad dysfunction of protein synthesis, in 
particular of small molecules synthesis, has been reported 
not only in cancer but also in several other human diseases, 
where etiologies have been linked to mutations in genes of 
the translational control machinery [36]. 

However, some findings might be of particular interest 
in human CRC. For example, a small molecule group found 
to be dysregulated in human CRC is the 14-3*3 proteins 
family. 14-3-3 proteins are ubiquitous within all eukaryotic 
cells and participate in protein kinase signaling pathways. In 
particular, they are involved in phosphorylation-dependent 
protein-protein interactions that control progression 
through the cell cycle, initiation and maintenance of DNA 
damage checkpoints, activation of MAP kinases, prevention 
of apoptosis, and coordination of integrin signaling and 
cytoskeletal dynamics [37], Given the prevalence of specific 
14-3-3 isoforms expression in several human epithelial can- 
cers [38, 39], these proteins may be involved in cancer 
tumorigenesis and particular isoforms may be useful as 
therapeutic targets in human CRC. 

In summary, we propose a gene expression data ware- 
house in human CRC that is intended to help researchers 
active in the field to get an overview of the data available. 
However, reproducibility of results obtained in different 
studies was disappointing. 

As a matter of fact, the development of some types of 
unified processes for generating information, formatting, 
storing, retrieving, and querying data, regardless of the tech- 
nology used to generate it, is of central importance for 
building gene expression databases. Such unified processes 
have been proposed (see above) but have found only limited 
recognition so far. This is unfortunate because these metho- 
dological issues must be solved before trying to integrate 
experimental data from different sources into functional 
proteomics studies at the bench or in silico. The present 
meta-analysis of gene expression highlights the need for 
including a sufficient number of patients, analyzing only 
purified epithelial cells, using clinical standards (such as the 
international classification of disease, histopathology, and 
staging), assessing the variability of the technology applied, 
considering common standards for microarray data annota- 
tion and exchange, and finally for developing software 
implementing these standards and promoting the sharing of 
these high quality, well -annotated data within the life sci- 
ences community. 

In the absence of such unified processes, reproducibility of 
results within a laboratory will continue to be low, confirma- 
tion of results between different groups will stay difficult, and 
identification of diagnostic and therapeutic targets will remain 
a lottery. These problems are challenging the whole drug dis- 
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covery process from the very beginning, namely target identi- 
fication, so that target validation, assay development, and 
prioritization of compounds remains a high risk endeavor. 

We are very grateful to G. Nestler for her excellent technical 
assistance. 
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Proteomics 



Kathiyn Lilley, Azam Razzaq and Michaa! J. Deery 



17.1 Introduction 

There are now numerous organisms whose genome sequences are known, for 
example Arabidopsis (Arabidopsis Genome Initiative, 2000), Drosophila (Adams 
etal. t 2000), human (International Human Genome Sequencing Consortium, 
2001) and rice (Yu et al., 2002), The prediction of open-reading frames from 
these genomic sequences has enabled the comprehensive identification of many 
putative protein sequences. These proteins can be arranged into three categories, 
namely those of known function, those with recognisable motifs and hence a 
vague idea of function, and those with no sequence similarities to any pro- 
tein (Gabor Miklos and Malenszka, 2001). Many proteins reside in this latter 
'functional vacuum' which could represent as much as 30% of the predicted 
proteins. Determining protein function is key to imdmfunrfjng cellular mecha- 
nism. Studying how protein expression is modulated in response to a given set 
of circumstances, such as infection, disease, developmental stage, senescence or 
response to drugs, will facilitate the elucidation of disease pathways and thus 
provide a mechanism for diagnosis and therapy. 

DNA chips (mRNA profiling studies) can contribute to the study of gene 
expression in response to a particular biological perturbation. However, the 
extrapolation that changes in transcript level will also result in corresponding 
changes in protein amount or activity cannot always be made, lb understand 
fully, we need integrated data sets from a variety of protein expression studies, 
providing information on relative abundances, subcellular locations, protein 
complex formation and the profiling of isofonns generated by either alternate 
mRNA splicing or post-translational modifications. Proteomics is the won) now 
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commonly used to describe the discipline associated with the acquisition of 
these data sets. 

17.2 Definitions and applications 

Before embarking on an overview of die techniques used in the field of pro- 
teomics, it is necessary to define a few terminologies; proteomics, functional 
genomics, structural genomics and post-genomics are terms that have crept into 
the scientific vocabulary with alarming stealth and are used freely, and in some 
cases interchangeably, especially in cases where the use of such 'buzz words* 
is likely to increase the attendance &t departmental seminars. 

• Genome: A genome represents the entire DNA content in a particular cell, 
whether or not it is coding, non-coding, or is located either chromosomally 
or extra-chromosomally. 

• Genomics: Genomics is the study of the genome, interrogating the complete 
genome sequence using both DNA and RNA methodologies. 

• Proteome: The proteome represents the complete set of proteins encoded by 
the genome. 

• Proteomics: Proteomics is the study of die proteome and investigates the cel- 
lular levels of all the isoforms and post-trans lational modifications of proteins 
that are encoded by the genome of that cell under a given set of circumstances. 

• Functional genomics: This is the study of the functions of genes and their 
inter-relationships. 

Whilst a genome is more or less static, the protein levels in a particular cell can 
change dramatically as genes get turned on and off during the cell's response to 
its environment The proteome originally was defined over seven years ago as 
'all die proteins coded by the genome of an organism* (Wasinger et al. 9 1995). 
Nowadays the term 'proteomics* is used more widely and implies an effort to 
link structure to function by whatever means are appropriate. We can expect the 
definition to change again with time as the field and the investigator's view of 
it evolves. 

173 Stages in proteome analysis 

Proteomic analysis can ascertain function either by looking for changes in the 
expression of either all or a subset of proteins, or by identifying binding partners 
for particular proteins and seeing how their interaction is affected by biological 
perturbation. Whatever the rationale of the investigation, or the number of pro- 
teins involved, the study of the proteome can be broken down into the following 
stages of analysis. 
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Gene Expression Analysis 
Using Microarrays 
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13.1 Introduction 



Microarrays are of increasing interest to both industry and academia as tools 
for 'gene hunting* and also as quantitative methods for routine analysis of large 
numbers of genes. Techniques such as real-time polymerase chain reaction (RT- 
PCR) TaqMan™ and SybrMan™ are generally considered to be more accurate, 
robust, larger in dynamic range and less capital intensive, but for rapid, large^ 
scale gene expression analysis using limited mRNA, microarrays and gene chips 
are preferred. 



13.2 Microarray experiments 
Platforms 

Global gene expression platforms are now available in multiple formats, 
including cDNA arrays, oligonucleotides spotted onto slides or in situ 
synthesised oligonucleotide arrays manufactured using photolithography. 
Commercial sources for these include Stratagene (La Jolla, CA), Memorec 
(Koln, Germany) and BD Biosciences (Oxford, UK) for cDNA microarrays, 
Mergen Ltd (San Leandro, CA) for spotted oligomers and Affymetrix (Palo 
Alto, CA) for ougoarrays synthesised in situ. Purchasing from a supplier is more 
expensive than generating microarrays in-house, although the latter is beneficial 
in labour-intensive institutions or when proprietary gene information is utilised. 
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'Off-the-shelf microarrays may also have the advantage of rigorous quality 
control and standardised protocols. 

It is possible to produce oligonucleotide spotted arrays in-house, by design- 
ing oligonucleotide sequences that match genes of interest and then purchasing 
purified oligonucleotides to spot down on glass or other substrates. Alternatively 
there are new systems such as that available from CombiMatrix (Mukilteo, 
WA, USA) for computer-aided design and in situ synthesis of oligonucleotides. 
However, production of cDNA microarrays is currently the most affordable and 
popular method and is now well established. Numerous sources of informa- 
tion on cDNA microanay fabrication are available in the literature and on the 
internet (Bowtell, 1999; Cheung etai, 1999; Wildsmith and Elcock, 2001 and 
http://cmgm.stanford.edu/pbrown). Thus, this chapter will focus on the 
implementation of experiments and analysis of data ftojn cDNA microarrays. 
The experimental procedure differs slightly according to the number of fluo- 
rophores (or channels) and the type and manufacturer of the array. We have 
attempted to describe a generic process, indicating where possible the different 
options. Figure 13.1 demonstrates the procedure for a two-colour hybridisation. 



Sample 1 



Sample 2 



Total RNA 




Total RNA 



AAAAAA 



Reverse 
transcription 




Hybridise labelled cDNAs { 





Wash off 
unbound cDNA 



Laser excitation 
and scanning 




Figure 13.1 The microarray experimental process for two-colour hybridisations 



RNA Extraction 
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First, RNA is extracted from the tissue or cells of interest. The quality of the 
RNA extracted is paramount to the overall success of the microarray experi- 
ment, as impurities in the sample can effect both the probe labelling efficiency 
and also stability of the fluorescent label (Hegde et al., 2000). Snap-freezing 
of tissue in liquid nitrogen, immediately after harvesting, is used to preserve 
RNA integrity. Any further sectioning of the tissues should be carried out under 
RNAse-free conditions (Fernandez et al., 1997). Total RNA can be extracted 
using kits such as TRIzol® (Invitrogen, Paisley, Scotland) and Rneasy (Qia- 
gen, GmbH, Hilden, Germany). Some researchers perform a further extraction 
of mRNA; this results in a purer starting material but has the disadvantage of 
lower yields. Affymetrix recommend between 5 and 40 u,g of total RNA is 
required for their GeneChips™ and 10 ng or less is the required amount of 
starting material for cDNA rnicroarrays (Hegde etal., 2000). 



.Sample .Labelling 

The mRNA is transcribed in virro, with the concomitant inclusion of labelled 
nucleotides. The labels may be fluorescent or radioactive. In the case of dual 
channel/colour hybridisations, two samples will be labelled with dyes that 
fluoresce at different wavelengths, with different emission spectra. Example 
fluorophores, available coupled to nucleotides, are Cy3, Cy5. fluorescein 
and lissamine. Wildsmith etal. (2001) have demonstrated that AlexaFluor 
546dUTP™ (Molecular Probes, Leiden, The Netherlands) gives a significantly 
higher signal than Cy3dCTP (Amersham Biosciences, Piscataway, NJ, USA). 
When performing two-colour hybridisations the control sample and 'test' sample 
are labelled with different fluorophores and the subsequent cDNA is then mixed 
together and hybridised simultaneously (Nuwaysir et al., 1999). An advantage of 
simultaneously hybridising control and treated sample is that it obviates the need 
to control for differences in hybridisation conditions or between rnicroarrays. A 
specific example of the huge impact this technique has had includes its use 
in the first published account of gene expression data of the entire genome of 
Saccharomyces cerevisiae (DeRisi etal., 1997). In two-colour hybridisations, 
one would assume that the properties of the two fluorescent dyes being used 
are equivocal. In fact, for Cy5 and Cy3 this is not the case as Cy5 has been 
reported to give higher background fluorescence and also is more sensitive to 
photobleaching than Cy3 (Van Hal etal, 2000). In addition, there is evidence 
from several independent sources that the combination of Cy3 and Cy5 dye 
labelling can affect data in certain genes. That is to say, when experiments 
are repeated and the dye combination for the two probes reversed, inconsistent 
results are obtained with certain genes (Taniguchi etal., 2001). Despite these 
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Treated : Cy 3-532 nm Control : Cy 5-635 nm 




Figure 13.2 Two-colour fluorescent scan of human gene cDNA array. The probe mix con- 
sists of DNA from HepG2 control cells and ceils treated with butbionine sulfoximine for 6 h. 
A colour version of this figure appears in the colour plate section 

facts, two-colour hybridisations arc widely accepted throughout the microarray 
community and an example of an image is shown in Figure 13,2. 

Hybridisation and Processing 

After labelling, the cDNA is purified (to remove unincorporated nucleotides), 
mixed with a hybridisation buffer and then applied to a cDNA microarray slide! 
The sample and the slide are heated prior to hybridisation in order to separate 
double-stranded DNA. A coversiip is applied (Shalon et aL, 1996), or prefer- 
ably a hybridisation chamber is used to avoid evaporation and enable an even 
hybridisation. The hybridisation and subsequent wash steps arc carried out at a 
buffer stringency and temperature that enables hybridisation of complementary 
strands of DNA but reduces non-specific binding. 

Image Capture and Image Analysis 

After hybridisation the microarray slides are scanned, using either a laser or a 
phosphorimager (depending on the type of label used). There are many different 
suppliers and models of fluorescence scanner, for example the ScanArray 5000 
(Perkin Elmer Life Sciences, Zaventem, Belgium), GenePix 4000B (Axon GRI, 
Essex, UK) and the GeneArray® (Affymetrix, Santa Clara, CA). The choice of 
scanner is determined by sensitivity, resolution, flexible wavelength, file size 
generated, throughput and technical support available. 

Images are analysed using software that measures the intensity of the sig- 
nal from the hybridised spotted genes (spots), which provides a measurement 
of the amount of cDNA bound. Thus the initial concentration of messenger 
RNA is inferred. Early software packages 'drew* grids around the spots and 
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integrated across the whole area of the grid. This overcame problems associ- 
ated with accurate location of the spots, which is problematic, especially if the 
spots on the printed arrays are poorly aligned. More recent versions of software 
'draw* circles around the spots themselves and perform measurements within 
and outside of this boundary. For example, the background may be calculated 
from a region outside the spot boundary. The intensity of the signal from the 
spot may be calculated using median, mode or mean values of the pixels within 
the spot. Researchers differ in their preferences regarding using median or mean 
values (Hegde et aU 2000) and this is likely to depend upon the protocols and 
software used. 

Image analysis software commonly is supplied with scanners or can be 
bought from the same supplier. This has the advantage of being opti- 
mised for that specific type of microarray and the benefit of upgrades 
and technical support. Software for microarray analysis is available 
from BioDiscovery (http://www.biodiscovery.com), Imaging Research 
(http://imagingresearch.com),'GenePix Pro (Amersham Biosciences, Pis- 
cataway, NJ), anaySCOUT 2.0 (http://www.lionbioscience.com), NIH 
(http://www.nhgri .nih.gov/DIR/LC0/15K/HTML/img analysis .html), 
Stanford University (http: //rana. Stanford.EDU/ software) Media Cyber- 
netics (Silver Spring, MD, USA) and TIGR (http//www . t igr . org/sof t lab). 
Important criteria for image analysis software include speed, ease of use, 
automation and the ability to distinguish artefact from real signal (Wildsmith 
and Elcock, 2001). 

As the technology has evolved and more experience gained, it has become 
more and more apparent that the most significant issues facing microarray users 
are the processing of the vast quantities of data generated and deciding exactly 
what tools are the most appropriate for data analysis. Because of the enormity 
of this, we have dedicated a complete section to describing the current status of 
this area. 



13.3 Data analysis 

It is important to be cognisant of the fact that the practical laboratory aspects of 
using microarrays are only part of gene expression analysis. Many researchers 
generate vast volumes of data, without a clear understanding of how to manage 
and interpret them. Furthermore, the variability in microarray data confers addi- 
tional problems for analysis. In some cases the purpose of the experiment will 
be a gene-hunting exercise, in which case a cursory indication of potential gene 
biomarkers is sufficient analysis. In other instances, such as pathway mapping 
and screening studies, it is paramount that results are statistically meaningful 
and valid. The next few sections detail some relatively simple analysis meth- 
ods and recommendations for the benefit of researchers with minimal statistical 
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Figure 133 The ideal microarray expedmeiital design and process 

training. There are also suggestions for more advanced analysis for those who 
have the assistance of a statistician or specialist data analyst. 

Most of the steps before, during and after performing a microarray experi- 
ment are optimally conducted with regard for statistics and data analysis. Careful 
planning before implementation facilitates the downstream analysis and inter- 
pretation of data. The following model summarises the entire microarray process 
with integration of the biological and data analysis components (Figure 13.3). 

Hypothesis Generation 

Any study is conceived for the purpose of investigating or obtaining supporting 
evidence for a biological hypothesis. Giving time at this early stage to consider 
downstream implications will pay dividends later. It is helpful if, rather than 
simply stating the aims of the experiment, the researcher asks the question 'What 
results do I expect?* or 'what answer will validate/invalidate my hypothesis? 1 . 
This Averse-engineering' proves useful in focusing the project, assessing the 
feasibility of the work, providing early preparation for data management and 
analysis and, importantly, in managing expectations with regard to outcomes. 

A good example of careful experimental planning is demonstrated by Golub 
etal. (1999) in the classification of acute leukaemias in order to distinguish 
between acute lymphoblastic leukaemia (ALL) and acute myeloid leukaemia 
(AML). Distinguishing between ALL and AML using conventional techniques 
is known to be a difficult task. The researchers maximised their probability of 
success by choosing an easier, more defined model (normal kidney vs. renal 
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cell carcinoma), on which to validate their analytical methods. In doing so they 
established that their techniques were suitable for classifying tissues according 
to disease and gained confidence in their approach before using the samples of 
real interest. 

Optimisation Experiments 

Although microarrays are becoming increasingly accessible to all, using these 
tools requires experience and it is unlikely that successful experiments will be 
conducted immediately. It is usual that some time is given to optimising a sys- 
tem for any specific application, for example for a given tissue or cell type. 
Additionally, the requirements for a given system may warrant some modifica- 
tions. The standard approach for a scientist to take is to vary one parameter, 
whilst keeping all others constant. This is time-consuming and does not take into 
account the interactions between different factors. Well-designed, multifactorial 
experiments (Box era/., 1978), provide a faster route for optimisation, with a 
statistical measure of confidence. An example of this technique is in the opti- 
misation of microarray experimental conditions for preparation of fluorescent 
probes from rat liver tissue (Wildsmith et al f 2001). When a major source of 
variation is revealed this can be investigated further with a view to minimising 
it or providing sufficient replicates to account for it. 

Design of Experiments 

Once confidence in the experimental procedure has been obtained the researcher 
is likely to have gained an insight into the reproducibility of the system. This 
assists in the design of the experiments, in particular in determining the mini- 
mal number of replicates necessary. Replication can be implemented at many 
stages - from biological samples through to microarray slides. 

Owing to the enzyme-catalysed transcription reactions, a large amount of 
variation occurs during the probe-making stages in microarray experiments. Our 
work indicated that replicates should be made at this step and a minimum of six 
replicate probes are made for microarray experiments (Wildsmith etal, 2001). 
These can be pooled or hybridised separately onto six microarray slides. 

Lee et al. (2000) have examined the effect of the different location of cDNA 
spots on the glass slides and concluded that replicates are essential to provide 
meaningful data and to enable reliable inferences to be drawn. 

With regard to commercially available gene chip systems, such as that avail- 
able from Affymetrix (see Figure 13.4), the variation between chips, within a 
batch, is likely to be low due to stringent quality control and highly automated 
manufacture. The use of an automated wash station also reduces variability in 
intensities between chips. However, using a multi-step approach in the probe 
preparation and subsequent antibody binding steps may lead to variation between 
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Figure 13.4 The GeneChip® Instrument System. From left to right, the hybridisation sta- 
tion, scanner and workstation. Image courtesy of AfTymetrix. A colour version of this figure 
appears in Che colour plate section 



replicate samples prepared on different days. Pooling of reagents within an 
experiment, and analysing controls together with treated samples, will both 
reduce the variability within a given experiment. 

Conduct of Experiment 

At this stage some attention may be required for verifying and validating pro- 
cesses. For example, checking that the imaging instruments give consistent 
results across the slide, on repeat use and from day to day. If two imagers 
are used it is important to verify that the results from both machines are com- 
parable. Some laboratories read fluorescence of one channel and then adjust the 
laser intensity of the second channel in order to obtain comparable readings. 
This is a method of normalising for the difference in intensities of fluorophores. 
It is important to be aware that this approach has a number of drawbacks. The 
arbitrary value of the second laser intensity setting will vary from experiment 
to experiment; thus comparisons of this channel cannot be made across experi- 
ments. Also the response of the fluorophore may not be linear across the laser 
intensity settings and this can lead to additional errors. 

Another area for investigation prior to running the study itself is the image 
analysis component. Depending on the software used, the image analysis pack- 
age may process the data to some extent, for example automatic background 
subtraction. Full understanding of the software is required so that it is clear at 
what point the data are 'raw', and the extent of inherent, inseparable manip- 
ulation. Effort may be required to determine the optimum settings for any 
software parameters. 

As data are generated it is important to be aware of the data integrity - for 
example ensuring that all data are collected, so that there are no missing data that 
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Figure 13.5 The relative fluorescence units (RFUs) of 1248 genes on seven microar- 
ray slides that were. hybridised with cDNA made from. the liver of. a rat treated with 
acetaminophen. Note the gene outliers at approximately gene number 480 

can complicate analysis later. The researcher may be intuitively aware of any 
spurious results and should be alert for anything extraordinary that could indi- 
cate problems, for example hybridisation intensities appearing inconsistent from 
sample to sample. Data analysis at this point can be a rapid indicator of dubious 
results. For example, Figure 13.5 shows a plot of the fluorescent intensities of 
1248 genes that were hybridised with probe derived from acetaminophen-treated 
rat liver tissue. The data appear consistent, with the exception of peaks in inten- 
sity on one slide at around gene 4800. Further investigation of the microarray 
revealed a large artefact that had been missed by the image analysis process 
(Figure 13.6). 



Raw Data Generation and Storage 

One issue that arises when carrying out microarray analyses is how much data 
to store and in what form. For Good Laboratory Practice (GLP) purposes, often 
required in industry, storage of the raw data is necessary. This could be construed 
as the microarray image. Storing the image analysis results requires far less 
storage space and is easier to visualise, but it has the drawback that image 
analysis cannot be redone should superior software be available in the future. 
In reality, the methods used for microarrays are continually changing and the 
likelihood of revisiting old images on which the analysis has been performed, 
using outdated protocols is quite small. 
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Figure 13.6 Portion of scanned image showing region where artefact occurred that caused 
very high signals, which were classed as outliers 



Pre-processing 

A number of pre-processing steps are often used in microarray analysis. These 
include filtering, log transformation, normalisation and background subtraction. 
Filtering may be used before or after transformation in order to extract data 
from preferred regions of interest, or in order to remove outliers (see the 
above example relating to image analysis artefact). One example of filtering 
is the removal of individual gene replicates that lie outside a given number (for 
example 5) of standard deviations from the mean. Alternatively, data points that 
lie in the top/bottom few percentiles (e.g. 0. 1 %) of the data can be removed. This 
method of removing outliers is also called 'trimming'. It is acceptable if there 
is a large volume of data where only a small proportion of data is removed and 
if the same method is applied consistently across all data. Care must be taken 
in the way in which this is carried out in order not to delete genuine data. For 
example, if one gene is consistently high or low in expression across replicates, 
then it is unlikely to be an outlier. 

Another method of detecting outliers is to plot all genes (see the section 
'Conduct of experiment' above) or to perform PCA analysis (see the section 
'Multivariate analysis' below) to detect replicate outliers. The use of a PCA 
plot to detect outliers is shown in Figure 13.7. 

Log transformation of data is accepted universally because the fluorescence 
data that are generated from microaiTays tend to be skewed towards lower val- 
ues. There are scientifically valid reasons why ratios of raw expression values 
should not be used (Nadon and Shoemaker, 2002). When using two-colour 



DATA ANALYSIS 279 



x10 7 



Score plot: class2 
T 




-0.5 0 
PC1-47.3738% 

Figure 13.7 PCA plot of data used in Figure 13.4 showing one microarray (number 2) 
an outlier 
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hybridisations it is common to express the ratio of treated to control as a loga- 
rithm in base 2 (Quackenbush, 2001). Thus genes up-regulated by a factor of 2 
have a log 2 (ratio) of l f and genes down-regulated by a factor of 2 have a loe<> 
(ratio) of -1. * 

Normalisation and background subtraction techniques are methods of data 
manipulation and their use is more subjective and often debated. The purpose of 
these techniques is to reduce the error (variability) that occurs between replicates 
and thus enable a comparison of data across samples. 

The theory behind background subtraction is that during hybridisation there 
will be non-specific binding to the slide. This will effectively •darken 1 the 
image and give falsely high readings of fluorescent intensity. Correcting for 
the non-specific hybridisation should reduce error due to background staining. 
Background subtraction often occurs automatically in microarray image analy- 
sis packages. The software may circle the spot of interest and use the region 
beyond the periphery as the measurement of background. In cases of uneven 
hybridisation this method enables locally high background to be subtracted on 
a regional basis. One criticism of this approach is that the slide surface beyond 
the periphery is not similar, in chemical terms, to that where the nucleic acid 
has been deposited, and therefore cannot act as a real control for non-specific 
binding. A more accurate measurement of non-specific binding can be gained 
from using a region where spotting chemicals have been deposited, but no target 
is present. This concept is the basis of a method using local 'blank spots' fWu 
*fa/., 2001). 

A number of methods exist for normalisation of data. These include normal- 
ising to total signal or to a 'known* spot or gene, standardisation, or proprietary 
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methods. Normalising to total signal is the simplest approach, whereby the gene 
intensity is expressed as a percentage or proportion of the signal intensity for the 
entire array. This method works best when the total intensities for the microar- 
rays are similar and the number of changes is small compared with the number 
of genes. However, we often find, when using arrays of around 1000 genes, that 
pathological disease can up-regulate a large number of genes simultaneously. In 
this case, when total signal normalisation is applied, highly up-regulated genes 
will appear less up-regulated and genes that do not change from the control will 
appear down-regulated. 

Normalisation to a control value is a more popular technique. A control value 
can be obtained from using a gene known to remain constant under the condi- 
tions of the experiment. DeRisi et al. (1997) used a panel of 90 housekeeping 
genes for. normalisation, but found considerable variation in their gene expres- 
sion. Unfortunately it is very difficult to know with certainty that a gene will 
not change and there is evidence to suggest that so-called 'housekeeping genes' 
are variable (Savonet et al. t 1997). Other control genes can be derived from an 
alternative species; these should not be expected to hybridise. We have used 
yeast and Arabidopsis genes as negative controls for hybridisations of rat tis- 
sues. No orthologs were known to the genes selected; however, in . most cases 
non-specific binding occurred. 

If two or more microarray replicates appear to be different, but they are 
expected to be the same, then they can b$ standardised. An example of this 
might occur if the total intensity of one microarray is greater than another, but 
the genes are proportionally equally up- or down-regulated. If the microarray 
sample spot data are assumed to be drawn from a normal distribution, then the *z- 
transform' can be used. This requires that the mean and the standard deviation of 
the intensity values for each microarray are determined. The mean is subtracted 
from each individual gene value and the remainder is divided by the standard 
deviation. The intensity values from each microarray will then have the same 
mean and standard deviation. This has the advantage of facilitating comparison 
of microarrays with different dynamic ranges as well as total intensities. If the 
data are not normally distributed, then alternative non-parametric methods can 
be used, such as normalising to the median. 

Univariate Analysis 

Univariate methods of analysis involve examining one variable, or gene, at a 
time. This can be a very laborious task when examining a large volume of 
data, yet it is the preferred method of biologists. The simplest technique is to 
compare control and treated values and express the result as a 'fold-change' 
ratio. Typically, when examining small volumes of data, fold changes greater 
than 2 and less than 0.5 are considered meaningful (Quackenbush, 2001). This 
cut-off is essentially arbitrary and has the distinct drawback that microarray data 
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are not homoscedastic; that is, there is more variation about the mean at low 
values than there is at high values (Draghici, 2002). 

A second method for finding up- or down-regulated genes uses the standard 
deviation (SD) of the replicate gene data. Thus if changes greater than, say, 2 
SD from the log mean ratio are considerably greater than changes associated 
with 'noise', then they are considered significant This technique means that 
when looking at a large number of genes that are normally distributed there will 
be up- and down-regulated genes, regardless of whether there are (biological) 
changes (Draghici, 2002). 

Rather than using arbitrary cut-off values it is far more meaningful to express 
the fold-change in terms of either confidence intervals, or a *p- value* (that 
is, the probability of the value occurring by chance). Thus, a fold-change of 
1.1 may be associated with a p- value of 0.001 and thus the probability that 
the gene is not up-regulated is 1 in 1000. Naturally, such small fold-changes 
may then be queried in terms of biological significance. One must then ask 
the question: Are large fold-changes more important (biologically) than small 
ones? Simple calculation of p-values for two data sets can be obtained using 
f-test functions in standard spreadsheet software. A number of replicates are 
necessary for this approach, and the data must be normally distributed. We 
have recently developed a method for calculating p-values for fold-changes that 
is not influenced by the distribution of the data or outliers and applied it to 
TbqMan™ and microarray data. Other complex and computationally intensive 
methods for calculating p-values are described in Draghici (2002) and Nadon 
and Shoemaker (2002). 

Multivariate Analysis 

Multivariate analysis of gene expression data is becoming increasingly popu- 
lar in the microarray community and in other biological domains where large 
volumes of data are generated. Multivariate analysis methods include princi- 
pal component analysis (PCA), factor analysis, multivariate analysis of variance 
(MANOVA) and cluster analysis. Currently, cluster analysis is the most widely- 
used method in die microarray community but PCA is growing in popularity 
(Crescenzi and Giuliani, 2001; Konu et al. f 2001). 

Quackenbush (2001) provides a good review of clustering tools that is rather 
unique in the regard that different clustering algorithms and linkage methods 
are presented. Clustering methods are unsupervised, and they are powerful tools 
for gaining insight into huge data sets. They enable the data to be partitioned 
in order to facilitate interpretation; however, they do suffer from subjectiv- 
ity. This is because the user selects various parameters, such as the algorithm 
used, linkage type, distance metric and, sometimes, cluster size. Whatever the 
data, clusters will always be identified, thus there is also a tendency to over- 
interpret the data - trying to attach meaning to clusters that may have no bio- 
logical relevance. 
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Software available for cluster analysis includes Cluster, the output of which 
is viewed in Treeview; both available from http://rana.stanford.edu/ 
software. This tool is particularly useful for clustering genes to identify genes 
that are co-regulated. 

The PCA is a visualisation tool that enables complex, high-dimensional data 
to be represented in two or three dimensions. It facilitates identification of groups 
of similar data, thus enabling inferences to be made about the samples. 

An example is shown in Figure 13.8. The figure shows the gene data for one 
sample (control rat liver) that was hybridised to seven microarrays according 
to the method used in Wildsmith et at. (2001). Each microarray contained two 
replicate gene sets; thus there were 14 replicate gene sets in total. The gene sets 
comprised 1248 genes (with controls). All the data (14 x 1248 data points) was 
input into the analysis and the PCA plot displays the 14 replicates individually. 
The two axes are principal components 1 and 2. Principal component 1 (PCI) 
accounts for 65.5% of the variation in the data, whereas PC2 represents only 
13%. This means that the model accounts for 78.5% of variation in the data. 

The first principal component (PCI) accounts for as much as possible of 
the variation in the .original data and subsequent components (e.g. PC2) are of 
decreasing importance. Thus, samples 8 and 9 are -very different from samples 1 
and 2. In terms of interpreting the PCA plot, it is immediately clear that there are 
three or four distinct clusters of data. These are marked by circles. Datapoints 
tend to cluster in pairs; for example replicates 1 and 2, 3 and 4, 5 and 6, 
etc. These are the duplicate gene sets on the same microarray. This indicates 
that the variation within the microarray is lower than the variation between 
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Figure 13.8 PCA plot of the microarray results from seven slides (2690, 2692, 2723, 2869, 
2876 t 2879, 2880), each with two replicate spot sets (labelled 1-14), after hybridisation with 
control rat liver. See text for explanation 
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replicate microarrays. However, given that the same sample is applied to all 
the microarrays, we must ask why we get further separation of the replicates. 
The answer lies in the associated data. The four-digit numbers associated with 
the clusters are the microarray slide numbers that indicate when they were 
printed. Numbers that are more similar seem to be more closely related, and thus 
we can hypothesise that there were some differences between the slides, such 
as differences in slide backgrounds or changes during the printing process or 
change-over of batches, between slides in the 2600-2700 region and the 2800s. 

The PCA provides a clearer overview of the data than does cluster analy- 
sis. It is a rapid method for gaining an insight into the results, in particular 
where biological meaning can be attached to the components (Crescenzi et a/., 
2001). There are a number of packages for multivariate data analysis, including 
SIMCA-P (Umetrics, AB, Umea, Sweden) and The Unscrambler (Camo ASA, 
Norway), both of which are useful for PCA. 

Other tools for data visualisation include software packages such as Spot- 
fire.net (Spotfire Inc., Cambridge, MA, USA) and GeneSpring (Silicon Graphics, 
San Carlos, CA, USA). Spotfire is particularly useful for visualisation of mul- 
tidimensional data and for visualisation of temporal data. It is possible to use 
these tools to identify genes that are co-ordinately expressed over time. 

Biological Interpretation 

After developing a sound experimental strategy, ensuring that the results are 
statistically valid, and after analysis of the data, it is down to the biologist to 
assemble the pieces of information that have been obtained. This intertwined 
information may include unexpected results that are contradictory to intuition or 
to published literature. One way to untangle the data is to map the relevant genes 
onto existing pathways and known functions. The Kyoto Encyclopedia of Genes 
and Genomes (KEGG), available at http://www. genome.ad.jp/kegg/, is a 
useful source of information, especially where the gene products are enzymes. It 
enables visualisation of the position of up- or down-regulated genes in metabolic 
pathways. 

The gene expression data obtained may differ from protein expression data, 
or information on gene product activity or location. When initiating a study 
it is useful to consider additional endpoints that can assist in the interpreta- 
tion of the data. For in vitro studies, these might include cytotoxicity endpoints, 
metabolites, key signalling molecules or perhaps protein expression. Waring 
et al. (2001) used tetrazolium dye reduction (MTT) as a measure of hepatocyte 
cell viability for their studies of gene expression in response to hepatotoxic 
insult. For in vivo studies, expression information on the tissue of interest could 
be supported by pathology, histology and blood chemistry measurements. Gene 
expression results could be confirmed by in situ hybridisations or protein activity 
assays. 
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13.4 Recent examples of mieroarray applications 

One area of rapid progress using mieroarray technology is the increased under* 
standing of cancer. Molecular pathologists are subgrouping cancers of tissues 
such as blood, skin and breast, based on differential gene expression patterns; 
For example, within a small group of breast cancer tissue samples, Perou et al 
(2000) distinguished two broad subgroups representing those expressing or alter- 
natively lacking expression of the oestrogen receptor-a gene. The work was not 
conclusive, but never has progress in this field been so rapid when compared 
with the previous methods of gene identification. 

Another example of the impact of this technology is in the identification of 
two biomarkers for prostate cancer, namely hepsin and PIM1 (Dhanasekaran 
etal., 2001). 

Mieroarray technology has also accelerated the understanding of the molecu- 
lar events surrounding pulmonary fibrosis. Specifically, two distinct clusters of 
genes associated with inflammation and fibrosis have been identified in a dis- 
ease where, for years, the pathogenesis and treatment have remained unknown 
(Katsuma etal., 2001). 

13.5 Conclusions 

Important factors in gene expression experiments include sensitivity, precision 
and reproducibility in the measurement of specific mRNA sequences (Schmittgen 
et al., 2000). These quality metrics can be maximised by using, or fabricat- 
ing, high-quality microarrays, and by optimising each step of the mieroarray 
process. From conception to conclusion it is important to bear in mind the 
original hypothesis. 

Having considered the complexity of the mieroarray experiment, the value 
obtained from a meticulously designed experiment should not be underesti- 
mated. As the number of high-quality gene expression studies increases, we hope 
that the literature will contain increasingly detailed information that will help 
interpret complex gene expression changes, and thus elucidate the mechanisms 
of disease. 
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13.9 Useful websites 

Protocols: 

ht tp : / /cmgtn . Stanford . edu/pbrown 

ht tp : //cmgm . Stanford . edu/pbrown/protocol s . html 

Image analysis software: 

ht t p : / /www . genome . ad . j p/kegg/ 
http : //rsb . info . nih ♦ gov/nih- image 
http: //rana . stanford.edu/software 

http : / /www. nhgri . nih . gov/DIR/LCO/lSK/HTML/imgjtnalysis . html 
http : //rana . Stanford . EDU/sof tware 
http//www . t igr . org/sof t lab 
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INCREASING NUMBERS OF HUMAN Dis- 
eases, both acquired and genetic, are 
being considered to be based at least 
in part on alterations in DNA se- 
quence. For most diseases, inheritance 
and 'acquisition are likely to be com- 
plex and polygenic. The efforts of the 
Human Genome Project to elucidate the 
structural genetic background by iden- 
tifying the chromosomal positions and 
genomic organization of between ap- 
proximately 30000 and 35000 human 
genes are nearly complete. 1 Based on this 
structural knowledge, a byproduct 
should be a better "scaffolding" to help 
link specific genes to susceptibility to 
various human diseases. However, to un- 
derstand how the products of these ge- 
netic linkages work together to orches- 
trate the initiation and progression of 
particular complex diseases, there will 
be a need to apply a functional genetic 
rather than a structural genetic ap- 
proach. 1 - 1 

Until recently, functional genetic 
studies have generally been of limited 
scope, only able to elucidate the role of 
1 or a few genes at a time in 1 system. 
Information on the specificity and rela- 
tive abundance of expression prod- 
ucts has traditionally been obtained by 
techniques such as RNA Northern blot 
hybridization and ribonuclease protec- 
tion assays. Somewhat more sophisti- 
cated methods, such as differential dis- 
play 4 and Serial Analysis of Gene 
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DNA microarrays represent a technological Intersection between biology and 
computers that enables gene expression analysis In human tissues on a ge- 
nome-wide scale. This application can be expected to prove extremely valu- 
able for the study of the genetic basis of complex diseases. Despite the enor- 
mous promise of this revolutionary technology, there are several Issues and 
possible pitfalls that may undermine the authority of the mlcroarray plat- 
form. We discuss some of the conceptual practical, statistical, and logisti- 
cal Issues surrounding the use of microarrays for gene expression profiling 
These Issues Include the Imprecise definition of norma/ In expression com- 
parisons; the cellular and subcellular heterogeneity of the tissues being stud- 
led; the difficulty In establishing the statistically valid comparability of ar- 
rays; the logistical logjam In analysis, presentation, and archiving of the vast 
quantities of data generated; and the need for conflrmatlonal studies that 
address the functional relevance of findings. Although several complicated 
Issues must be resolved, the potential payoff remains large 

>*MA.2OO 1;2 86:2280-2288 wjama.com 



Expression, 1 have been used to screen 
larger numbers of complementary DNA 
(cDNA) clones. However, technical 
limitations render these techniques 
nonconducive to large-scale genetic 
survey. 

To this end, a powerful new tech- 
nology is emerging, using hybridiza- 
tion to nucleotide arrays, the so-called 
gene chips. 6,7 This technological inter- 
section of biology and computers en- 
ables the reliable screening of a vast 
number of genes simultaneously and is 
amenable to automation. On a nylon 
membrane or glass surface, gene- 
specific cDNAs can be spotted, or oli- 
gonucleotides can be synthesized in situ 
by a combination of photolithography 
and oligonucleotide chemistry. This 



permits simultaneous monitoring of the 
expression of thousands of genes in a 
single step. Individual chips can be cus- 
tomized to include any chosen set of 
fully or partially characterized ge- 
nomic or expressed sequences. Chips 
can monitor over 50000 unique se- 
quences. The power of these chips lies 
in the potential for comparative expres- 
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sion studies in diseased vs normal 
samples, and in documenting changes 
at different stages during the natural 
course of the disease or in response to 
treatment. It provides the researcher 
with a new arsenal to analyze under- 
lying pathomechanisms on a grand scale 
and also to review the rationale of thera- 
peutic concepts. 

However, despite the enormous po- 
tential of this revolutionary technol- 
ogy, there are several issues and pos- 
sible pitfalls that attenuate the power 
of microarrays. First, the definition of 
normal in expression comparisons is 
neither precise nor unambiguous. Sec- 
ond, the heterogeneity of the tissues 
being studied complicates the mean- 
ing of the expression profiles. Third, the 
statistically valid comparability of ar- 
rays is an unresolved problem. Fourth, 
the vast quantities of data create a lo- 
gistical logjam for analysis, presenta- 
tion, and archiving. Finally, confirma- 
tional studies are needed to corroborate 
the biological significance of microar- 
ray data (Figure 1). 

TROUBLE WITH NORMAL 

The standard normal vs diseased tis- 
sue type of comparison, which is the ba- 
sic design foundation of profiling stud- 
ies, may be more quicksand than 
bedrock. Normal is not so easy to de- 
fine — neither is diseased. Gene expres- 
sion in normal tissue is likely to be de- 
pendent on several factors involving 
patient and sample variation. These fac- 
tors will also have an impact on expres- 
sion profiles of diseased tissue. 

Patient Variation: Ethnicity, 
Sex, Age, Genetic Background, 
Disease States 

The ethnicity, sex, age, and genetic back- 
ground of a patient are likely to affect 
the gene expression profiles of many tis- 
sues to varying extents."' 9 A simple ex- 
ample is provided by the expression pro- 
files of genes involved in scalp and body 
hair follicle activity, which can be ex- 
pected to vary over a normal range un- 
der the influence of all of these sources 
of patient variation. The effects of these 
parameters on gene expression are likely 
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Figure 1. Possible Pitfalls in Microarray Gene Expression Profiling Analysis 
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There are Ktuei that complicate each step of microarray gene expression analysis. 



to be subtle but pervasive, not fully un- 
derstood at this time, and quite prob- 
lematic for defining normal. 

The presence of disease in a subject 
who is the source of tissue for control 
purposes, presents further potential 
variabilities. For example, there may be 
a significant difference in the conclu- 
sions reached by 2 similar microarray 
expression profiling studies. One may 
compare genes expressed in a pa- 
tient's diseased lung tissue with those 
expressed in normal, nondiseased lung 
tissue from the same patient, and an- 
other may compare genes expressed in 
the same patient's diseased lung tissue 
with those expressed in normal, non- 
diseased lung tissue from a healthy con- 
trol or normal individual. Moreover, it 
is also possible that seemingly unre- 
lated disease states may influence gene 
expression at distant sites. For in- 
stance, the presence of diabetes in 1 of 
2 renal cancer patients may compli- 
cate the direct comparison of renal 
tissues. 

Sample Variation: Proximity 
to Disease, Anatomic Location, 
and Developmental Range 

Yet another complication derives from 
the proximity of the normal tissue used 
as a control for the diseased tissue. Tis- 
sue adjacent to an area of disease may 
not be normal despite absence of evi- 
dence of disease clinically or under the 
light microscope. Normal-appearing tis- 
sue near a tumor could, for example, 
be genotypically altered or exhibit an 
altered gene expression profile. 10 ' 13 



Moreover, factors such as the degree of 
disease-associated inflammation may 
have a significant impact on gene ex- 
pression profiles. Other bystander ef- 
fects, epiphenomena, or secondary dis- 
ease processes could all play important 
roles in determining expression pro- 
files within these adjacent, so-called 
normal tissues. These factors must be 
considered in the choice of normal 

The precise location within a par- 
ticular organ may be another impor- 
tant factor that affects gene expres- 
sion. 0 For example, just as location 
relative to the urethra may influence ex- 
pression profiles in the prostate, 11 skin 
from the nose, back, and palm are cer- 
tain to have different expression pro- 
files as well, despite all being from the 
same organ. Thus, site and specific ana- 
tomic location must also be taken into 
account in a description of normal. 

It must also be kept in mind that the 
definition of normal actually repre- 
sents a dynamic state. 14 All tissues, 
which are composed of early and late- 
stage cells, have a normal developmen- 
tal range. For example, normal epithe- 
lium in prostatic ducts ranges from 
atrophic to resting to hyperplastic, and 
each has a unique pattern of gene ex- 
pression. 14 

A 3-dimertsional analytic approach is 
a strategy that has been used to address 
some of these concerns about defining 
normal. Cole et al H used a 3-dimen- 
sional model to characterize the enure 
prostate gland in their study of gene ex- 
pression profiles in prostate cancer. In 
this study, whole-mount prostactec- 
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toray specimens were divided Into trans- 
verse cross sections such that the entire 
prostate gland, including the complete 
spectrum of normal epithelium and tu- 
mor progression, was available for view- 
ing, microdissection, and microarray 
analysis. This method was used to de- 
termine the exact physical relationship 
of the normal ducts, premalignant le- 
sions, and tumors — thus obtaining an 
anatomic framework on which to over- 
lay gene expression data. This tech- 
nique offers several advantages over the 
normal vs tumor comparison. Previous 
studies had used normal epithelium in 
prostatic ducts as a baseline control 
against which to compare and contrast 
tumor gene expression profiles. uw How- 
ever, the expression profile of this nor- 
mal epithelium is affected by proximity 
to tumor, location within the gland, and 
developmental state. 11 These factors can 
be better appreciated using a 3-dimen- 
sional approach, 

Disease-Related Variation 

Of course, many of the parameters that 
affect normal expression profiles (pa- 
tient ethnicity, age, sex, and genetic 
background, location within an or- 
gan, and developmental stage) will also 
affect disease expression profiles. 17 * 1 * 
Disease heterogeneity, including sub- 
type, activity, severity, stage, and pre- 
vious as well as current treatments, also 
may have a significant impact on gene 
expression. 1 *-" 

Categorizing and subgrouping pa- 
tients on entry into a study may be use- 
ful to control for as many of these fac- 
tors as possible. However, there may be 
problems surrounding attempts to de- 
fine microarray-based categorization on 
the basis of another imperfect categori- 
zation system, such as histology, as these 
groups are sometimes arbitrary or in- 
consistently designated. Nevertheless, de- 
termining whether gene expression pro- 
files correlate with existing clinical or 
histological categories can provide new 
insights into the meaning of these cat- 
egories as can new methods of classify- 
ing cancers or other diseases into spe- 
cific diagnostic categories based on their 
gene expression signatures. Several stud- 



based criteria (class predictors) for pre- 
existing categories and then use these 
new criteria to categorize new cases (class 
prediction).*" 8 Global profiling may also 
allow the development of new classifi- 
cation systems based on gene expres- 
sion alone (class discovery). 19 - 50 

Thus, when possible, it wilt be of 
value to profile a range of normal and 
diseased cell populations from a num- 
ber of patients to distinguish between 
differences in expression that are rel- 
evant to the disease process and those 
reflecting the biological spectrum of the 
normal tissue or that have occurred for 
reasons unrelated to the disease. The 
significance of this distinction, is fur- 
ther appreciated when taking into ac- 
count the vast quantity of data gener- 
ated from microarrays and the potential 
for confounding interpretation from the 
inclusion of differential expression un- 
related to disease processes. 

It is worth noting, however, that the 
issues of patient and sample variability 
are not unique to microarray experi- 
ments. In fact, microarray experiments, 
in contrast to classic single-gene experi- 
ments, may actually provide the tools for 
identifying this heterogeneity. For ex- 
ample, DNA microarrays have been used 
to explore physiological variation in gene 
expression on a genomic scale in 60 cell 
lines derived from diverse tumor tis- 
sues. 11 Cluster analysis allows the iden- 
tification of prominent features in gene 
expression patterns that appear to re- 
flect molecular signatures of the tissue 
from which the cells originated. 31 

HETEROGENEOUS CELL 
POPULATIONS 

A further complication encountered with 
expression profiles is that any given tis- 
sue is composed of several cell types, 
members of which are likely to be within 
a spectrum of dynamic functional states. 
For example, a simple punch biopsy of 
the skin may contain keratinocytes, me- 
lanocytes, Langerhans cells, Merkel cells, 
adipocytes, smooth muscle cells of ar- 
rector pili, striated muscle cells of the 
panniculus camosus, blood cells includ- 
ing immune system cells, and cellular el- 



ements of blood vessels, nerves, hair fol- 
licles, sebaceous glands, and sweat 
glands. Moreover, cells from each of these 
populations will be at various stages of 
development and levels of activation, per- 
forming different functions and respond- 
ing to disease processes or treatments in 
different ways and to varying extents. The 
result is a highly heterogeneous sam- 
pling of cells, each expressing a unique 
set of genes. An expression profile gen- 
erated from a microarray study of the 
RNA in such a sample will thus repre- 
sent merely a snapshot of the genes ex- 
pressed by a plethora of cells at a mo- 
ment in time. Such extensive cellular 
heterogeneity complicates the ability to 
draw conclusions about specific pro- 
cesses occurring within a tissue speci- 
men. An illustrative example is pro- 
vided by Stanton et al ( n who used 
microarrays to identify genes differen- 
tially expressed during myocardial in- 
farction. The expression profiles they 
studied represented transcripts from cell 
populations as diverse as immune sys- 
tem cells, which migrated to the infarct 
region and are responsible for the in- 
flammatory response, cardiac myocytes 
within the ischemic area undergoing 
apoptosis and necrosis, fibroblasts un- 
dergoing proliferation and participat- 
ing in the formation of scar tissue to re- 
place the infarct, and cardiac myocytes 
undergoing hypertrophy to compen- 
sate for the loss of cells in the infarct 
area. M The issue of such cellular hetero- 
geneity was avoided by categorizing the 
differentially expressed genes into func- 
tional categories to look for patterns in- 
dicative of caTdiac remodeling without 
attempting to attribute specific tran- 
scripts to specific cell types. For gene ex- 
pression studies involving samples with 
mixed cellular populations, further in- 
vestigation, such as with in situ messen- 
ger RNA (mRNA) hybridization, maybe 
necessary to localize the transcripts be- 
fore conclusions can be drawn about the 
roles of specific genes in specific cell types 
during the disease process. 

Laser Capture Microdissection 

An ingenious but technically delicate 
approach to the study of complex bio- 
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logical samples has become possible 
wiih the development of laser capture 
microdissection (Figure 2). m This tech- 
nique allows for the rapid and accu- 
rate procurement of cells from spe- 
cific areas of tissue under direct 
microscopic visualization, and thus 
makes the molecular genetic analysis 
of defined populations in their native 
tissue environment possible. 13 Sgroi et 
al M demonstrated the feasibility of com- 
bining laser capture microdissection 
with high-throughput cDNA arrays. 
They showed that in vivo subpopula- 
tions of malignant cells from multiple 
stages of breast cancer progression 
could be separated from nonmalig- 
nant populations, and their expres- 
sion profiles could subsequently be ana- 
lyzed using microarrays. 

The potential is powerful. Specimens 
could be separated into tissue layers; for 
example, separating a skin biopsy into 
epidermis, dermis, and hypodermis. Tis- 
sues could be further differentiated into 
specific structural components, such as 
dermis into blood vessels, adipose, ar- 
rector pili, and sebaceous glands. Struc- 
tures could be separated into defined cell 
types, such as blood vessels into endo- 
thelial cells, erythrocytes, and lympho- 
cytes. Cell types could even be sepa- 
rated into marker-defined subtypes, such 
as lymphocytes into CD4 and CDS cells. 
Expression profiles from refined and de- 
fined structures and cell types likely 
would be extremely valuable in the study 
of disease. 

Potential aside, there are significant 
limitations to this technology at the pres- 
ent time. The standard protocols for fix- 
ing and embedding tissue samples from 
surgical resections were not designed 
to be compatible with microarray ex- 
periments, with or without laser cap- 
ture microdissection. Typically, tissue 
suspected of being important for diag- 
nosis and staging is processed through 
aldehyde-based fixatives, such as for- 
malin, which damage mRNA integ- 
rity." If frozen tissue is available, mRNA 
can be recovered and studied from dis- 
sected cell populations. However, fro- 
zen tissue sections are technically dif- 
ficult to prepare, the histology is often 



Figure 2. Laser Capture Microdissection 
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severely compromised, and the tissue 
available may contain only a limited 
portion of the lesion." 

Moreover, the sample amounts gen- 
erated from laser capture microdissec- 
tion can be small, even as minis cuie as 
a single cclL M Consequently, the yields 
of RNA are low. Arrays have a thresh- 
old for the quantity of molecular start- 
ing material: at least 5 to 15 ug for oli- 
gonucleotide arrays and between 2 and 
100 ug for cDNA arrays, depending on 
the manufacturer, the source of the 
RNA, and the use of signal amplifica- 
tion. 19 * 40 Studies that have successfully 
integrated laser capture microdissec- 
tion with microarray technology have 
used samples of approximately 1 X 10* 
to 1 X 10 5 cells with 95% to 98% ho- 
mogeneity as determined by micro- 
scopic visualization. 36 -* 1 If needed, am- 
plification techniques may be used to 
generate sufficient genetic material for 
microarray hybridizations/ 1 Laser cap- 
ture microdissection is an intriguing 
technology, but time will tell whether 
its potential is realized. 

Although some biological issues re- 
lated to gene expression may be com- 
plicated by the presence of heterog- 
eneous cell populations in studied 
samples, it is also true that some bio- 
logical conditions can be understood 
only in the context of these heterog- 
eneous cell populations. The nature of 
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global gene expression experiments is 
to uncover differences between 2 bio- 
logical samples, including those differ- 
ences based on diverse cell popula- 
tions. For example, to appreciate a 
disease that is characterized by an in- 
flammatory infiltrate, it must be un- 
derstood that the inflammatory infil- 
trate is part of the disease and is part 
of the difference between diseased and 
nondiseased tissue. Thus, the isola- 
tion of specific cell populations for 
study is not necessarily required or even 
desirable in all Instances. 

MAKING MICROARRAYS 
COMPARABLE 

Ideally, microarray experiments should 
be comparable both within and be- 
tween laboratory or manufacturing sys- 
tems, but obtaining consistent and com- 
parable data is a critical challenge for 
microarray-based expression analysis. 
Major sources for the observed vari- 
ability of microarray data include the 
normal physiological gene expression 
variations in different samples and the 
noise introduced in the microarray as- 
say process/ 1 

Physiological Gene 
Expression Variation 

Inextricably linked to the issues of pa- 
tient and sample variability and tissue 
heterogeneity discussed above, is the 
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problem of normal gene expression 
variations and how to distinguish these 
variations from significant disease- 
associated changes. Few studies have 
systematically investigated physiologi- 
cal expression changes, but data from in 
situ hybridizations suggest that normal 
variance for many tightly regulated tis- 
sue-specific genes can be within 20% to 
3Q%. n However, there can be as much 
as 2- to 4-fold random fluctuations for 
many genes in yeast, 43 * 14 Affymetrix 
(Santa Clara, Calif) guidelines have sug- 
gested that for most of the "housekeep- 
ing" genes in human tissues, which are 
likely to be less tightly regulated, differ 
ences of less than 4-fold are probably not 
biologically significant. 43 Conse- 
quently, a significant portion of micro- 
array data variability for high- or me- 
dium-abundance mRNAs may simply be 
due to normal expression variations. Sev- 
eral previous studies have used arbi- 
trary 2-fold change criteria to define sig- 
nificant expression change. 46 However, 
the 2-fold threshold has been shown to 
be statistically invalid even for dupli- 
cate experiments. 46 In a recent study that 
used cDNA microarrays to profile gene 
expression in samples of normal skin 
from breast-reduction surgeries, 71 of 
4400 genes were found to demonstrate 
variability in expression greater than 2 
SDs from the mean of each gene. 47 These 
included genes coding for transport pro- 
teins, gene transcription, cell-signaling 
proteins, and cell-surface proteins. Thus, 
physiological variation should be con- 
sidered in the analysis and interpreta- 
tion of rnicroarray data. More stringent 
criteria for defining significant expres- 
sion change may be useful. 

Noise In the AAlcroarray 
Assay Process 

For the tightly regulated (mostly low 
abundance) mRNA species, inconsis- 
tencies introduced at any stage of the 
microarray-based assay process may 
play a major role in data variability. 42 
Due to the miniaturization and the large 
number of genes involved, it is diffi- 
cult to maintain consistent processing 
conditions for each sequence across 
multiple assays, and obtaining accu- 
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rate absolute signals is unlikely. 42 Noise 
may be introduced by slide heteroge- 
neities, printing irregularities (eg, 
pin-to-pin variations), and spotting 
volume fluctuations. 48 Some of the sys- 
tematic variations may be reduced by 
the inclusion of controls, but random 
fluctuation at various manufacturing 
stages cannot be completely con- 
trolled and can accumulate quickly in 
a complicated assay. 48 

In certain rnicroarray systems, 2 
samples are competitively hybridized to 
1 array using different fluors for label- 
ing. In other systems, there is only single- 
sample hybridization. A 2 -color system 
might be expected to be more reliable 
since variations in. spot size or amount 
of cDNA probe on the chip should not 
affect the signal ratio (both signals are de- 
rived from the samespot). However, this 
only holds true if signals are well above 
the background in both detection chan- 
nels. 41 In fact, the signal level for most 
of the tighdy regulated genes will likely 
be close to the background level. 42 In ad- 
dition, background level on a slide can 
also vary significantly from spot to spot 
due to factors such as unevenness in slide 
surface properties, dust contamination, 
and incomplete washing, leading to high 
levels of signal variability for low- 
abundance mRNA species even in 
2-color systems. 42 

The high levels of variability of rni- 
croarray data also mean that subtle 
changes in experimental conditions 
may significantly alter the results, mak- 
ing it difficult for separate laborato- 
ries to compare experimental data. In 
addition, the lack of standard con- 
trols, the predominant use of relative 
signals (ratios), and the adoption of in- 
compatible data formats contribute to 
poor comparability between studies 42 
Despite the hard-wired variability in- 
troduced by chip manufacturing con- 
ditions, most of the published studies 
to dale using microarray-based expres- 
sion analysis include only limited num- 
bers of replicates. 49 In fact, many stud- 
ies conduct the experiment only once. 
Considering the potential sources of as- 
say variation, the need for sufficiently 
replicated studies is underscored/ 9 



AAlcroarray Data Normalltatton 

Because of variability of rnicroarray data 
for single sample arrays and for fur- 
ther analysis of 2-color system arrays, 
each must be brought into the same 
scale to compare 2 or more arrays. How 
to perform this normalization of gene 
expression levels across multiple ar- 
rays, thus removing systematic varia- 
tion between the arrays and rendering 
different experiments comparable, re- 
mains an issue that is not yet fully re- 
solved. 50 Many of the early rnicroarray 
studies in the literature simply ig- 
nored this issue. A more statistically rig- 
orous approach is needed. 

One difficulty has been that leading 
rnicroarray manufacturers have not 
published statistical error models for 
their products. Thus, users are un- 
clear how much to adjust data for varia- 
tions in spot intensity, hybridization 
patterns, and intensity measurement 
sensitivity. Software does exist to al- 
low for array-to-array comparisons by 
using a scaling factor to normalize gene 
expression patterns across arrays. How- 
ever, in general, these algorithms as- 
sume that intensity differences be- 
tween arrays are linearly related with 
a zero y-intercept. M This assumption al- 
lows software to trim the tails off dis- 
tributions of expression from differ- 
ent arrays at statistical cutoffs and then 
simply move the distributions along an 
axis to a common level to provide com- 
parisons. However, this linear relation- 
ship often does not hold true. 31 When 
the average expression level of 1 array 
is higher than that of a second array, a 
longer tail will be trimmed from the sec- 
ond array. Thus, a greater number of 
genes from the first array will be 
counted as being expressed because 
their expression level is above the sta- 
tistical cutoff point. In this case, the 2 
arrays cannot be considered compa- 
rable. 

Although bioinformatic software has 
recently been developed that offers 
more statistically robust normaliza- 
tion, the cost of these commercially 
available programs (combined with the 
already expensive microarrays) has 
been prohibitive for many research- 
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ers. 50 Standardization of these pro- 
cesses awaits the development of im- 
proved methods of normalization 
leading to valid statistical models widely 
available to all researchers. 

To this end, Schadt et al 51 have de- 
veloped a standard nonlinear curve 
technique for normalizing the data in 
arrays that do not demonstrate a lin- 
ear relationship between data sets. This 
model performs well when the 2 
samples being compared demonstrate 
a low number of differentially ex- 
pressed genes. However, when expres- 
sion profiles of 2 samples vary to a 
greater extent, Schadt et al 51 recom- 
mend a rank-selection method. Using 
this method, genes expressed on an ar- 
ray are ranked from highest to lowest 
level of expression. Then, for the ar- 
ray expressing a greater number of 
genes, the genes with the lowest ex- 
pression levels are removed from the list 
until the 2 arrays Hst a comparable num- 
ber of expressed genesiThis type of 
rank-selection method has gained sup- 
port from other groups, but it too has 
limitations." Removing low-expres- 
sion level dau points restricts the study 
to the more extreme and easily de- 
tected entities, a technique that blunts 
the genomic-scale potential of micro- 
array technology. 

Efforts continue to improve compa- 
rability between arrays. Jones 30 re- 
cendy applied a statistical model to nor- 
malize spotted cDNA array data that 
takes into account not only the differ- 
ences in numbers of genes expressed be- 
tween arrays, but also the interarray 
variations in fluorescent dye intensity 
and mechanical error occurring in the 
printing process. Nevertheless, the is- 
sue of how to properly normalize ar- 
ray data has not been settled. Research- 
ers must continue to demand statistical 
rigor in their comparisons before they 
can believe the mathematical results of 
their data. 

LOGISTICAL LOGJAM 

Microarrays deliver massive amounts of 
dau on tens of thousands of genes. The 
result is an immense quantity of bio- 
logical information that must be ana* 
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lyzed, presented, and archived in a 
meaningful way. 

Data Analysis 

In human studies, the number of hy- 
bridizations that can be performed for 
any set of experimental conditions is of- 
ten restricted by the limited number of 
obtainable tissue samples and by the ex- 
pense of arrays. Restricted numbers of 
hybridizations for each experiment 
hamper the ability to assess the bio- 
logical significance of variation within 
or between given sets of conditions. 
Thus, for the assessment of thousands 
of genes in a setting of limited hybrid- 
izations, the importance of reliable and 
sophisticated algorithms for data analy- 
sis becomes amplified. 31 

A logical beginning is to examine the 
extremes, that is, genes with signifi- 
cant differential expression in indi- 
vidual samples. For example, a com- 
parison of 2 samples can be visualized 
in the form of a simple btvarlate scat- 
terplot in which the expression profile 
of 1 sample (x-axis) is plotted against 
that of the second sample (y-axis). The 
distribution pattern generally demon- 
strates that the expression ratios clus- 
ter around the line in which x is equal 
to y (indicating comparable levels), with 
individual genes falling varying dis- 
tances from this line. Additional lines can 
be placed on the scatterplot to repre- 
sent various fold changes of expres- 
sion. Data points that fall above or 
below these lines represent genes ex- 
hibiting expression ratios greater or less 
than the specified fold change. Thus, one 
can begin by examining those genes that 
demonstrate a 10-fold or greater change 
in expression level. To expand the num- 
ber of genes under investigation, one can 
examine genes that demonstrate a 5-fold 
or greater change, or a 3-fold or greater 
change, and so forth Many studies de- 
fine a 2-fold or greater change in ex- 
pression level to represent significant dif- 
ferential expression. The 2-fold 
threshold, however, as noted above, has 
been shown to be statistically invalid. 46 
Although this simple technique can be 
efficient and effective for focusing on ex- 
panding sets of differentially expressed 
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sequences, again, such an analysis does 
not take advantage of the full potential 
of genome-scale experiments to en- 
hance our comprehension of cellular bi- 
ology that would be provided by an in- 
clusive analysis of the entire repertoire 
of transcripts in a cell as it goes through 
a biological process. 52 A more holistic ap- 
proach, which allows the deciphering of 
patterns from the entire data set, is 
needed. 

Data Organization 
and Presentation 

Statistical algorithms can be applied to 
detect and extract patterns within pro- 
filing data. It is a basic assumption of 
many gene expression studies that 
knowledge of where and when a gene 
is expressed provides information about 
the function of the gene. Therefore, an 
important beginning is to organize 
genes on the basis of similarities in their 
expression profiles." However, even 
this basic tenet deserves critical con- 
sideration. Similarity of gene expres- 
sion profile does not mandate similar- 
ity of function or mechanistic pathway, 
and it maybe purely coincidental. Nev- 
ertheless, the idea of clustering genes 
on the basis of their expression pat- 
terns is well established and cluster 
analysis has become the most widely 
used statistical technique applied to 
large-scale gene expression data. 31 

Although various cluster methods can 
usefully organize tables of gene expres- 
sion measurements, the resulting or- 
dered bu t still massive collection of num- 
bers remains difficult to assimilate. Thus, 
another important component of ge- 
nome-wide expression data explora- 
tion is the development of powerful data 
visualization methods and tools. Ap- 
proaches have been developed that pre- 
sent clustering results in simple graphi- 
cal displays such as dendrograms, which 
represent relationships among genes by 
a tree whose branch lengths reflect the 
degree of similarity in expression be- 
tween the genes. Similarity is mathemati- 
cally defined. 34 The computed trees can 
be used to order genes in the original dau 
table such that genes or groups of genes 
with similar expression level patterns are 

MA, November M, 200 1— Vol 286 , No. 1 8 2 28 S 



Downloaded from www.jamft.com at U.S. Patent and Trademark Office, on June 6, 2006 



DNA MICROARRAYS 



Figure 3. Real-Time Polymerase Chain 
Reaction (PCR) 
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A. ComplemenUry DMA synthesis from source mes- 
senger or total RNA proceeds as with traditional re- 
verse transcriptase PCR. B, A dual-labeled fluoro- 
ses probe (with a higher melting point than the PCR 
primers used for extension) Is annealed to the target 
sequence between the forward and reverse PCR prim- 
ers. A reporter (R) ftuorochrome (usually 6-carboxy- 
fluoroscein) is attached to the 5' end of the probe and 
a quencher (Q) molecule (usually 6 -carbon y - 
tetramethyl-modamine) Is attached to the 3' end. 
C As Taq polymerase extends in the 5' to 3* direc- 
tion, the duaMabded probe begins to be displaced from 
the target sequence. 0, As the Taq polymerase con- 
tinues to extend, the 5' to 3' endonudease activity 
cteaves the reporter (R) molecule from the prabe se- 
quence such that its emission spectra (31 8 run) are no 
longer quenched by the second (Q) fluorescent dye. 
Fluorescence is measured continuously throughout the 
PCR amplification In real time and Is proportional to 
the amount of PCR product generated in each cyde, 

placed adjacently. Clustering methods 
can also be combined with representa- 
tion of each data point with a color that 
quantitatively and qualitatively reflects 
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the original experimental observa- 
tions. 51 Visual assimilation is then more 
intuitive. 

Data Archiving and Mining 

Ultimately, successful interpretation of 
gene profiling studies is likely to be de- 
pendent on the integration of experi- 
mental data with external information 
resources. As multiple experiments in- 
volving multiple cell types and tissues 
from multiple laboratory groups accu- 
mulate, data archiving may well be- 
come the watershed issue. Ideally, all 
data, in a suitably standardized form, 
would be freely accessible in the pub- 
lic domain. Even assuming a willing- 
ness. t.o share the data, such Utopian 
goals would require a user-friendly and 
powerful database system and stan- 
dardization of correction and normal- 
ization procedures such that data points 
from various projects become compa- 
rable. 55 The National Center for Bio- 
technology information Entrez sys- 
tem (http://www.ncbi.nlm.nih.gov 
/Entrez/) does provide useful data in this 
regard, but current databases may be 
limited in scope or computability." A 
major focus of infrastructure develop- 
ment to support genomic-scale expres- 
sion studies will need to be in the area 
of electronic biological pathway data- 
bases and resources. 

CONFIRMATIONAL STUDIES 

The development of more sophisti- 
cated analytical algorithms and data- 
bases will help lend credence to the bio- 
logical significance of differential gene 
expression determined by microarray 
analysis. In the meantime, several stud- 
ies have begun to examine the sensi- 
tivity and specificity of microarray- 
based experiments. Sensitivity, defined 
as the minimum reproducible signal de- 
tected by a given array scanning sys- 
tem, has been reported for microar- 
rays to be approximately 10 mRNA 
copies per cell, which is slightly infe- 
rior to the sensitivity of Northern blot 
analyses™-" Specificity studies showed 
thai for a given probe any nontarget 
transcripts with more than 75% se- 
quence similarity may show cross- 



hybridization* The problem of clone 
misidentiflcation and the need for clone 
confirmation have also been ad- 
dressed tt One study found that of 1 189 
bacterial stock cultures, only 62.2% 
were uncon laminated and contained 
cDNA inserts that had significant se- 
quence identity with published data for 
the ordered clones. 59 Thus, the use of 
sequence-verified clones for cDNA mi- 
croarray construction is warranted. 

Additionally, potential gene candi- 
dates can be assessed for relevance to 
disease using parallel technologies. Sev- 
eral such alternative platforms have 
been used to bolster the importance of 
specific sequences first suggested in 
gene chip comparisons including (1) 
methods at the RNA level, (2) meth- 
ods at the protein level, and (3) func- 
tional studies. 

Methods at the RNA Level 

Reverse transcriptase polymerase chain 
reaction (RT-PCR) is a method often 
used to verify microarray data. Al- 
though RT-PCR is not well suited to 
quantitation, the relative technologi- 
cal ease of this assay and the ability to 
rapidly monitor multiple samples make 
it a useful technology.* 0 * 1 Hybridiza- 
tion data can be verified and multiple 
putative markers can be screened in a 
short period. 

Several other studies have used real- 
time quantitative RT-PCR (TaqMan, 
PE Applied Btosys terns, Foster City 
Calif ) .»« Real-time PCR is a tech- 
nique that increases the quantitative 
ability of RT-PCR by providing accu- 
rate and reproducible information 
on RNA copy number (FIGURE 3). In 
this method, a iluorogenic probe 
(labeled at the 5' end with a reporter 
fluorochrome and at the 3' end 
with a quencher fluorochrome) is 
annealed to 1 strand of the target 
cDNA sequence between the forward 
and reverse PCR primers. As Taq poly- 
merase extends the forward primer, its 
intrinsic 5' to 3' nuclease activity dis- 
places and degrades the dual-labeled 
probe, releasing the reporter fluoro- 
chrome from the quencher label and 
allowing the detection of a fluorescent 
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signal that is proportional to the 
amount of PCR product generated in 
each cycle,* 13 

Northern blot analysis is also com- 
monly used as a confirmational tech- 
nique, as it is a standard specific and 
semiquantitative method, 1 *-"' 61 For 
mRNA expressed at moderate-to-high 
levels, and for which cDNA probes are 
available, Northern blot analysis works 
well, but ft is not well suited for low- 
copy mRNA. 61 * 5 Furthermore, only a 
small number of genes can be ana- 
lyzed with this conventional method. 

Methods at the Protein Level 

DNA micro arTay technology is limited to 
the study of gene expression at the 
mRNA level. However, it has been es- 
tablished that mRNA levels do not nec- 
essarily correlate with protein levels. 
Moreover, the level of expression or even 
presence of a protein is not tightly linked 
to physiological consequences. An in- 
vestigation conducted by Winzeler et al* 
for example, provides a cautionary tale. 
Their study demonstrated that genes up- 
regulated in yeast growing in minimal 
medium did not prove to be more im- 
portant for growth than genes that were 
not upregulated." They found only 2 of 
8 genes required for yeast growth in 
minimal medium to be induced. The les- 
son to be learned is that genes that are 
not differentially expressed may be of 
equal functional importance in disease 
states compared with those that are. 

Furthermore, the regulation of some 
genes maybe at the translational rather 
than the transcriptional level, which 
would preclude detection by DNA mi- 
croarrays. Posttranslational modifica- 
tion of proteins is also an important 
mode of regulation that cannot be de- 
tected by DNA microarrays. Protein ac- 
tivity, particularly receptor activity, is 
heavily dependent on phosphoryla- 
tion, for example. DNA and mRNA re- 
veal nothing about whether a given pro- 
tein is active, and can be deceptive when 
used to speculate about quantities of 
proteins. It has been demonstrated that 
the correlation between mRNA and pro- 
tein abundance is less than 0.5 t AT em- 
phasizing that ideally, mRNA expres- 
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sion studies should be accompanied by 
analyses at the protein level* Radio- 
immunoassay and immunohistochem- 
istry have been used in a number of 
studies. 13 *** These techniques, how- 
ever, are not well suited to detecting low 
levels of expression, and they require 
the availability of an antibody specific 
for the protein to be studied. 

The field of proteomics, the large- 
scale parallel analysis of the proteins that 
are present in a cell, is developing rap- 
idly, but has problems of its own. Pro- 
teins vary in abundance by many orders 
of magnitude within a given cell, and 
there is no PCR equivalent for the ampli- 
fication of proteins. Moreover, proteins 
fold in many known (and unknown) 
ways that affect their function. The fea- 
sibility of the microarray analysis of pro- 
teins has begun to be explored. Antibod- 
ies attached to microarrays can be used 
to bind to and quantitatively detect pro- 
teins that have been tagged with fluo- 
rescent dyes. 75 Skeptics doubt the plau- 
sibility of identifying thousands of 
unknown proteins in this manner. 70 The 
diverse chemistry of various proteins 
poses serious difficulties, and it will be 
challenging to find antibodies for every 
protein. Thus, although it is important 
to incorporate protein analyses into 
expresslonprofiling studies, current plat- 
forms are technically limiting. 

Functional Studies 

Confirming the role of a gene initially 
identified in a microarray experiment in 
animal models with transgene or knock- 
out studies provides a particularly pow- 
erful alternative platform. Transcript 
function, rather than mere presence, is 
addressed. However, this approach is ill- 
suited for high-throughput conditions. 
It may be ideal for an in-depth investi- 
gation of 1 or 2 genes of interest, but it 
is not practical for confirming large quan- 
tities of profiling data. 

Confirmational studies are useful to 
corroborate the biological signifi- 
cance of differential gene expression de- 
termined by microarray analysis. While 
improved databases and more reliable 
statistical models will help to lend 
greater authority to array data, alter- 



native platforms can be used to assess 
the relevance of genes first identified by 
array comparisons. It should be real- 
ized, however, that the alternative tech- 
nologies are not intended for large- 
scale analyses. Realistically, only 
selected sequences from the array data 
can be confirmed with other plat- 
forms in the short-term, a retreat from 
the initial purpose of the genome- 
scale investigation by microarray. 

CONCLUSIONS 

Microarrays can be expected to prove ex- 
tremely valuable as tools for the study of 
the generic basis of complex diseases. The 
ability to measure expression profiles 
across entire genomes provides a level of 
information not previously attainable. Al- 
though complicated issues must be re- 
solved, the potential payoff is big. Mi- 
croarrays make it possible to investigate 
differential gene expression in normal vs 
diseased tissue, in treated vs nontreated 
tissue, and in different stages during the 
natural course of a disease, all on a ge- 
nomic scale. Gene expression profiles 
may help to unlock the molecular basis 
of phenotype, response to treatment, and 
heterogeneity of disease. They may also 
help to define patterns of expression that 
will aid in diagnosis as well as define sus- 
ceptibility lod that may lead to the iden- 
tification of individuals at risk. Finally, 
as specific genes are Identified and their 
functional roles in the development and 
course of disease are characterized, new 
targets for therapy should be identified. 

Despite the problems of defining nor- 
mal, understanding tissue heterogene- 
ity, making arrays comparable, analyz- 
ing and archiving massive quantities of 
data, and performing confirmational 
studies in alternative platforms, expres- 
sion profiling with microarrays stands 
as a truly revolutionary technology. As 
we continue to delve into the possibili- 
ties, we will surely progress in our un- 
derstanding of current issues and com- 
plications. No doubt the ride on the 
high-throughput highway will be ex- 
hilarating. 

Authof Contribution: Ms King and Or Snha both con- 
tributed to the conceptualization and writing of this 
article. 



(Reprinted) JAMA, November 14, 2001— Vol 286. No. 18 2287 



Downloaded from www.jama.com at U.S. Patent end Trademark Of6cc, on June 6, 2006 



DNA MICRO ARRAYS 



REFERENCES 

1. Venter JC, Adams MD, Myers EW, et al. The se- 
quence of the human genome. Sdence. 2001; 291: 
1304-1351. 

2. Fields S. The future Is function. Nat Genet 1997: 
15:325-327. 

3. Lander IS. The new genomics. Science. 1996;274: 
336-539. 

4. Uang P, Pardee AB. Differential display of eukary- 
otfc messenger RNA by means of (he polymerase chain 
reaction. Sdence. 1992;257:967-971. 

5. VeteuletcuVE ( ZhangL t VogelsteinB ( KinilerlCW ( 
Serial analysis of gene expression. Sdence. 1995;270: 
484-487. 

6. lodthart 0) f Oong H. Byrne MC, et al. Expression 
monltortng by hybridization to high-density oftgonudeo- 
tide arrays. Mat Blotechnot. 1996;14:1675-1660. 

7. Strachan T f AWtbol M, Davidson D. Geckmann JS. 
A new dimension for the human genome project Nat 
Genet 1997;16:126-132. 

8. NKhlmoto IN. Hanaoka T, Suglmura H, et al. Cy- 
tochrome P4502E1 polymorphism In gastric cancer In 
Braill. Cancer Epidemiol Biomarkers Pnv. 2000:9: 
673-680. 

9. Furuya KN, Cebhardt ft, Schuetz EC, Schuetz JO. 
Isolation of rat pgp3 cONA. Biochlm Blophys Acta." 
1994;1219:636-644. 

10. Deng C. Lu V, Zlotnlkov G, ei al. Loss of hetero- 
zygosity In normal tissue adjacent to breast carcino- 
mas. Sdence. 1996;274:2057-2059. 

11. Zhuang Z, Vortmeyer AO, Mark Ei. et al. Bar- 
retts esophagus. Career Res. 1996;56:1961-1964. 

12. Hung J, Klshlmoto Y, Suglo K, et al. Allele - 
spedfic chromosome 3p deletions occur at an early 
stage m the pathogenesis of lung carcinoma (pub- 
lished correction appears In JAMA 1995;273: 19081 
IAMK 1995;273:558-563. 

13. Shimada S, Shlomori K, Tashfma S, et at. Frequent 
p53 mutation In brain (fetaO-type glycogen prwsphory- 
Use posxjve fod adjacent to hunan 'de novo 1 cotomc- 
tal carcinomas, Br I Cancer. 200134:1497-1504. 

14. Cc4eK^Krtznianr>B.ETT¥T^^uckAAR.Theger»et- 
kaof cancer-*3Dmodel. WatCenet 1999;21:38-41. 

15. XuJ,Ste1kJA.ZhangX,etal.ldentiflcationofdlf- 
ferenttaDy expressed genes In human prostate cancer 
using subtraction and microamw. Cancer Res. 2000: 
60:1677-1682. 

16. Etek J. Pane KH, Narayanan Ft Mfcroarray-based 
expression profUng m prostate tumors. In Vivo. 2000: 
14:173-182. 

17. Chia SJ. Tang WY. Elnatan I, et al. Prostate tu- 
mours from an Asian population. Br I Cancer. 2000 
63:761-768. 

18. Dong M. Nto Y. Tamura K, et al. Kl-ras point mu- 
tation and p53 expression In human pancreatic cancer. 
Cancer Epidemiol Blomarken Prey. 20005:279-284. 

19. Pettaway CA Racial differences In the androgen/ 
androgen receptor pathway in prostate cancer. J Natl 
Med Assoc 199951:653 -660. 

20. Hata K, FujJwakl R, Nakayima K. Miyaiakl K. Ex- 
pression of the endostatin gene In epithelial ovarian 
cancer. Clin Cancer Res. 2001;7:2405-2409. 

21 . Salda T. Recent advances In melanoma research. 
fOermatolSd. 2001;26:1-13. 

22. Hegde U, Wilson WM. Gene expression profiling 
of lymphomas. Curr Oncol Rep. 20013:243-249. 

23. UuL,YangK.AstudyonC-eibB2,nm23andp53 
expressions In epithelial ovarian cancer and their dini- 

, cal significance (in Chinese). Zhonghua Fu Chan Ke 
Za Zhl. 199934:101-104. 

24. Zhang Z. DuBols RN. Detection of differentially 
expressed genes In human colon cardnoma cells treated 
with a selective COX-2 Inhibitor. Oncogene. 2001 
20:4450-4456. 

25. Oguri T, Kobe T, Fujitaka K, et al. Association be- 
tween expression of the MRP3 gene and exposure to 
platinum drugs in lung cancer. tnt J Cancer. 2001:93- 
584-589. 



26. Gotub TR, Stonbn DK, Tamayo P ( et a). Molecular 
classification of cancer. Sdence 1999;286:531-537 

27. Khan J, Wei iS, Wngner M. et al. Classification and 
diagnostic prediction of cancers using gene expres- 
sion profiting and artificial neural networks. Mat Med. 
2001;7:673-679. 

28. Zhang H Vu CY. Singer 6, Wong M. Recursive 
partitioning for tumor classification with gene expres- 
sion mlcroarray data, ProcNatt Acad SdUSA. 2001; . 
98:6730-6735. 

29. Bittner M, Mdfeer P, Chen Y, et at. Molecular clas- 
sification of cutaneous malignant melanoma by gene 
expression profiling. Nature. 2000;406:536-540. 

30. WehhiB,2arrhkarPP ( Sapliiotol^etal.Aiialy- 
sis of gene expression profiles In normal and neoplas- 
tic ovarian tissue samples identifies canddate molecu- 
lar marten of epithelial ovarian cancer. ProcNatJAcad 
Sd US A, 200158:1176-1181. 

31. Ross DT. Scherf U, Etsen MB, et at. Systematic 
variation In gene expression patterns In human can- 
cer ceO lines. Nat Genet 2000;24:227-235. 

32. Stanton LW. Canard U. Damm D, «t at. Altered 
patterns of gene expression in response to myocar- 
dial Infarction. Clrc Res. 2000^6539-945. 

-93. Abdetlatif M. Leading the way using mlcroarray. 
Clrc Res. 2000;86:919-920. 
34. Bonner RF.Ertirnert^ckM.CdeK.etal.t^ser cap- 
tun; microdissection. Sdence. 1997,278:1481,1483. 
15. Emmert-Bock MR, Bonner RF, Smith PO, et a). La- 
ser capture microdissection. Sdence, 1996274:998- 
1001. 

36. Sgrol DC, TengS. Robinson G.etaJ. In vivo gene 
expression profle analysis of human breast cancer pro- 
gresjkia Cancer to. 199959:5656-5661. 
' 17. Ktlmedd WT, Futscher BW, Dai ton WS. Effects 
of ethanol and paraformaldehyde on RNA yield and 
quality. Btotechnlques. 1994;16:1021-1023. 
18. Dolter KE, Bra man JC. Small-sample total RNA pu- 
rification. Blotechnlques. 2001 30: 1 358-1361. 

39. van Hal HU Vorst O, van Houwetlngen AM. et 
al. The application of DNA mlcroarrays In gene ex- 
pression analysis. J Biotechnol. 2000;78:271-280. 

40. Burgess DC Gene expression studies ustig micro- 
arrays. Clin fxp Pharmacol Physlot 2001 28321 -328. 

41. Kitahara O. Furukawa Y, Tanafca T, et al. Alter- 
ations of gene expression during colorectal carcino- 
genesis revealed by cONA mlcroarrays after laser- 
capture microdissection of tumor tissues and normal 
eplthefia. Cancer Res. 2001;61:3544-3549. 

42. Watson SJ. Meng F, Thompson RC, Aid] H. The 
"ch?p" as a spedfic genetic tool. Bf of Psychiatry . 2000; 
48:1147-1156. 

43. Cho RJ, Campbell MJ, Winieler EA, et al. A ge- 
nome-wide transcriptional analysis of the mitotic ceU 
cyde. Mot Cell. 1998;2:65-73. 

44. Klevea RR, Kauffman SA, Shymko RM. CeDutar 
clocks and osdllators. Ini Rev Cytol. 1984:86:97- 
128. 

45. Warrington JA, Nair A, Mahadevappa M, Tsy- 
ganskaya M. Comparison of human adult and fetal 
expression and Identification of 535 housekeeping/ 
maintenance genes. Physiol Genomics. 2000;2:143- 
147. 

46. OaverieJM. Computational methods for the Iden- 
tification of differential and coordinated gene expres- 
lion. Hum Mol Genet 1999;8:1821-1632. 

47. Cole J. Tsou R, Wallace K, et al. Comparison of 
normal human skin gene expression using cDNA mi- 
croarrays. Wound Repair Regen. 20015:77-65. 

48. Schuchhardt J, Beule D. Malik A, et al. Normal- 
ization strategies for cDNA micro arrays. Nudeic Ac- 
ids Res. 2000;28:E47. 

49. Lee ML, Kuo FC, Whltmore CA, Sklar J. impor- 
tance of replication In mlcroarray gene expression stud- 
ies. Proc Natl Acad SdUSA. 200057:9834-9839. 

50. Jones MM. Researchers attempt to defuse the mi- 
croarray data minefield. Available at: http://www 



• genome web. com/articles/ view. asp? Ar- 
tidea200142175l22. Accessfbffity verified October 
8, 2001. 

51. SchadtR,UC,SuC,WaigWH.ArwJyilnghlgh- 
density oligonucleotide gene expression array data. 
J Cell Bhchtm. 200030:192-202. 

52. Elsen MB, Speltman PT, Brown PO, Botsteln D. 
Cluster analysis and display of genome-wide expres- 
sion patterns. Proc Natl Acad SdUSA 1998:95 
14863-14868. 

53. Bassett DE Jr, Qsen MB, Boguskl MS. Cene ex- 
pression Informatics— tfs ali In your mine. Waf Genet 
199931:51-55. 

54. Aion U, Barkal N, Notterman DA, et ai. Broad pat- 
tems of gene expression revealed by dustertng analy- 
sis of tumor and normal colon tissues probed by ofl- 
gonudeotkie anayt. Proc Natl Acad SdUSA 1999 
96:6745-6750. 

55. Cranjeaud S, Bertucd F, Jcwdan BR. Expression pro- 
filing. Bloessays. 1999;21:781-790. 

56. Kane MD. Jatkoe TA, Stumpf CR, et al. Assess- 
ment of the sensitivity and spedfidty of oBgonudeo - 
We^^rner^mlcn3arrays. Nudeic Adds Res. 2000; 

57. Tanlguchi M, Mlura K. Iwao H. Yamanaka S 
Quantitative assessment of ON A mlcroarrays— 
comparison with Northern blot analyses. Genomics. 
2001;71:34-39. 

SB. BowteO 00. Options avaOable-from start to fin- 
ish—for obtaining expression data by microamn/. Nat 
Genet 1999*1 .25-32. 

59. Halgren RG, Helden MR, Fong O, Zacharewsld 
TR. Assessment of done Identity and sequence fklel- 
Ity for 11 69 IMAGE cONA clones. NttddcAdtfs Res. 
200129:582-588. 

60. Ichflcawa JK t Norris A. Bangera MG. et al. Inter- 
action ofPseudomonasaerughtou with epfthelal oetts. 
Proc NaU Acad SdUSA 200057:9659-9664. 

61. Wang K, Can u Jefiery E, et a). Monitoring gene 
expression profile changes in ovarian carcinomas us- 
ing cONA mlcroamiy. Gene. 1999329:101-108 

62. Wong KK, Cheng RS, Mok SC. Identification of 
differential^ expressed genes from ovarian cancer cdls 
by MICROMAX cONA mlcroarray system. Biotech- 
n/ques. 200130:670-675. 

69. Grove Ds. Quanbtatrve real-time polymerase chain 
reaction for the core facility using TaqMan and the 
Perkln-Elmer/Applled Biosy stems Division 7700 Se- 
quence Detector. J Blomol Techniques (serial on- 
line). 1999;10. Available at: http://www.abrf.org/ibt 
/1999/March99/mar99lndex/html. Accessibility 
verified October 9, 2001. 

64. Raval P. Qualitative and quantitative determina- 
tion of mRNA. I Pharmacol Toxicol Methods, 1994- 
32:125-127. ' 

65. Jung R, Soondrum K, NeumalerM. Quantitative 
PCR, Clin Chem tab Med. 2000;38:833-836. 

66. Winieler EA, Shoemaker OD. Astromoff A, et al. 
Functional characterization of the 5 cerevislae ge- 
nome by gene deletion and parallel analysis. Sdence. 
1999385:901-906. 

67. Garter K. Froteomlcs gears up. Signals (serial 
online!. 1999. Available at http7/www.signalsmag 
.com/slgnalsmag.nsf/657b06742bS748e888 
256570005cba0l/f8a34b7efde4eb6c882 
S68lc0O0b8a96. Accessibility verified October 16, 

68. Shlrota Y, Kaneko S, Honda M, et al. Identifica- 
tion of differentially expressed genes in hepatocellu- 
lar cardnoma with cONA mlcroarrays. Heoatohzv 
200133:832-840. 

69. Ston M, Zepter K, Kamarashev J, et al. Coex- 
pression of CD40 and CD40 tigand In cuteneous T- 
cdl lymphoma (mycosis fungotdes). Cancer Res. 2001 ; 
61:452-454. 

70. Dalton R. Abbott A. Can researchers find recipe 
for proteins and chips? Afafore. 1999;402:718-719. 



2288 JAMA, November 14, 2001— Vol 286, No. 18 (Reprinted) 



©2001 American Medical Association. All rights reserved. 



from wwwjamaxora at U.S. Patent and Trademark Office, oo June 6, 2006 



fl^ £*p. md 5>b| 



MOLECULAR ANALYSIS OP CANCER USING DNA AND PROTEIN 

MICROARRAYS 
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OfTSODUCnON 

^Substantial progress, to been made to ov vodentandrag of era e ■ mpWrtrp. 
cample* disease ihit involves progressive changes in the patera* tad proteose. Various 
types c*csitem*h*»aimtt«rrtitssj^ aid 
molecular tfiiu. MJooarmy technologies hm the potential of providing valuable insight 
regarding disease processes. Use artsy format bwvn cerab&hed method far global 
analysis of nucleic adds, and to the pas few yean mJs approach has baen adapted for 
protein studies (Table I). KficmmyB allow profiling of tumors* genomes, trsjucripiomes 
and protromei at a scale r — ' *" 
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CAN CHI fftOSUiNC USING DMA MICROAIUUVB 

Oeasmies studies, especially profiting gene espeatiofi, using DNA artsy have bad a 
otmendous Impact on biomedical leseaicb. resorting in well over 1,000 published reports 
in 2002 alone. A substsntisl nombo of published (todies dealt with cancer. Disease related 
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rppllceaons of DNA microarrays include uncovering unsuspected associations between 
genes and specific clinical features of disease that are helping devise novel molecular based 
disease classifications. Most published tumor studies using DNA microarrays have either 
examined a pathologically homogeneous set of tumors to Identify clinically relevant 
subtypes, for example survivors vs non-rurvi»ors. or pathologically distinct subtypes 
belonpog to the same lineage, for example limited stage vs advanced stage tumors to 
identify molecular correlates, or rumors of different lineages to identify molecular 
signatures for each lineage. 

One of the landmark studies thai have attracted ouch interest with respect to the 
potential contribution of DMA iroc/oarroys to uncover novel classes of rumors, is an 
analysis of diffuse large sVcell lymphoma, the most common subtype of non-Hodgjuos 
lymphoma . Large B-otll lymphoma is a ctittiealty hetErogenooaa disease. Only 4M of 
patients have a good response to current therapy with a prolonged survival. A systematic 
characterisation of gene expression in this disease using DNA asknuu i ay* uncovered a 
diversity in gene expression that reflected variation in tumor proliferation rate; host 
response and differentiation state of the tumor. Two iDutocuiarly distinct forms of diffuse 
large B«cdl lymphoma were unargued which had gene expression panares btuscfrive of 
differeot eteges of B-oeU oWerentiatioa. One type cxpicssad amies characteristic of 
germinal center B ecus and bad a tJffJUftcandy better overall survival than the second type, 
which expressed genes normally induced during re vino activation of peripheral blood B 
cells, The amlysSa mere/ere identified previously undetected and cUnicaUy significant 
eobtypes of lymphoma. 

Stffdies u> classify breast carduornss based on gene expression profiles revealed that the 
tumors could be clsasified Into a basal eplutellal*tike group, an ERBB2-o%«rexprc*siag 
ayonp and a normal breast-like group* 2 *, a luminal cpiitehal/estrogen recsptox*pca1uVe 
group could be divided into at least two subgroups, each win n dtstiitcttve expression 
profits. Survival analyses on a subeohort of patients with IccxJiy'odvanced breast cancer 
uniformly treated, m • prospective study, showed sismifioandy different outcomes for 
P°tiems belonging to the various groups, including a poor prognosis tor the basal-like 
subtype and o significant difference m outcome for the two estrogen receptor-positive 
groups. In an iadepenecm study of 38 invasive breast cancers, striking tnolccsuar 
differences between ductal carcinoma specimens were uncovered that ted to a suggested 
new classification for esuogaiHtJcapter negative breast cancer". Similarly, a study of 58 
node-eegstive breast carcinosnas discordant for BR status also uncovered a list of penes 
which discriminated tumors aeeesdrng to £R ttxtU3 a . Artificial neural rtetwerex could 
accurately predict ER status even after excluding top discrimlnstor genes. Including BR 
itself. Only a small proportion of the 100 most important ER dsscfmunauw genes are 
rcgulaied by estradiol la MCF-7 cells. 

An Informative approach to analyze DNA micfoarray data in clinical stadias is to divide 
such data into a oaining set to uncover associations between specific genes and certain 
clinics) features of the disease, and • testing set to validate these associations. However 
since both training and testing sets are derived from dm same pool of patients whose 
samples were svailable to tire, investigators, the extent to which such -r~u*rrmr may 
apply to other patients not included In the study, who may have different characteristic*, 
cannot be inferred. Beer ei al. have undertaken a study of lung cancer in which the 
association they observed between a set of genes and patient survival wss validated with o 
testing set of rumors they had aviiiabla and further validated with en ^-rrmftm set of 
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tumon for which microarrey data was collected by another group no: associated with (hit 
mi/ 9 . Such ci tensive validation clearly indicated the robustness of the associarioo 
uncovered bttweco o set of genes and survival tn lung ederrocarcmoma. 

The numerous published studies using DNA micraarrays justify the use of ihb 
technology for uncovering patterns of- gene expression that are clinically informative 
However it la substantially snore difficult to develop an uoden Lending of disease si o 
rntchardstic level using DNA uuuuai i ayt . for enost of the published studies it is unclear 
hoar well ANA levels reported correhse with protein levels. A lert of correlation may 
bupiy that site prodiedv* property of die genefc) Is mdepemdem of geno function. In studies 
of lung cancer, Chen ct al. collected both DNA microarray end 2>D PAGE data* which 
allowed them to compare mRNA and protein levels to the seme tumors 10 . The integrated 
intensities of 16$ protein spata representing protein produces of 98 genes were analysed in 
76 lung adeitocatcinojnas and 9 unaffected lung tissues using 2-D gels. Far the seme 8S 
samples, mRNA levels woe detcrnuncd using oUgenuelcoddo microarroys. Osjy 21 of 98 
genes C21.4%) had a statistically sigrtlftcani correlation between .protein end mRNA levels 
(r >OJ244S; ? 4>j05). The tnRNA/jpoteio ceoreUoon coefficients also varied between 
unarms of the seme protein, mdrceting potentiaJly isof6ro>speoftc nrntthartiam tor the 
regulation of protein i 



rROTElN IdtCBOAJtfUYS IN CANCER RESEARCH 

Despiie the advances tn our undrntaading of the molecular basis of cancer, substantial 
gaps remain both m our umlentandbig of cancer pathogenesis sod to the devcluyuwcni of 
effective soestgjes for early tf sgnesis and for creatmcoL A prc4oonsJc approach to 
uvestigatrng dUeases such as cencef any ov ercome some of the QrntatfoAs of other 
approac hsaV DNA m luoaiiaya have IfanJted utility for the analysis of blotoglea] fluids end 
for cmcovering directly in the fluid, ostayable brairortere, Ncrrxrous aheraunns may oceur 
tn protcses thai ore not reflected in changes at the RNA level* 

Unlit* DNA rrucioarrays that provide one measure of gens expression, namely RNA 
levels, there is a need lo taplemtnt protein eetcnarray strategies that address the many 
different features of proteins including determination of their tevele tn oJotojpceJ samples, 
end detent nation of their selective interactions with other nraSeihe, entibecfies, drugs or 
various small ligands. Arraya that ineoipcmue arm'bodicr^ or leeomlHnsm proteins 
obtained using cDNA expression IP^ereW 941 or phafe^displsy libwiee 4, have been 
utilized for different types of protein based assays. With other types micToarraya, whole 
tis s ue-d e rived samples neve been cSrccdy anmoJ onto slides, to asset* the reactivity of 
total protein lysates with specific Kgantf*** 5 . two practical applications of protein 
microaneya warn pieseated by KiMfilra* *. designated protein funcdon array and protefn- 
detectfng array. With protein function arrays, s large amount of protein is spotted en s solid 
support el a denned location and tested to characterise either a mcc hemic a) activity or a 
molecular Interaction, The Dratdn^cfecting array consists or en arrayed set of protein' 
Uganda used to profile gene eapression and draw signatures mcScadve of the cellular state. 
The whole process of assembling die protein array recjolres considerations related to the 
nature of the support, the typo of uamobUisailort, as well es die molecular architecture of 
the particle being snanhed. Four main different supports have been optimized to perforin 
assays of ihfs magnitude: chemically modified glass slides (poly;Ulysines, cttryaldchidns. 
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these samples. Patterns that distinguish berween cancer pau'ertts and normal subjects with 
remirknblo accarecy. have been reported for several types of earner 1 . The coupling of 
protein arrays with mass rpecoxrmttry technologies if likely to become • powerful analytic 
tool with which to profile protein expression. Such en approach, known as r>oaroChi£ 
(Gphtrgen Inc. USA), was socct^fuhy applied to study prostate and ovarian cancers 
and, more recently, heed and neck careers* 9 . Result) from tttnse studies revealed the 
involvement of proteins in c ojciii o gcna ab processes and specific ally identified protein 
fingerprints from which cancer biomarkers were eioacted. The major drowbocls of direct 
analysis of tissues or biological fluids by MALOl is the prefcjentief detection of proteins 
with a lower molecular mass and die difficulty ra utenaryiBj.|he proteins corresponding to 
die masses observed, further oxhnologica) ianmrvemertts could erthsoce dm odKiy of 
direct mass spectrometric analysis of cJsaaes and biological fluids, 
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Cancer' tissue profiling studies mat have utilised protein intooarrays am beginning to 
emerge.' As • model to better understand how patterns of protein expression shape the tissue 
nsoocnvuonmem, Knexcvtc et al. analysed protein expression in tissue derived from 
squamous cell carcinornas of the oral cavity through on sntibody luitroairay approach for 
Mgh-throoghpul proteomlc anoJytis . Utilising laser capture mterodrsserton to procure 
total . protein ftom specific microscopic cellular populations, they demunstrated thai 
ouantiuuivev and potentially qualitative, differences in expression patterns of multiphi 
protein wtlhm epithelial cells reproducibry correlated win oral enity tumor progression. 
DifferendaJ expression of multiple proteins was found m stromal cells surrounding and 
adjacent to regions of diseased epithelium uuttctaectry corrclaied with rumor progression of 
the epithelium. Most of the proteins identified m both cell types were involved in signs! 
transduction pathways. They hypooHcsned Ihcreforo mat extensive rnoJecular 
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comnfbntcaiiora involving complex cellol&r signaling between epithelium and stroma pray 
• key imJajUf^h^ oral cavity cancer progresiton. 

A clinically relevant apptioatiun or protein nucroafreys is the identification of proteins 
that induce an antibody response in autoimmune disorders 0 . Microamys were pro d uce d 
by attaching several hundred proteins and peptides to the surface of deri vented glass sGdes. 
Arrays were incubated with patient serum, end fluorescent labels wen used to detect 
autoantibody binding to specific proteins in avtoHnmtme diseases* inc luting systemic lupus 
erythematosus and rheumatoid arthritis. Such niic ro ttmy i represent e powerful tool to 
study immune responseviD e variety of diseases including cancer. 

A reverse phase protein amy approach thai iramoWUxes the whole repertoire of a 
tissue's proteins has been developed". A high degree of tensiuviry, precision and titxeariJy 
was achieved, meting it possible to quantify die phojphorytated status of signal proteins in 
human tissue ecD subpapolazions. Using this approach Pa wel cures al. 48 have Itrnjltudirmfly 
' analyted the state of pro-survival checkpoint proteins at the ntferoxeople cranstioo stage 
from patient snBtched bistoiogicoJIy norma} prostate epithelium to prostate i n t mpJ t hxh il 
oeopraria and to invasive prostate cancer. Cancer profusion wu associated wish fnrrnaml 
phosphorylation of Aid. suppression of apoptosis pathways* as watt as decreased 
phospnoiylation of ERR. At the transition from rustologlcaljy tumna) epithelium to 
Intraepithelial neoplasia, a statistically significant surge bs phospnorytafed As* and a 
concomitant suppression of dowevtrearn apoptosis pathways preceding the tasltfon fata 
invasive carciaoma were observed. 

A major challenge In snaking Mochfps for global anarysla of protein expression is the 
current lack of comprehensive acta of genome scale capture agents such an antibodies. Aa a 
result, bloc hips thai target specific classes of proteins such as sonascs or cytokines are much 
caster to produce, that would have clinical utility. Another Inrportanl consideration is 
protein oticronrrays is that proteins undergo numerous post*trsnslatiorial modifications eg 
phosphorylations, glycosylations, vmcb ore highly Ifnportent to thdr functions, as they can 
dettTrrane activity, stability, localisation and turnover. To eddreu ihe need for 
comprehensive analysis of proteins in their modified form several approaches to the liquid 
based eeparatlon of cell and dune ryiaxes were investigated in order to obtain protein 
fractions with reduced cooipleairy or pore individual OTOtoma". Ttm separation products 
can be artoyed m a manner thai allows the probing of protein constituents of cells and 
tissues to oncovcr specific targets. Tor example, using a combination of anion exchange 
and reverse phase LC Madna^urpide ct at have obtained soma 2000 individual protein 
fractions drat have been utilised to produce rrucroarraya thai mserrogate cancer otil 
proteornca, fractions thai react widi specific probes are within the reach of • 
chromatographic and gel based separation techniques for resolving their individual protein 
constituents and of mass spcctxumctric techniques far identification of their c onsti t uen t 
proteins. The LC piocedurea allow sufficient protein amounts to be resolved for the 
construction of large numbers of nncfu&rrsys from a given cell or tissue source. 

Protein microerrays of afferent types are Kfcely to become conunercfaDy ova* labia for 
assays of broad sen of proteins and may well rival or at least complement DfcA 
microarreys aa tools for global expression analysis. 
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In conclusion, array-based techoologitt have emerged that coatribulc to profiling 
tissues at die genomic, owcriptomic and proteomic levels. Analytical tools an needed to 
mine the vest amount of data generate* intimately the molecular analysis of cancer at • 
genome and preteome scale will allow better classMfcatfon of disease and tailored 
indivitfaajizcl therapy for iodtvidoal patients. 
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Abstract Novel and noverfol Cccfcootogto each as DNA 
■aicroamys sad proteomJai ban made poaJbft Cbi enslycte of 
' lbs aprcsJon tevcti of nmJtipte geees Tlrv i "" H fir nry bath fa 
health and disease. Id comhhution, these taAnloxto promise to 
rtTotntfonfat biology, to f-'t"?** fa too area of — "'f **hr 
mcdidsc n they an cjueue d to meal gene rcgnfenoa crests 
brorred fa disease seogressfoe as wcTJ as Co pbpoba pobetissl 
targets fot dreg discovery end dburnastJos. HeVe, tm icvfar* fe 
current states of these technebgiei and Ugtfir^ sores tfaale* fa 
which thry hare been applied to concert to the esaJyiB of biopsy 
gp trimrni © 2000 Feeeranea of Enjuecaa BhxtmkaJ Soci- 
eties, PtabUsbofl by Efaerter Selene* bXV. AD rights resxrved. 

Kef words: Ccoe expression profile;. Mfcrotmy; frotcorne; 
Two-dlfneojioaaJ t^lrJoctropboresb; InmmacAirtocfccmatry; 
Bladder cancer 



f. fatffpdoetfoB 

II b oow o little more than 10 yesjs since the Human 
Genome Project was bunched and during this relatively short 
period of time there have been rcma/taMe advances fa ihc 
coostnictfon of physical and genetic naps aa wrD u to the 
idcoiificotioo of gross associated with human diseases flj). 
Soon, the total sequence or the human genome will he deci- 
phered and, hopefully, nude available to researchers world* 
wfde for the bcne6t of mankind. 

Uodoubtedly, the Human Oenome Project has paved the 
way 10 the revolution fa the life sctcnoes that wo arc expert* 
cociag today. Gradually, however, tU focus is starting to shift 
towards functional genomics, on area of the neal-genensk em 
Urn deals with the functional analysis of genes and then- 
products (sec the article by OofTeau is this issue). Techniques 
of functional genomics include methods for gene apression 
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profiling at the transcripl (DNA mtcroarrayv |3-5]; see aim 
article by Brazma and Vifa fa this issue); differential dbplay 
(6); serial analysis cf gene tauimiuu f7-9), and protein lcvcb 
(prc^eornies) Q 10-12); see also article by Andersen and Mann 
fa this itnteX na wdl ju traniprntes |l% phage disptoy (141 
procedures for studying prcjtebvprotein tatercctioas ([15.161; 
see ano article by Ugrem and Sera to this tsroc) and bioin- 
formano p 1| 

Among the tcctaiaucs of factional gne^nM both DMA 
cafanmrrsyt(P^t^relcrenm^ |I0W 
12) bold great promise for the study of csmpha btntogfaal 
aysamwitb ac^Itcatiem in rnofacubr csedhane. Tbesc novel 
and powerful gene expression profiling technhrues nerroii the 
analysia of the cjpr c aiten levels of thousands of genes ifaru)- 

wu nt fc s e umty, allow hnjlMhwing^innv and fa comhtontioo 
ajeeapceted to generate a vast amount of gene and protein 
eipfcatoo data that may lead to a better understanding of the 
Rgnteiory events hrvohod fa ooitaal end disease ptcjecjsea. to 
a d dn lon . these technologies offer n syYtcmatfe app r oa ch- for 
searching for cfTective tergstj for drag o^heovery and cuagnos- 
tics. 

Here, we review the current state of DNA uiUiuajiays and 
protcomio end hjghnght some atndfas fa which they have 
been applied fa corner! to ths asaxytis of biopsy aperiinrni 

1. DNA mlueeireja 

The amount of uifofmation that is now bceosrung aveflabk 
to researchers in the life sciences b eTytoding, and eveo 
though the data can he stored fa conventional esedfa, new 
nrrth o d s an being required to analyse targe sets of genes fa 
a tfgb-througfcput fashion. For Ibis purpose the DNA array 
technology was deveJoscd. The method makes it passible to 
survey thousands of genes in parnDdl end has several areas of 
appheation. One b apression monitoring (10), fa which the 
transcript lcvcb of genes arc measured in different physiolog- 
ical conditions both in cultured cells and tissues, to search for 
regulatory cxntession patterns. Understanding patterns of ca- 
prcssed genes b expected to improve our knowledge of highly 
complex octwork* that cms ccmumrnicate In hitherto un- 
known ways, both fa .health and disease. Another area of 
application b polyrnorphem analysis (19} In thb case, poly- 
morphic regions of the genome arc scanned to search for link- 
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age to diseases, and 10 reveal disease susceptibility etna and/ 
or inherited disease gents. A simitar approach has beta used 
to analyze polymorphic regions of known gents, in particular 
to deienaine whether polymorphisms are associated with as 
altered function of the gent product, a fact that may bcrcase 
the rusceptibQiry to disease. Finally, various attempts have 
been made to utilize DNA arrays for sequencing (10). ■ 

Below we review the technology currently io use for mttro- 
array-geor rated gene expression pattern discovery as well as 
some applications. Table 1 provides a set of World Wide Web 
rites that contain useful additional tnfonnatioo (sac abo the 
article by Brmxma and Vito in this issue). 

2. 1. Micrvarrjy technology 

Microarrays are usually made by deposition of DNA spots 
on a solid support like a coited glass surface, that diners b 
■ever*! wsys from conventional tiicr-bated supports such as 
charged oyton and oiiroeettulosc. The 6a usees of the glass 
surface makes it possible (T) to array motocula b a parattd 
fashion, Co) >o trJmarurisc the procedure and (tii) to use fluo- 
rescent dyes for detection. There is no diffusion of the epphed 
maicrtal ato the support, thus allowing focusing for User 
acanobg miemscopy. 

Two main procedures have been used to produce DNA 
chips: photolithography as developed and marketed prbsnuOy 
by Affrmxtrix loc (Santa CUra, CA, USA) (29,21) and on> 
chanical grjddmg (22). Photoihhoeraphy b well known in the 
computer chip industry and (mimes an ultraviolet light tome 
that passes through a mask that directs b a step-wise mamxr 
where a photochemical reaction <on^oudec4sde ryuibesii) 
takes plaoe on a tuioonbed glass surface. The mask con be 
produced with openings as small as a few micrometers allow- 
ing a dcocity of crrcral hundred thousand probes per square 
cra rimttt t of glass. There is, however, an mhcrcat length re* 
itrtotion with this in situ synthesis technology Gmhiig the 
probes to about 25 ouef eotida in length. Higb-dcoxity arrays, 
on the other hand, allow the use or multiple probes per gene 

pq. 

Mechanical gridding methods are based oa rni-jet or phys- 
ical deposition of the material using pins tnaaitfactured whh 
very high pmetston. There is direct surface contact and the 
transport of small amounts of liquid makes these systems 
fusoeptibk to evaporation and oontammatton with dust par- 
odes. The grtddiog tnstrumenu use nn XYZ motion control 
based oa step engines that can be cootroOed with very nigh 
precision. The DNA containing raateruU can he spotted from 
96 or 3S4 weO plates to glass b predefined pstterns. 

The arrayed probes can be ohgos (photolithography and 
gnihCog) or cDNAs (erlddbf>. The bybridavtfea reaction 
conmtions are quite different b the two coses, and special 
sample preparation- are needed to optimally uttfits these 
probes. With shorter probes. It, of 20-50 nucleotides, the 
sample* is fragmented to avoid tertiary structures and to 
achieve optimal hybridization (18,30). Polymerase chain reac- 
tion (PCR) amplified probes of 300-2000 nucleotides usually 
do not require fragmentation of the sample (23). 

The type of glass used as support, the coating substanm, 
the coupling technique, the htbeUing system, and the fiuorcs* 
cent labeb used for detection are all variablo that must be 
optimised. A Dumber of coating substances am commercially 
available to immobilise DNA to the surface . Two types of 
a tides arc available that use a coating procedure designed 



for printing of amine modified DNA: these include (i) su> 
tated slides, which contain reactive aldehyde groups that react 
with amioo-groups via a Schiff base formation as the printed 
DNA dries on the surface of the slide (available from Tele- 
chem Inc., CA, USA; ail Associates loc, TX USA; for 
coupling chemistry see bttp/rwww.srrayitcxn^^ 
coupling/) and (n) activated slides prepared by the covalent 
anadunent of a bydrophilfc, polymeric amine reactive coatbg 
to suaee basc^coated slides as described by Brier and Hobriscl 
P<) (available from Sormodia Inc. MS. USA). Amins-modi- 
had DNA attaches oovalently to the activated, por/mrrfc sur- 
fane. 

Three other types of slides arc available that are based on 
more conventional fftsrobflbatioa technology r o u t inely used 
b membrane cmmcbilaatioo: these bcrudc (0 sttaabed slides 
which carry covaleotly anarhrrl primary amines oa the sur-. 
fane that can form Ionic bonds with the phosphate bsckhonc 
at osutml pH. In anclidoss, the tadicai-based coupUng between 
thymidine icriducs on the DNA and carbons oa the olbyl 
amine of the snbstmte cao be induced with UV or heat <pm- 
viced by companies Gke Tcbehcm loc, CA. USA; Sterna 
AMrteh, lac, MO. USA; Cannae, loc., NY, USAMU) eitro- 
oxlliihwtesed por/tttcr-coated slides that posses tha hmdmg 
and tawooWnzatioa pr o peitks of nitroccOuhne that binds 
DNA la a non-oovehmt but irreversible assjsacr ^chseschcr 
and SchueuX and (hi) e^lysbc<oated slides whkh require 
UV erosx&Jcbg of the DNA (avnibbfe from Sigma AMrieh 
tnc, MO, USA; sea oho IjtYpnxsnaw^stanfo 
prctocols/l_sEdcs.imii!). 

2J. Qumrttnrtos tht Jtpvk/rtm aray$ 

A Ibear icnpoasc that coven two or three orders of enag- 
nituoa Is often needed to detect low and high copy number 
transcripts on the same array, la casea where this e? set pos- 
sible ft may be necessary to scan tha chip at dttTcrcot wave* 
lengths, or to amptir/ the signal with an immune s an dw ic h oa 
top of the bound sample (25). la the latter cows, the first 
scaaabg is carried out after hyartnmattoa of the labelled sam- 
ple, end the cecond after reaction with the labelled antibodies. 

It is em u c taor y to d o cument the anearity and tepnaduettrfUty 
b each step of tha lYtccadarc, and snmetm^ 
to probe to obtain reliable data. Often, a csandard sample Is 
used to compere with tha cxperta atal sample and this may 
cc^npcnsatc for deferences b hybridisation frosn probe to 
probe. 

2,$. SundBrditBitoB 

Comparison of data obtained from bdepcadcat arrays aad 
from different laboratories itquiics standardisation. Both the 
Affymetri* chips end the custom made cDNA chips use differ- 
ent methods for stsadardisadoa. The Affymetrta chips have 
. approximately 20 probes per gene and standarotetson is ci- 
ther cased on the cipmssbn level of eclected grnca, Eke oetb 
aad OAPDH, or oa e setting of the global chip intensity to 
approjJmatrty ISO units per gene on the chip. In this way, 
chip data from different cj^sli intuits can be compared to each 
other, la our hands, the data obtained with the two staadarcV 
isntfon methods differ only by approximately 10% (unpub- 
lished observations). 

'The cvstoawnade cDNA or obgo arrays also require cun- 
dardbition, bat this is a complcs problem. Io geserat, the 
standard used often reflects the purpose for which the array ' 
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was produced. For example, for expression monitoring of 
breast caocrr cells, a couture of breast canoe r cell Uses nay 
provide a good standard (26). Today, however, tbrre is oo 
golden standard tbn can be used for all purposes aad as a 
result, it b difficult to compare data from different laborato- 
ries aad often it is accessary to use other technologies soeb as 
Northern hybridization, rcaMuoe PCR or urummostaining to 
validate the signals. A minimum requirement b that bbora- 
lories that produce arrays themselves should be able to repro- 
duce data from ooe chip generation to the next baaed on the 
analysis of well-defined controls covering different genes and 
espressioB levels. In addiiioo, it is coaunoa to use spikiog of 
samples with bacterial genes that hybridize to probes, spotted 
for control pu/pose oo the arrays. 

2.*, Samptu for txprution monitoring 

The analysis of relatively homogeneous ceO populations 
(doned cell lines, yeast, etc) has proven couch simpler than 
the analysts of tissue biopsies as the latter often contain essay 
cell types (ephhebai, endothelial, inflammatory, nerve, nruscte, 
and connective dssue cells) that arc present in variable 
amounts. Standardization may require microcUssecuon of. 
the tissue to isolate specific ceil types [27,26*), although the 
ourebct of oefls needed for the assay is vcQ above a million. 
Sampling of specific edl types using laser capture nucrodisscc- 
tion (LCM) |79) can be a ttge*conj\naing task, and ajvco that 
mRNA is prone to degradation the processing tiarc enust be 
kept to a minhnnm If only a small amount of enteral is 
available, theq a revcrge *an«ription-PCJl step is accessary 
for amplification, but this adds an addhioaal ctnnp oration 
doe to the lack of linear amplification of all transcripts, to 
ooe of our laboratories we have used pscparaooaa of single 
cett types from tumor biopsies to ttandardia poobng of senv 
pies for geaeratng profiles of gen expression at different 
stages of tumor d e v e l opm ent (manuscript in preparation). 

2.1 BioinformaUc anatyjir of exprcsfitm data 

2.5. I. Hltrarddeol sbuitr algorithm* Hierarchical duster* 
tag Blgonthmi can be divided into two types: agsjonzerabve . 
aad divisive (Fig. I) fJOJ. The aggjomeretivc method is a bot- 
tom-up approach, where the algorithm starts with * separate 
dusters (for example 4000 genes, where «s^000) and pscccs- 
sively combines dusters until only one b left The drvtafn 
method, oa the other hand, b a top-down approach etartmg 
with one cluster and sacccssively splitting dusters to produce 
others. The algorithm used to form the dusters roust also be 
defined; two widely used and simple algorithms are the single 
linkage aad average linkage methods* respectively. Single link* 
age, also called nearest neighbor, defines the disUace between 
two dusters as the minimum distance over aO pairs of dusters. 
Average linkage takes in consideration the average distance 
over all (30). 

A distance matrix must be calculated before clustering is 
performed, and it b the distances between the obtained gene 
expression profiles that arc used to form the actual dusters 
(30). Observations with small distances are sjrovped together 
as described above. The two most commonly used distance 
nteasuremeou are the Euclidean distance and the Pearson 
correlation coefficient. 

Euclidean data nee : ED « ^{X*— Jfp) 1 
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F% I. Two mam types of ensuring algorithm*, the birra/duca) 
end lbs iWD^UefejchsaJ algonthmx 
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*a b the cscajuitsneot for death variable en sampling unit L 
V% b the variance of the Alb. variable (30,31). 

No**hlaoTthttal chattr algorithm*. In oon-hicr- 
aichioal duster analysis U b assumed that the data can be 
divided iato a certain ember of etustcn and that they are 
well separated The advantage of tab approach b that targe 
d tta seta can be clustered much faster than by using rUererchJ. 
ca^astcring because a lower Dumber of dusters b aoissrad 

ysb b a-eataas. However, a method termed sclforssmcaed 
maps (SOM) has raocntly been apptiod to capitate* data 
geaeraied from DNA chip arraya PU3}> The fc-rnxanj meth- 
od fM) identifies ft points that function as cluster centers. 
Each data point b then anignrd to one of these centers in a 
way that mrnmthrs the sum of the disfanm between all 
points and their centers. Thus, it b the dutributioa of points 
that decides the value of the eacanj. One drawback of this 
method b that -a specific number of dusters b assigned, as 
the number of chatters b usually unksowv io large data 
sets. SOM b sbnrav to the ft-faaw approach, but It has a 
geonsetrical con figurati on and the number of nodes pscoxfines 
thb cooflssimtmn. InJtlaJly, data points are cnapped onto the 
gcoxnctiicel configuration, Wbco dusteriag the data with 
SOM the pothfon of a node cnigratca to fit the data points 
during rueccsatve iterations fttJJ). 



2.XM. SkfinUt** timgUMlom, Common to the clustering 
methods m which array data are used b that they are unsu- 
pervised, La> oo predefined references are knowo. An alterna- 
tive option b to construct o i 
that requires at least two references. For 
for example, the references could be the j 
files from normal and mvasfve tumor tissue. Io thb particular 
case, a vector representing gene capression over *i genes caa bo 
used to deacrfibc each tissue to a recent aructt by 

Oolubaod co-wosken pal the authors analysed 6817 genes 
using )8 bone marrow samples. Based oa these 38 samples 
they found that o vector based on 10 and 200 genes was. 
sufficient to distinguish besweco acute myeloid leukemia and' 
acute rymphobksstic leukrmb. Thus* the authors were able to 
oaasirvct a cancer classifier based oo a low number of genes, 
Usiag a similar vector-eased classifier approach. Brown and 
coworkers (37) analyzed 2467 yeast geees in 79 dmerent ea- 
pcrimeau aad were abb to dassify genes Into functional cat- 
egories based oa the csamessfoo data from DNA chip arrays. 
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2.6: Applications of arrofS for expression monitoring 

One of the main areas for array application is in the omul- 
uneous monitoring of thousand* of transcripts fo different 
biological sellings. The approach b bong used to identify 
new networks ond to understand patterns of expressed genes. 
A number of articles have been published using the array 
technology aiming at identifying diseas*»axaoriated alterations 
in humans. For this purpose clinical samples, human cell tinea, 
and ta a few cues animal models of human disease have beta 
used. 

16.1. Gent txprtssion profiting of iUsve biopsies, erf/ Rnu 
and animal modtls of disease 2.6. IJ. Tutus biopsits. Qcse 
oprcuion studies on clinical samples have been performed m 
breast and colon cancer (26.9S), as wdl as is atherosdrrosis 
P°.40J. Genes of presumably- known functions have bees 
identified aad linked to the diseases; most of these data arc 
now aveuablc in the Internet {Table I), to a study of breast 
cancer that used diniea] specimens aad cell lines, with aa 
array containing approximately 5000 genes, Ferou and coV 
leagues [76) identified a proliferation related gene duster in 
the cell line that wan unregulated in the mom agpesaive din- 
ical breast tumor r p r rf mnu fig), la a similar study of colon 
cancer based on the AfTymetrU array*. 41 BSTl hoaelogoua 
to ribosomal proteins west found to be unregulated m the 
tumor Ussue pq. In this study, a muscle iadee was used for 
correcting Car the stromal oomponenta as this showed a high 
index m the normal biopsies. Both the breui and the colon 
cancer studio tacked a corretatiea between gene aapwasion 
levels in the cell lines and in the clinical specimens. 

Human athtroscfetwju lesions from artcra ceroti* sampled 
from patients undergoing surgery have also been analyzed 
usmg the Aflymetrii expression array* (461. One important 
finding was a fivefold upregulation of the carry growth n> 
spoaae gene Egr-I, a DNA binding protein that mfluenocs the 
transcription of genes ciuoding growth factors, cyic^nnes. ad- 
hesion molecule*, and proteins related to eoagulaiioa. This 
Hading was corroboraced by immonohistotntmistry and ani- 
mal experiments, aad identified Egr-I as a possible target Gar 
therapeutic intervention. 

2.6.1.2. CHI tines. Several studies based on expression 
monitoring have been performed to human odl lines to areas 
as diverse as cancer {41). ophthalmology |42L and the central 
nervous system HI* Arrays have also been used to study the 
effect of cytokines (interferons) {44), cytomegatovirus Infection 
H5), and oncogene tranafectioo (46] on the ovcraD patterns of 
gent expression. In one study, human foreskin fibroblasts 
were infected with human cytome^tovinn and the expression 
of approximately 6000 genes was monitored for up to 24 h 
(45). A tout of 258 genes was found to be unregulated more 
than four-fold These included HLA-E (uprcgulated six-fold). 



a protein that protects against cytotoxic T-fyophocytcs; ROf 
SSA (52 kOa proton mRNA; unregulated U-fold), a com- 
monly targeted avtoantigea, as well as several components of 
the pathway that produce prostaglandin E2 from aracbidonic 
acid. Id another study, it was shown that treatment of the 
human fibiosaieunu ceU line HT 1080 with IFNs o. 0 and 
7 rcfuttcd to the upstfulation of novel genes implicated in 
anoptosis (RAP46, Bag-I, tcramblasc). while genes Ukc 
IGF-2 and ZdT-3 were strongly downicgulatcd [44|. 

The effect on gene expression of the fusion oncogene PAXJ- 
FKHR transfoocd toto NIH JTJ cells has also been studied 
ustognhmrraysfiol It waa shown that the fusion gene, but 
not the wQd type control was able to activate a myogenic 
transcription program that included induction of a number 
of transcription factors such as MyoD, myogenic SUI, and 
Shig. 

2.6*13. Animal modti systems, A common approach to 
the study of human diseases is to use animal caodd systems, 
This has been dose for a reap of diseases intruding enoepha- 
bmyeropathy, lymphoma, renal tubuli (23], aad lung fibrosis 
(41 In the tarter ease, Atymctrix chips were probed with 
pooled samples obtained from groups of ass isnab to reduce 
varbtinns and cost. Transcripts that were ngnificanUy altered 
in hmg entasis (47) cxlreoc&ular tnatrU sunt inflam- 

matory response genes. Analysis of dusters <w irr?r t nfni these 
genes to a data course experiment with the SPOTFTRE PRO 
3.0 program revealed different temporal pauems of expression 
that farther subdivided these genes. 

2.6J. Toricotozy and drug usilng. From a toxJcofogtcal 
poiai of view, there are great capectanons for esprcasioa caon- 
itoriag as the effect of drugs, both expected aad uu fo mec n 
side effect*, can be. aaoaitotcd to animals aad eveatuaOy to 
humans {4M9). One problem that baa arisen from these 
studies* even to quite aaapk modd systems, ta the often* 
lmrxptainrd chaago to tr ans o ipt levels observed. These 

plea relauonshrp^aoag geaa^pamduets than previously 
thought (49). 

The cembhutton of gene eaescastoa monitoring and testing 
of drugs on odl lines and in animal models boats great prom- 
ise. A recent puahcstiaa ihowed that the variation to the liver 
capseasion of g enre cneodmg sen obi otic eseeafaefissBg en- 
sjymes, ghitatbiene regulators, DMA repair cssynm beat 
shock prouins and housekaepiag genes b larger among indi- 
vidual animals than that mtroduosd by the array assay itself 
fdV). These studies revealed the upregulattoo of cytochromes 
even at low doses of tVaapbthofleveee treatment and showed 
a good correlation between the array and Northern hybrids 
ration date. A leoeet screening using cDNA mJcroarrays of 60 
human cell tinea used by the National Gaecer Institute for 
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Useful Boas on the World Wide Web for array data and software for data analyse 
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|l| bupiftbnpp. rah.gov/Ynnphoais/ data and software 

PI hllp-JAvMxu.oizc^u/roc8rth/toin^a/Ses^e»s^tra) data aad toftwan; 

(6,1 2.17.18) hiip^Ayww^cnome.wimii^arMPR/osta.uiiAtmJ data aad softwoic (CmcOusier) 

(II) blip^Atfvw.rUxom/uxuVpubtmm4l29)Aun data 

(O) btujU/oi«pj?i*d.hamrdxduMttwort^C4Swvcrjr/ fan <iemhs) 

|I41 biip://qnantgujtaefa?&eda/ d a* (neons) 
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drug discovery, has formed tbe basis for estabfisbiog a data- 
base that cao be linked to gece expression data and molecular 
pharmacology |4I,S0). 

2.6.3. AfottaJar classification o/disensex Qaiafitatioo of 
diseases cao generally be achieved by class discovery and class 
prediction |J6). Class discovery refers to the idratificatbo, 
based oj> gene expression, of previously unrecognised sub- 
typo of the disease. If dmicaJ follow-up material is available 
these classes con be related to signs and symptoms, disease 
course, treatment outcome, and mortality. CUss prediction 
refers to the ability to assign a particular patient to an already 
defined class, based on molecular examination of the diseased 
tissue or other specimens. 

Two recent examples have demonstrated the power of ex- 
pression arrays to classify hematological malignancies. In one 
of these studies, leukcmias wen classified as A ML ud AIX» 
respectively, based on SO genes selected from as array carry- 
ing 6S17 genes, to this study, 36 out of 38 patients were 
correctly d ossified, and two were uncertain. The SO genes 
used m the array included some that were known to differ 
between AML and ALL, as wcO as new markers (J6). 

Another study classified B-eeil l ymph o mas into two cootec- 
ularly distinct csaxscs that reflected different stages of B-cefl 
■ duTcrentiation, the germinil center-like and activated H nrll 
like lymphoma groups, respectively. For this study the lyro- 
phocfalp oiictoarfBy from Research OeztetBsv, which holds 
17896 genes, was used. Follow-up for 12 years showed a sig- 
nificant difference in survival among these groups |S1} 

It is envisaged that global surveys of gene expression will 
identify marker genet that, may be used to group patients into 
motncularly relevant categories; these snorters are ospeetsd to 
greatly improve the preonos and power of dinicoJ trials; 

2.7. Condusionj 

There is no doubt that the massive parallel gene expression 
• information generated by taieroarmys wiD have a major em- 
pact in the discovery and undemanding of pattern of ex- 
pressed (pees, lo addition, the technology ts apeeted to gen- 
erate novel and effective target* for drug discovery and 
provide, in combination with proteomies, valuable toob for 
the entire process of drug development and evaluation. 

One of the tenia challenge* we foresee in the future will be 
to solve the problems posed by the analysis, interpretation 
and access to the large amount of mTormaiiea that will be 
generated. Large studies of EffTs have not been published yet 
as dam analysis requires the development of ecw bic*nfor- 
made toob that can deal with the' huge amount of tafoimn* 
tion that is being created. These studies arc expected to Iden- 
tify orw genes of importance to specific biological processes 
and reveal new regulatory pathways through the analysis of 
the expression levels of individual BSTe ia large numbers of 
samples. 

In the future, coupling of express ino monitoring to trans- 
genic animal models may prove to be quite rewarding as the 
global effect of a gene knock-out or knock-in can be moni- 
tored with both microarrays and proteomies tools (10). 

3. Proteomies 

A complementary technology to DNA mierosrrays for 
monitoring gene capression is provided by proteomies, a 
term generally used to encapsulate all of the technology cur- 



rently available to analyze global pattern of gene capression 
at the protein level Proteins are frequently the functional 
molecules and, therefore, the most tikcry to refloat differences 
in gene expression. Genes may be present, they nay bo mu- 
tated, bat they ore not necessarily transcribed. Some messen- 
gers arc tntnseribcd but ool translated, and the number of 
mRNA copies docs not necessarily rcBost the number of fuoc- 
b'onal protein motecsifes (S2J. In addition, proteomies ad- 
dresses problems that caonot be approached by DNA analy- 
sis, sanely, restive abundance of the protein product, post* 
uonxtstional roodificajtion, subcellular tooalLmttoo, turnover, 
mteractioo with other proteins as well as functional aspects. 

The pivtcome has been defined by WBkxas asd colleagues 
as the complete set of presents encoded by the genome (S3} 
and icccoUy, the term has been broadened to include the set 
of protcms expressed both in space and tfassa. There are two 
main approaches to protectories: one b the expression model 
in which all proteins are analyzed, and the other b the cell 
msp model io which only a selected set of proteins. Eke con> 
plexcs and organdies, esc studied (54], 

The protecmuc technology b complex, and comprises a 
plethora of state of the an trrhnfqura to resolve (high resn- 
hitioB rwe*4u3iccmoeal geb), quantltite (phospttorsnsagsT, 
special scanners), identify ancVeha nactc riie protsizis (mmrose* 
quersdng, mass spectrometry), as weO as to store (tweHfinscn- 
atonal pc^yaorytantxde gal csectrepncecsn (2D PAGE) dam- 
bases; btip Jfbfobuttdkfc&k&BtaMM; http :flcxnsjy .hropjch/ 
tprot/iprut'top^tsnS) eoBununicate ssd nterhsk protein and 
DNA sequeBoe and mapping tn/onnsdoo (biomfotmstics) 
(10,124MB) 

The wrfumns of the 2D PAGE technique for ktrge-scale 
proteomic projects depends very touch on the number of pro* 
Jews that can be resolved m a complex protein mixture, for 
example a human eeO. rroteonsc pronto data from a few lab- 
oiatoiics. I n c hidi ng one of out own, have fa dies ted thai onJy 
a fraction of the human genes are swftchtar^o fas a given cell 
type and extensive aaalyxb of weiote oc0« tracts* orgsmcOes as 
wcO as partially purified subcellular fractions, ""If*** that 
individual odb may not express snore than 6000 primary 
tronsbttton products (J9J. To Ibis eocmber one has to add 

wo»w/ pw0t*CfwUlSetwl tfOwawaf ^VOCxwS&fl^ wxsVtf Cssa^CwDwSas^I 9UO&SfiEtiO&$S& 

(phospnoiytstion, g9ycnsybttton, densthytatioo, acatybzion, 
myristoybtioss, pahoitoybiioo, sulfation, ubiquination, etc), 
the latter being rather common and extensive in many pro- 
teins (60,61). Thus, as we stand today the 2D FAOB technol- 
ogy b ool ebb to resolve ood depict fa a single gel aS of the 
protases thought to be present in a caaimtraliao cefl. 

Currently, there b a groat deal of interest fa proteomies as 
appbeauoos of 0u> teehnology ate expected to reveal gene 
regulation events involved fa disease progression as well as 
polsotia) targets for drug diseovery and diagnostics. More* 
over, the teehnology b bound to have a great impact in agri- 
culture, tosicology and the industry in general. 

J./. Ptotitn aparetlttm: the 2D PACE tedunlogy 

For the past 26 years, high resolution 2D FACE has been 
the technique of choice for analysing the protein composition 
of celh, tissues and fluids, as well as for studying changes fa 
global patterns of gene expression dtritcd by a wide' array of 
effectors (12,62-64), The technique, which was originally de- 
scribed by O'ForrcD (69,661 and Klose (67J, separates proteins 
both in terms of their isoelectric point (pi) and nromcular 
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Fig, 1 A and 8: IEF 2D grfft of BOMBttnntf (A) and SVIQ trsnifoMWl (B) I 
Muni quantitation (sciatiOitioo coontins) of tone of the protons showa b (A) and (B). I 



i of (he ptb an altera. C: 



India led «itb htus arc cowrorguialfld, white I 

weight end provides the highest resolution for protein snary- 
fit. Usually, one chooses a condition of interest, for nsmpTr 
the addition of scrum to noo-dtffereotiated human keratino- 
cytes. or compare normal aod tfansfonned odb (Pit. 2A,8), 
aod let the ecus reveal the global protein response as aD dc- 
lected protons coo be analyzed both qualitatively (posMraoa- 
tational modification) and quantitatively (relative abuodaocc, 
co-ordinated expression. Fig, 2C) in relation to each other 
Q53,6g) end references therein; see also htrp:0Wobase*dk/ 
cghbin/eeUs). 

For many yean the 2D PAGE technology relied oa the uae 
of earner ampholytes (amphoteric corn pounds) to establish 
the pH gradient, but this technique has proven to be difficult 
because of the lack of reproducibility created by uncontrol- 
lable variations in the batches of ampholytes used to generate 
the pH gradients. Lately, however, with the introduction of 
immobilized pH gradients (IPOs) (69,70). which arc integral 
part of the polyacrylomide mama, it has bees possible to 
obtain focusing patterns that' can be easily reproduced by 
the non-expert. IPCs avoid some of the problems associated 



. From Cc&s end Oban |6n) 

with carrier ampholytes tab as enthorlrt drift and cp rfo eai o - 
sis* allow a higher loading capacity for iniu upreuai atrvs runs, 
end provide increaacd charge resolution when narrow plf gin* 
dicnts (OjQJ pH unities) are toed <f70.7l] asd references 
thcresnji In our bands* however, earner aenpbotyut (3.5-10, 
Fig. 1A) and broad rangt IFQs (Fig, JB) resolve sbattar num- 
ber of l^oxtbiontnc bbcQcd p o lyp eptides (about 250Q) to 
dhisuattd with the ceparatioD of wools protein calrecti from 
tabeUed human teattnocytes f72|> It has beco piupoud thai 
narrow range, overlapping IPO gTtdieots viewed side-by-ride 
may provide a solution to the problem of resolving and de- 
picting the proteome of a given cell type. Recently, however, 
Oorthals and co-workers found this sotutmn unrealistic, as tl 
win require the running of a huge number of gtb pi] 

Very basis polypeptides have proven difficult to resolve, 
although both carrier ampholytes (non-equilibrium pH pa- 
dieot eiectrophoresiSfc NEPHOE) (6e\73| and IPOs (9-12 and 
4-12) <p0] and refercn.oes therein) have bota showo to sepa- 
ratt basic proteins. 

One of the moss important steps io the 2D PAOE techno!- 
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ogjr ceoorrns sample preparation as very often tone proteins 
cannot be properly dissolved by ibe r/sb tohrtion originally 
developed by OTimll (65). Thus, there b prosing end to 
develop protocols for optiroazag ssmplc tofebflaatioo. To- 
wards this aim, Rabilloud and co-worfcera have made use of 
tbc high loading capacity of IPOs to resolve fmmbranj pro- 
teins for structural analysts, and fo doteg so have baproved 
their solubility by using a combmation of detergents sad cfaa* 
o tropes f74,75). It baa bora shown that the addidoa of thio- 
urea, CHAPS aod rulTobctaia surfactants to the lysis solution 
eoataioing urea results id a mush improved setubiluaitioo aa 
wcO as transfer to the second dtoensfon SDS get As far as 
audcar protests ate conacre ed, Oorg aod cotkaguca have 
improved considerebly the separation of very bask proteins 
by 6m precipitating the samples with acetone prior to eehibJ* 
fixation u> the lysis solution [70). The problems associated 
with the estractioa of tissue samples, on the other hand, art 
much more complex and ha we not bceo addressed yet us e 
■ysicmatic fashion. 

3.1.1. Detection An important limitation of the 2D 
PAGB technology b the lack of very seosroVe procedures to 
detect those proteins that are present fa vary low abundance. 
In addition, detection procedures are Beaded that eaa be ap- 
plied to a large number of resolved proteins whose abundance 
may span through seven or eight orders of nsagaJtude. 
Clearly, the iemilrvily of sOvcr nitrate aod Coornasstc Blue 
naming is inadequate, aod ooJy ractsbobc labelling with spe- 
cific isotopes may reveal enough proteins to warrant proteo- 
ouc projtcu. Furthermore, the use of pherphor-iroagMg tech- 
oology may enhance the sensitivity and linearity of detection. 
Limitations of the radiolabdliag approach include 0) tack of 
labelling of some proteins due to low turnover, 01) problems 
associated .with safety regulations and disposal and (uQ diffi- 
culties in obtaining fresh human biopsy material for rabeJIiog 
experiments. Ideally, one would like to have a highly sensitive 
fluoitsneacc- based protein detection technique able to support 
all types of audio irrespective of the sample or the cad point 
of the analysis. Preferably, (he dye should not alter the mo- 
lecular weight and pi of the proteins if it b to be added prior 
to electrophoresis, and should support quantitative studies 



mvoMng protein* having extreme difTercnocs in their copy 
numbers. Unfortunately, bo such ideal dye b available on' 
the narket yet, aJUsoogb Oifortf GrycoSdeocc has developed 
fluoxesonst IPO^PAOB fhtfp ?/hvww.ogs eonVpiotcome/ 
bxnn*\bnaj), a technology not available to the scrrntalc com- - 
muniry. Riaresccnce compounds such as STFRO Orange, 
SYPRO Red and SYPRO Ruby have been used to analyse 
whole protean rysates from bacterial and taammabaa cells* but 
then- sensitivity (1-2 ng) u slightly lower than that of silver 
nitrate (76,77]. Some advantages over sfher naming mctude 
short staining base and the fact tlut the gets do oot need Co be 

fixed prior to ttafamg. lo addition. Bale or no *«*»**g b 

■ ■ 
roqujteov 

For low ahondahco proteias of known identity, d stne siou 
does ooi pose a problem as Ceus and eo woikcrs have shown 
that ID'PAOB imraunobtotting in combfaattoa wfth cn- 
haaood ehemftrmincsOEacc (ECL) can detect as Ettas as I0D- 
J00 nwlrcBfrs per oeQ hi unfracu'onaicd oeBubv extracts (60). 

$JX Qmuttaitan. Even though there are several toob 
available for the quantitation of protein spots, there b at 
present no available procedure for cruantiteting oD of the pro- 
tctes resolved in a complex mixture. Part of the problem ha 
fa the huge dynamic range of pro tens expression, lack of res- 
olution, post-trauslatioaal modifications, staining behavior of - 
the proteins, as wcD as ia the tact that ntany aoupoaat pro* 
tcfas streak over sen abundant components interfering with 
the rneaiujcmcnta. At present, fluorescent technology seems to 
be way ahead; as with the fluortsocnee stain Sypro Ruby 
there b a bacar response with respect to the sample amount 
ova a wide range of abtmdance Q77| aad rtferetxecs therein). 
Quantitative mioresBeaoe measurements cars be performed 
with CCD-camera based systems as well as with laser scanner 
systems Q77J end references therein}. Id some cases, rodrokv 
belling m combination with scintillation counting oners o rea- 
sonable alternative for quaatitatfag a smeU oumber of pro- 
reins [eel 

3 J J. idtntffimtlon. Method! of proton identification 
have included onmuaoblotang (78,791, Ed man peptide se- 
quencing 480,81) and references thereto), amino acid compo- 
sition P&831, and more recently the use of matrix •assisted 
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Fig. 4. Fmetioa of the outer rynlbetk iaug* of 
Web Cmpttrtotaw rfk/cp*ja/cefiQ. reroutes Ocggvd with • red 
obtain • OH covtoiaiag infoimeuoo •boot the protein as «dl u 



separated by IBP 3D FAOB a detected b (he Warn! Wide 
10 knows pntdni. By c&omig «o any spot It b passible lo 



tint* la other die* Id (ho World Wide Web. 



burr daorptionfiooxntioo (MALDI) man ipeuiuoairy (84| 
and clectrojprey tonaaboD (ESI) 1*5). The totter tochniquca, 
which rely on the comparison of peptide man nogerprintf, 
fast and require only pieomol amounts of proteins, limited 
peptide sequencing can be pcrforned using tandem mas* spec- 
trometry or poat-sourae decay, but h it b stOl ool possible to 
make N» and Olermmal itquenoe Uentificadon. For a review 
on Edmao peptide leejumcxng sec reference (81). Andersen 
and Mano review protein identifleation by buss apeaarcmcciy 
elsewhere to this issue. 

JJ. Making wmprrhaxrite ID PACE databatfi 

Advances is hardware and software dmlopsoent (Elsie, 
OcBib, Metanie, Quest. Tycho aod Kepler) for scanning 
and image analysis of 2D gels, as weD u the development 
of sensitive toob for rapid protein identification. hav« cata- 
lyzed the establishment of comprehensive proteomie 2D 
PAOE databases (64,86). These databases aim at faterfacing 
protein information with forthcoming DMA mapping and se- 
quence data from genome projects, and offer a global ap- 
proach to the study of gene expression both m health and 
disease (bnp*y/biobase^k/cgi-bin/oclii; htip^/etpasy.hcugexh/ 
sprol/sprot-top.htal) Q 1 0.55) and references therein). Besides 
aaiiotating genomes, these databases are expected to address 
probtems thai cannot be approached by DMA analysis, 
namely, rcbtivc abundance of the protein product, pnsMrans- 
btional modifieationt, subcellular localization, turnover, in- 
unction with other proteins as well as functional aspects. 
The first prototype databases were built by Cefis and Bravo 
(87), who pioneered the use of protein identifleation tech- 
niques to establish comprehensive 2D PAOE databases (88). 



Fig, 4 shows a region of the synthetic coaster 2D PAOB 
image fiscctectrfc focosiag, IEF) of ( iy S^nrthiooinc tsbcOed 
proteins from fresh ooDHrultorcd human kcrntmocytes as da* 
pietad to the World Wide Web (http:/7Woba«attcs>*W 
oehs). Pruttuu nagged with a red cross correspond to known 
pcJypcpinies that have ban tacensaed by erne or a esmsbimv 
tion of teehnlqacs that include (i) 2D gel imrnumiMotcmg 
Kseific antibodies and the ECL detection prooednrc, 
nicseaequceKana; of Coomaszfe toucans Bhre stained pro* 
i (89}, (tii) mass specuomctry of tryptie peptides (90) and 
frv) romigrafton with known human proteins tfodividual pro- 
teins and ergsmeSe components) and (v) overlay trrhnlqirra 
|59J and (vi) traaaicnt aprccdon in aamasauaa oelb (91). 

To date, 1237 polypcpddes have bean identified b tab data- 
base (IEF and NEPHOB) of the 9159 that have been resolved 
and catalogued. In addition to p'Spnetaioasns labelled pro- 
teins* the database coo tains a Tew polypeptides that lack me- 
thionine, bat are revealed by eDver sJaiaUig, Cbomejsie bri% 
bant Blue or by bbtflmg with n mbture of 16 ("Citato© 
adds. It should be stressed that some protein* tartrate both 
in IEF and NEPHOB gab (for example, OHcnoIase, triose- 
phosphatc uomerase and cungetion factor 2) and they may 
serve as tandmarks to aGgn the gets and as references to nor* 
ma! be quantitations (* both pH d rations. 

Information gathered on any given polypeptide, known or 
unknown, can be retrieved by clicking on the ccraspondmg 
spot, » this case 14-2-3 slgsns, also known as strecmo (Foj. 4) 
(92). A file containing at) of the information enured for thb 
particular protein, mostly obtained from experiments per- 
formed in our Isboraiory, Is shown in Fig, 5 (only a fraction 
of the file is presented). Files for known proteins contain links 
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Fig, 3. Fito for u* 14-)-} rignw protein «vwfcfate bi tht Internet 
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to a subset of Medline (bnpy/wwwjicbLoJmjdb^ov/ 
PubMed/), Swiss-Proi (hltp://ex|niyJioi£)t.ch/tprol/iprc(*iop. 
html) astf PDB Cbttp^/wwv^bl^ftdddbcrfjd^bdbOi Other 
ticks include OM1M (http;//www nrbi nhn nih girW OmunA 
OeneCard* QiHp^/bioinfoTmatiq.wnnninn or itfcards). 
Gene (http^/ww^bijito.aib<OT^BiOf«/indni html) and 
other Web rites such at CySPH) (cytosnxlctat protein dale- 
baic; btip y/pariU , mad, yalc^ paaw/cytoAdb/lodga. 
btml), metabolic pathways (compiled by K£00; hnp;0 
www.fjoiDaie.adjp/kcggO, tbc cytokine explorer (JbnpJI 
kbounifjnlisoiuixdu :443/qnokincs/caplorcr.btDlX histology 
Images (hltp7/biosun.biobuc. ds/«» pdl^c aaJb/ i aihr ograpto. 
btmf), etc. lo the future, aa sew databases sad rented Web 
sites become available, h wil) be possible to navigate through- 
out various databases containing ooeaptceocpury Infoftnatjon 
(U. ouddc acid aad protein sequence, genome tnepprng, die- 
casa, protein tfraeturc; pott-rranitatioaaS modiftcaUoaa, aaoV 
bodies, signalling pathways* histology, etc)* Clearly, data* 
ban allow easy access to a targe body of data: ooec a 
protein is identified m gives databases, all of the information 
aecuxDutatcd can be easily i ctiitvc d and node avaihbte to the 
researcher. 

Today, 188 Information categories arc available io the 
WorM Wide Web version of the keraffeoeytc database. These 
include cellular localization, pathways, proteins affected in ' 
psoriatic kcratinocyto, proteins expressed in normal urotbe- 
bum, kemtinocytes, fibroblast*. He La cells and bladder card- 
nooia ccD line RT4, levels in fetal human tissues, partial ami- 
no acid sequences, abundance, cytoskelctal pro tens, calcium* 
binding proteins, innrafais. chapcronina* heat shock proteins* 
etc. 

Functions to query the databases include search by eame. 



> or keywords (Fig. 4), coolecular weight and 
pi, a* wed as by organelle or ccOuav cenrpontnt. By dkkinj 
oa any of ihe orgattcflcsv ceOular struct urea or components it 
b possible to set a protein fist as weO as then relative posi- 
tions on the master image to addition, one can retrieve n Bit 
of all knows proteins recorded to the detainee Moreover, we 
provide pfotoooh) asd videos) of preparative steps thai con be 
used to reproduce the data displayed as well as a gallery of 2D 
gen and bnmunoUota. 

Aarbus 2D PAOB databases available at h tip UVbictbase.dk/ 
cg^uVcefis tetfude transitional oeU card nomas (TCCs), eqoa* 
mous oeU carcinomas (STXbX urine, fibroblasts* and mouse 
kidney cells. Other databases avaUabte In the Internet can be 
found fa fanp:/tapnsy±c*ifexh/^^ bUpM 

mmmm ka—^yyM ■^f» fl T|^ ^ ^y )g ft»fciKJ fr,«*l*»iy and bttp://USCr* 

psg&ehemieJu-c«^^ Also, Severn] 

proteosnic tools for pro tea hfrntification and characterise- 
tioo, primary structure analysts, second ary structure prcdh> 
b'oa, ternary atraeture and OKA transladoo into protein are 
avaisable at the EXPASY proteonuc server (brm://www J cipa- 
sydvVwwftools). 

SJ. Application* 

To date, there have bees thousands or reports illustrating 
the usefulness of the 2D PAGE and proteomic technologies fa 
many areas of biology, ft—* y of spaee IsmiiatSons* however, 
only a few of Ihe application ens are highlighted below: 

3 J J. Omar. A great deal of research has been devoted 
to the elucidation of the pathways that control cell prolifer- 
ation m normal ocUs, and hence, the detcreoinadon of the 
means by which attentions of these pathways lead to abnor- 
mal growth characteristics and/or neoplastic u^nsfcrnnauon 
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and cancer. Most studies have focused on oncogenes, tumor 
suppressors, cell cycle regulated proteins and signal transduc- 
tion molecules to various cell types of different species 193-96), 
but only in a few casts there have bets systematic attempts to 
analyze the protein pbesotype of pairs of normal, and trans- 
formed cell types using a protcomic appToaeb [60j68,°V99|. 
So far, only very few studies have made use of biopsy material 
due to problems related to the cell heterogeneity. 

Already ia 1982, Cclis and co-workers started a protcomic 
approach to the study ofctU transformation using doaed cell 
. tines (68,97,99). Their results shoved thai transformation re- 
suited in tbe abnormal upmdon of norma) acnes, rather 
than in the expression of new ones [9Q. la addition, their 
studies raised a word of caution concerning the widespread 
use of protein information derived from studies of different 
cell types from various species. Today, wc ait.wefl aware that 
cultured eclbr undergo important changes when ptaeed in cul- 
ture due to different environmental factors and growth con- 
ditions 1 1 00] aad accordingly, current efforts using the proton- 
mie approach are being directed to the study of ooo -cultured 
cells and/or tosue biopsies. Among the cancer prefects cur* 
rcotry undcrwoy, those centered on truamaia and henmtolog- 
ical malign sodes, breast cancer, colorectal cancer aad bladder 
cancer are briefly mentioned below, 

J.IU. Uidbanb art htmatclogicrt mahgnmckA Studies 
by Haaash aad colleagues on childhood leukemia and other 
hematological malignancies have yielded so far several 
markers that include Op 1 8. also known as staJhmin, an 
oncoprotein that has been usp&cated in signal transduction 
(101,107). lo childhood leukemia, phosphorylation of OpIS 
was shown to correlate with a high oontent of cells m the S- 
#uue suggestiag a role in piouYcretion. Tbe group aho 
identified nm23-HI (nucleoside diphosphate kinase AX a 19 
fcDs protein that b unregulated m normal lymphocytes 
treated with mitogens as well as to leukemia cells from 
patients with acuta teukenua (10}). 

3.3.3 J. Brtsut cancer. Systematic studies of dinkxJ breast 
tumors of different histopawologital types by Framen aad 
co-workera (104-106) have revealed several protein, including 
PCNA, hjp60, hsp90 and caJresJcaun that are highly deregu- 
lated in invasive carcinomas and that may serve as prognostic 
markers. These studies have made urn of fresh din teal rumor • 
tissues of different subtypes and have paid special attention to 
sample preparation. 

3.3 J.l CotortctaJ cancer. Studies of Jungbhit and cot- 
leagues 1 1 07-109) on sets of eucrosooprcafly ourmal colon 
mucosa and colorectal carcinomas have revealed several pro- 
teins that arc deregulated in the tumors. Downregubted pro- • 
tcins included the liver fatty add binding protean, the smooth 
muscle protein 22-a, and cydocaygenase 2. Ue^fulatc4 pro- 
teins iaduded the heat shock protein 70 as wed as several 
members of the 5-100 family of cahiunvbinding proteins (S- 
I0OA9. S-100 A8, SI00A1 1 and S-I00A6). Some of these find- 
iags have been confirmed by unsunohiitcchcmical studia 
(I0t> 

3.3.1.4. Bladder can err. Gclb and colleagues (110-112) 
have explored the possibility of using prottomc expression 
profiles of bladder tumors as fingerprints to subctsssify hiitn- 
palhologieal types, and as a starting point for searching for 
protein mark en that may form the basis for diagnosis, prog* 
nosis and treatment. To achieve these goals they have ana- 
lyzed the proteome expression profiles of hundreds of fresh 



tumors as wcO as random biopsies and cystectomies |l 10-1 1 2 J, 
and have established TCC and SOC protcomic databases that 
may provide • solid orfraslructu/t to support future studies 
0113); htrp://biob^.dVcs>bbVotJis). Ia the long run, a prac- 
tical goal of these studio is to identify a complete set of 
protein biomarkcrs that may be useful to classify hiaopatho* 
logjcal grades, and that will provide with specific probes for 
the objective diagnosis, prognosis aad treatment of these te- 
nons. So far, these studio have revealed markers for TCC 
prtfression (M0J, a marker m the urine or patients bearing 
SCCS MKIIS), and have led to the development of a novel 
strategy for identifying piemahgnsm squamous lesions |l 12). 
The approach makes use first of protcomic technologies to 
reveal a nd ide ntify proteins- that are differentially accessed 
in pure SCCe and normal urothehum. ThereaAer, specific 
utibocScs against the Differentially expressed proteins are 
toed to immunosiain aerial cryostat sections of biopsies flnv 
smiaowalkmg) obtninad from SOC pafenta that have under- 
gone removal of the- boulder due to invasive disease (cystec- 
tomy) Smee bladder cancer b a field disease (1.161 * Out b 
targe part of the bladder lining is at risk of developing disease 
-•to rrpwvd that the urotheCum of these patients easy 
exhibit a spectrum of abaoftnafiu'oi ranging from metaidassa 
to mvesive disease.. 

• J.JL2. Mean dbeans. Heart failure is among the hading 
causes of mortality to the Western Hemisphere and therefore, 
efforts are being devoted to the elucidation of the molecular 
events feeding to cardiac dy*Juncdeo QI17) and rcfercncei 
thereta). So far, research on dilated cardiomyopathy (DCM> 
(117-119), has revealed that approximately 100 proteins are 
deregulated; mostly downrcguUted, in DCM as compared to 
their normal ceimtcrparts. These induce cytoskcfctal and my* 
ofibrfflar proteins, polypeptides associated with tmtochondria 
and involved in energy production, as well as proteins asso- 
ciated with the stress response. These studies have been expe- 
dited by the establishment of protcomic 2D PAOB databases 
of the human heart (ventricle and atrium) (1207 

la addition to the global analysts of protein expression 
patterns in human heart rihraen, cardiac antigen aprcasion 
Mowing cardiac transportation has also been studied using 
trctnlquo from psvteomha. for cremplr, using 2D PAOB 
(cardiac proteins) in combination with Western immurjoWot- 
tlag (patient sera), h has bees possxbk to identify antigens 
that react with eutoantibedies present both ia DCM 
(121,1221 «Bd tnyocasdhss (121). la this way, antigens associ- 
ated with the aodbody response that may be involved la acuta 
or chronic organ rejection have been characterised. 

Proteomio studies using aaimal models of heart disease 
have also been carried out as aa-effbn to unravel the moleo- 
ular events lading to ca^isc disease. Recently, two different 
atttdiea mvotvmg large animals, paec^duccd heart failure in 
the dog (124) and bovine DCM (1251 raided ***lMt results as 
those observed In human DCM. loierestingjy. the most strik- 
ing result in the bovine OOA study was the finding of a 
seven-fold decrease in the expression revel of ubiquitin C-tcr- 
rnina) hydrolase, as inappropriate ubiquination of proteins 
has been suggested as an ctiologsc factor ia heart failure (126). 

3.3.3. Tudtotogy. Changes m the environment as well as 
the growing mtereat of the pharmaceutical industry have 
stimulated the development of novel testing approaches based 
cm the recent technical advances both bi genomics and pro- 
teomio. Pioneering studies by Anderson and Anderson (127) 
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have highlighted the uwfVitoco and potential of the proteomic 
approach to identify quantitative rhangn fo rat liver opto- 
rioo profiles nssnoiafrd with toxicity of drugs and other xeno- 
biou'es. The data, which are being systematically stored in the 
rodent molecular effects database, are expected to yield Im- 
portant information as to the molecular rnechanlnns under* 
lying toxic responses, likewise, the potential 'of protoomtss 
have beta roocnUy exemplified in studies of glomerular ncph* 
rotounty ia rats J 128], and of stimulated occopational jet Aid 
exposure ia mice lung [119). to particular, the studiea of Stein- 
cr*s group have shown a remarkable correlation between de- 
creased levels of calbiadin D-28, urinaxy calcium wasting ia 
the urine, and intra tubular cortinmedutlary cakificatfom in 
the kidney of rats and human treated with cyclosporin A 
l»30J. 

In the future, proteomlci hi a high-throughput mode is 
expected to have a major impact in the pre~diokaJ safety 
luting of drugs, these studiea will be facilitated by the estab- 
lishment of 2D PAGE databaaa of f request target tissues 
(kidney, liver) a* well as of .cell Unci and fluids. 

J.J.tf. Natrvtogitat disorders. The Creuufcld-Jaeob dis- 
ease (CJD) has been the subject of intensive analysb using 
protoonua. These studies have ted to the Identification of 



two members of the 14-3-3 family of proteins bs the cerebral 
spinal fluid (CSF) of CJD patients (131). The presence of these 
protein in the CSF has been used to differentiate CJD from 
other dementia both with high sensitivity and specificity 
(132,133). These proteins, however, are present in the CSF 
of patients suffering from other neurotogJcaJ chsctfders not 
revolving dementia, limiting its cUnical value (134,135). 

3.4. Contusions 

. Today, there b no technology in sight that matches the 
resolrmg power of 2D PAGE, a technique that will continue 
to enjoy a central position in proteomlc projects for some time 
in the near future There is considerable room for improve- 
ment, however, in particular as Car as sample preparation and 
solubility, choice of pH giudxcat and detection methods ore 
concerned. Alio, we need to improve the separation of very 
bisk as well as very low and high molecular weight porjrpep- 
tsdes. fai general, one expects resxaiehers to first use wide IPO 
gradicau to obtaio an overview of the proteomc profiles, and 
then proceed with a more demited analysis using narrow pH 
gradients, which provide higher resolution and sample load- 
ing, thus increasing the possibility of vumalsong the fester 
abundant proteins. The latter can he fecHitsted by the avail- 
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ability of specific antibodies, as well as by the use of titrac- 
lioo procedures and subcellular fractionation oittbods cur- 
rently at band (136-138). 

There arc iu)l maoy additional challenges, however, thai 
must be addressed before a complete Hunan Frotcome Pn>> 
eel can be implemented ||J9J. These iaeiude: automation to 
allow higb-ihfoughpm sample analysis (1401 improved quan- 
titation capabilities, better instrumentation and software for 
peptide lequenong using mass spectrometry, more sophbti- 
cated image analysis systems to support gel entoparuens 
and databasiag as weD as improved btotoformatic capabilities 
overall |IJJ. In addition, we seed to deal with the problem of 
tissue ccD heterogeneity as more and more protcomic projects 
will make use of biopsy materia] in the future, 

4L Transcript and protein levels: DNA nucroajrayi asd 
protmnics applied to the same samples 

As mentioned io Section I, both DNA caicroarreys and 
proteomia am complcsncntary tecbaotogfea. To date, bow- 
ever, there have been only a limited number of studies In 
-which both technologies have been compared by. emptying 
them to the same sample {141,1421. Notably, the pioneer stud* 
us of Anderson and Seilhamer (S2) showed that there b not a 
good correlation between mRNA and protein levels io human' 
liver, implying thai geoe-bascd cxpmaioo data may be of 
limited value m the process of drag doonvery. The study, 
which compared the levels of 19 gene products, yielded a 
correlation coefficient of 0.48 between mRNA snd protein 
abundance, a value that fa half way between perfect and oo 
conetatioa. 

Recently, 0rnxeft ct al. (manuscript in preparation) carried 
out a mzcroarray and protcomic study of bladder cancer In 
which they compared the transcript and protein caarcsaion 
levels of pain or non-uvasive aad invasive low grade fresh 
TOCs. Even though they onuM only compart the levels of 
about 40 wcJ^ruotved and focused abundant proteins, it 
was clear that in most cases there was a good correlation 
between traoscript and protein levels. Only in a few cases 
.they found discrepancies, and in some of thorn instances- 
they could oot eliminate the possibility that this was due to 
messenger stability, post-transcriptiona! cplirang, pc*t-cransav» 
tional modifications, protein focusing problems, degradation, 
as well as the choice of methods used to asses protein cm* 
predion levels (staiaiag versus radiolabclliog). For example; 
in ooc tumor pair they found that the levels of keratin 8 
transcripts were much higher in the invasive tumor (compare 
Fig. 6C and O), while the protein levels were much lower 
(compare Fig. 6A and B). Inununoblotting analysis uamg ker- 
atin 8 specific antibodies revealed that Ike discrepancy waa 
due to degradation, as several related products of tower ap- 
parent molecular weights and more acidic p/s, could be visu- 
alised (Tig. 6E). The identity of one of the croxntactieg 
peptides (indicated with arrows in Fig. 6E) to fceratm 8 
was further unarmed by MALDl-tune^f-fiight (IDF) (Fig. 
6F)- From these studies il was dear that when comparing 
mRNA snd protein levels there arc other 'factors that 
need to be taken into consideration when interpreting the 
data. 

When comparing transcripts and protein expression profiles 
of matched sample pairs one often gets the impression that 
there ore more changes in the abundance of the mRNA tran-> 



scripts as am pared to the proteins. Considering that the cur- 
rent 2D PAGE technology depicts mainly the more abundant 
proteins, it would seem possible thai most of lot changes 
affecting protein levels may involve low abundance polypep- 
tides. 

&. Gene tiprenfaro profOisg recJanianss: perspectives 

Novel and powerful techniques are now available to analyze 
the global gene expression pattens of cultured cells and uV 
tuea obtained from normal and diseased simjaets. Each of 
these icohnotogiea baa ita own advantages and limitations, 
but in combination they should provide us wftb a detailed 
gene caprcasioa phenotype at both the transcription and 
translation level. A major challenge in Urn near future will 
be to define a base kmc for Urn normal gene capression pho> 
aatype of a given oeO type, tissue or body Buhl Thb is not a 
trivial task, however, as it wOl require the aaalysii of hun- 
dreds or even thousands of samples. 

Besides improvements oo the individual frrytyt— them- 
scfvca (see above), there are etui major rrnittttfoni that must 
be sddrcsscd before these technologies can provide the 
expected outcome in arotecuJar medicine. These m etude: 
0) tirhTtirnl problems associated with the analysis of expres- 
sion profiles dertved from tissues that are composed of differ* 
em oeU types, (5) sack of procedures ror identifying targets 
that lie in the pathway of disease, aad (B) need for Worn/or* ' 
rnabxa tools for rapidly assessing the function of the putative 
targets. The latter, is of paramount itapoftanoe to the phar- 
maorutical udustry as the vientification of disease deregulated 
targets aloat is oot snfbaeat to start a costly drug screening 
process. 

As tar as tissue heterogssscity it ooncmed, the recent ad- 
vent of LCM holds great promise as with tbss technique it is 
possible to folate specific populations ef otto from a tissue 
section using direct microscopic observation (29). However, 
even though the technique has been used for UNA analysis 
it b stiD not ready for roost p ro t eomie projects as the cratnber 
ef calls that can be obtained is too amaD to gesserate taaaon* ' 
able protein profiles to terms of the number of proteins that 
can be visualized f 143). 

As DNA miri oarraya and psoteamsice gen ci ate more data hi 
the future it will become a matter of priority to develop simple 
and rapid strategics to validate the. vast amount of informa- 
tion that wffl be generated, particularly in tissue bJopxJca. Tbis 
we believe can be aoB OmpUihcd bs part by making use of 
specific antibodies in combination wufrniminnoc^ochcniirtry 
(1.12). At present, there is no technology at band that may 
allow us to prepare antibodies at will, aMmugb phage anti» . 
body libraries |I4) show much promise. 

FiaaDy,. we would Hke to emphasise that biology In this 
Mfflennium wiO be rtinrarrrrired by the study of comptca 
biological phenomena. DNA rnicroariays and protcomies 
are just some of the technologies of functional genomics, 
and only their integration may a Dow us to tackle the great 
cdmptcsJty underlying biological processes. 

4dtnowtrdf<mt*u: We wouM fikc to thank H. Wolf for many ttiov 
bIsUiuj donations. Tbii work «as ■vpoortcd by grants from tac 
Dasim Cancel Society, tnt Danish Bimechnology Programme. ins 
lohn and Bifibc Meyer Foaadatlon and the Motacubv OcTcnladogy 
CttUi. 
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Amplification and ovcrexpression of putative oncogenes 
confer growth advantages for tumor development. We 
used a functional genomic approach that integrated 
simultaneous genomic and transcript microarray, proteo- 
mics, and tissue microarray analyses to directly identify 
putative oncogenes in lung adenocarcinoma. We first 
identified 183 genes with increases in both genomic copy 
number and transcript in six lung adenocarcinoma cell 
lines. Next, we used two-dimensional polyacrylamide gel 
electrophoresis and mass spectrometry to identify 42 
proteias that were overexpressed in the cancer cells 
relative to normal cells. Comparing the 183 genes with 
the 42 proteins, we identified four genes - PRDXI, 
EEFJA2, CALR, and KC1P-I - in which elevated protein 
expression correlated with both increased DNA copy 
number and increased transcript levels (all r > 0.84, two- 
sided / > <0.05). These findings were validated by South- 
ern, Northern, and Western blotting. Specific inhibition of 
EE-FIA2 and KCIP-I expression with siRNA in the four 
cell lines tested suppressed proliferation and induced 
apoptosis. Parallel fluorescence in situ hybridization and 
immunohistochemical analyses of EEFJA2 and KCIP-I in 
tissue microarrays from patients with lung adenocarcinoma 
showed that gene amplification was associated with high 
protein expression for hnth gj>m>s anri rh^t p rotein 
overexpression was related to tumor grade, disease stage, 
Ki-67 expression, and a shorter survival of patients. The 
amplification of EEFIA2 and KCIP-I and the presence of 
overexpressed protein in tumor samples strongly suggest 
that these genes could be oncogenes and hence potential 
targets for diagnosis and therapy in lung adenocarcinoma. 
Oncogene (2006) 25, 2628-2635. doi: 1 0. 1 038/sj.onc. 1 209289; 
published online 12 December 2005 
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Introduction 

In lung adenocarcinoma, as in other types of cancer, 
gene amplification and the consequent overexpression of 
the amplified oncogene play an important role in the 
development of tumors, because their overexpression 
confers a growth advantage. The ability to identify 
putative oncogenes that are activated during tumorigen- 
esis could facilitate the choice of molecular genetic 
targets for diagnosis and therapy of the disease. This 
concept has been exemplified by HER-2, which was first 
found to be amplified in neuroblastomas and subse- 
quently shown to be associated with poor prognosis in 
breast cancer (Ross and Fletcher, 1999). Now, FJER-2 
aberrations are used as a predictor of response to 
therapy, and treatment of HER-2-positive breast cancer 
with the monoclonal anti-HER-2 antibody trastuzumab 
has been shown to improve prognosis (Ross and 
Fletcher, 1999). Emerging evidence of common ampli- 
cons in lung adenocarcinomas (Luk et aL, 2001; Jiang 
et al, 2004; Tonon ei al„ 2005) suggests that additional 
oncogenes remain to be identified; however, conven- 
tional techniques are ineffective in pinpointing such 
oncogenes. Parallel measurement of DNA copy number 
a n d mRNA l evel s i n cDNA microarray s perm i t s 
changes in copy number to be compared with transcrip- 
tion levels on a gene-by-gene basis to generate lists of 
candidate genes within the defining amplicons (Hyman 
et <»/., 2002; Pollack et aL y 2002). However, use of 
transcript patterns does not allow assessment of the 
expression of protein products or identification of proto- 
oncogenes. Another approach, identifying differentially 
expressed proteins by proteomic analysis and then 
comparing the proteins present with mRNA expression 
in cDNA microarrays from the same specimens, can 
clarify the extent to which changes in transcript patterns 
reflect changes in their cognate proteins and post- 
transcriptional mechanisms (Chen et a/., 2002), but this 
approach cannot be used to identify oncogenes driven 
by extensive increases of their gene copy number. 
Moreover, using individual microarrays or proteomic 
approaches alone cannot distinguish the cancer-driving 
oncogenes that directly propel tumor progression from 
the larger number of passenger genes that may be 
concurrently over-represented but are not biologically 
relevant in tumor development. 
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In this study, we used a comprehensive approach that 
integrated simultaneous comparative genomic hybridi- 
zation (CGH) and transcript microarray with proteomic 
analyses of six lung adenocarcinoma cell lines. We 
directly and specifically identified four putative onco- 
genes that could have been activated through amplifica- 
tion and consequent elevation of transcript expression. 
We used small interfering RNA (siRNA) to inhibit the 
expression of two of these four genes in the lung cancer 
cell lines, which further implicated them in oncogenesis. 
We then explored the clinical significance of these 
findings by assessing the expression of these two genes 
in tissue microarrays of human lung cancer specimens. 
Our findings underscore the power of integrated 
functional genomic analyses for identifying putative 
oncogenes in tumorigenesis; such activated genes could 
be useful as targets for diagnosis or therapy in lung 
cancer. 



Results 

Simultaneous global genomic and transcript analyses 
identify IH3 genes with increases in genomic copy 
numbers and transcript expression levels 
To identify genes in which increased DNA copy number 
might contribute to increased transcript in lung adeno- 
carcinomas, first we used CGH with microarrays of six 
lung adenocarcinoma cell lines. We identified 587 genes 
showing increases in DNA copy number across all six 
cell lines (Supplementary Table IS), which were 
distributed as 90 amplicons on all chromosomes except 
for chromosomes 13 and Y (Supplementary Table 2S). 
A subsequent transcript test with the identical arrays of 
the same cell lines revealed 275 genes that showed 
increased mRNA levels (Supplementary Table 3S). 
Using random permutation tests across all cancer cell 
lines, we identified 183 genes (31%) that showed 
elevated transcript levels from the 587 genes that were 
over - represe n ted i n the g e nome (Table I ), suggesting 
that elevated transcript levels of the 183 genes may 
reflect their genomic over-representation in the cancer 
cells. These findings are consistent with previous reports 
linking genomic changes with altered transcript patterns 
in breast cancer (Hyman et at., 2002; Pollack et aL. 
2002). However, our finding that only 31% of the genes 
showing increased DNA copy numbers had cognate 
increases in transcript expression in lung adenocarcino- 
mas is different from the overall rates of 40-60% 
reported for breast cancer (Hyman et aL, 2002; Pollack 
et aL 2002). This discordance may reflect methodologic 
differences between studies or biological differences 
between breast cancer and lung adenocarcinoma. 



Proteomic analyses identify four genes for which protein 
abundance was associated with increases in the cognate 
gene and transcript levels 

Analysis of transcript patterns is insufficient for under- 
standing the expression of protein products and the 

ci.'eci of genomic o\ cr-:cprcs;;iU;mon mi the expression 



of their cognate proteins. To extend these findings 
beyond genomic over-representation to expression of 
the protein products of those genes, we next assessed 
protein expression in the same cell lines by two- 
dimensional polyacrylamide gel electrophoresis (PAGE) 
and found that 42 different proteins, representing 42 
individual genes, were significantly increased in the 
cancer cell lines (Table 2; Supplementary Figures IS and 
2S). Some of these proteins were identified as having 
multiple isoforms, and all individual isoforms exhibited 
increases in expression ranging from 4.6 to 12.8 times 
their expression in normal lung tissue cells. In compar- 
ing protein level of the 42 genes with changes in their 
cognate genomic and mRNA expression from the global 
microarray analyses, we found that four (9.5%) of those 
42 genes - PRDX! % EEFIA2. CALR y and KCIP-I - 
showed statistically significant correlations between 
elevated protein expression and increases in both copy 
number and mRNA expression (all r>0.84; / > <0.05) 
(Table 2) in the cancer cell lines. These findings imply 
that the abundance of these four proteins is attributable 
to the amplification and consequent elevated transcrip- 
tion of their cognate genes. 



Validation of copy number, transcript, and protein 
expression of PRDX1, EEFIA2, CALR. and KCJP-1 
in lung cancer cell lines 

To confirm our findings from the high^throughput 
analyses, we next used Southern, Northern, and Western 
blotting to assess DNA, RNA, and protein levels for the 
four genes identified in the six cell lines. For compar- 
ison, we arbitrarily chose one gene, NFKBI, in which an 
increase in protein level did not correlate with genetic 
changes. Overall, we found excellent concordance 
between the CGH microarray and Southern blotting 
analyses, transcript array and Northern blotting ana- 
lyses, and proteomic and Western blotting analyses for 
all five genes (Figure I). For example, KCIP-I showed 
fivef old amplification in five of the six cancer cell lines. 
whereas NFKBI showed no such increase in any of the 
cell lines. As for transcript expression, Northern blotting 
of EEF1A2 showed high expression in five of the six 
cancer cell lines; again, levels of NFKBI transcript were 
not increased in any cancer cell line as compared with 
normal bronchial epithelial cells. The results of Western 
blotting were also consistent with the results of the 
proteomic experiments; for example, five of the cancer 
cell lines exhibited strong protein bands for PRDXI as 
compared with normal ceils. These findings provide 
strong support for the validity of the results derived 
from the high-throughput techniques in this study. 

These parallel analyses also revealed close correla- 
tions in the extent of changes in gene copies, transcript, 
and protein of each of the four genes in the cancer cell 
lines. For example, in the five cancer cell lines that 
showed at least fourfold increases in EEFIA2 copy 
number, expression of transcript and protein was also 
increased by at least a factor of four as well (relative to 
their expression in normal cells) (Supplementary Figure 
3S). The protein abundance of the four genes showing 
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Table I List of 183 genes with statistically significant correlation 
(0.05) between genomic copy number and transcript level 

OVw symbol Chro. Distance from p arm of each a 

chromosome (Mb) 



Table I (continued) 



Gene xymhot Chro. Distance from p arm of each 
chromosome (Mb) 



TNOI 
DDOST 

sr 'N . 

M LI* 

AKRIAI 

PRDXI 

UQCRH 

RPL7 

COLIIAI 

MCLI 

PSMB4 

JTB 

RPS27 

HAXI 

MUCI 

ctT3 

' (. RABP2 
TKT 
ATPIB1 
CIMTI 
SNRPK 
VWHAQ 
ODCI 
RPL3I 
BENE 
STAT I 
HSPDI 
HSPEI 
RPL37A 
IGFBP2 
RPS7 
RABIA 
IGKC 
IT F 
PKN2 
KPNA4 
SI OOP 
UODH 
UCHLI 
SPPI 
TRIM2 
FGB 
FOG 



• 



SDHA 

PDCD6 

CCT5 

PTPRF 

RPL37 

P.NCI 

qp-c 

SPINK I 

CANX 

SOX4 

HDGP 

RPSIO 

RPLI0A 

VEGF 

osr-2 

FSCNI 

eves 

CBX3 

IGFBP3 

CLDN4 

HSPBI 

CALR 

COL I A3 

ATP5J2 

AKRIBIO 



2 
2 
2 
■> 

2 
2 
2 
2 
3 
3 
3 
4 
4 
4 
4 
4 
4 



5 
5 

5 
5 
5 
5 
5 
5 
5 
6 
6 
. 6 
6 
6 
6 
7 
7 
7 
7 
7 
7 
7 
7 
7 



8.5 
20.1 
26.4 
32.2 
45.4 
45.4 
46.2 
96.4 
102.6 
147.3 
148.1 
150.7 
150.7 
151 
151.9 
153 I 
i53.4 
159.3 
165.8 ' 
199.7 
200.2 
9.6 
10.60 
101.20 
110.40 
I9LK0 
198.30 
198.30 
217.30 
217.50 
3.30 
65.30 
89.00 
46.3 
151 
161.5 
6.7 
39.3 
4LI 
89.3 
154.7 
156 
156 



0.251 
0.305 

10.3 

14.2 

40.8 

74 

132.2 

147.2 

179.2 

21.7 
. 22.6 

34.6 

35.4 

43.7 

45.4 
5.3 

24.9 

25.9 

45.7 

72.7 

75.5 

92.7 

93.6 

98.7 
133.6 



0.0085 

O.OIII 

0.0113 

0.0114 

0.0128 

0.0122 

0.0125 

0.0127 

0.0129 

0.0222 

0.0131 

0.0134 

0.0135 

0.0266 

0.0143 

0.CI67 

0.0148 

0.0152 

0.0234 

0.0154 

0.0165 

0.0159 

0.0119 

0.0161 

0.0169 

0.0175 

0.0277 

0.0185 

0.0388 

0.0189 

0.0193 

0.0204 

0.0285 

0.0455 

0.0207 

0.0211 

0.1122 

0.0215 

0.0222 

0.0227 

0.0231 

0.0235 

0 0441 



0.0243 

0.0245 

0.0446 

0.0248 

0.0251 

0.0336 

0.0466 

0.0256 

0.0263 

0.0321 

0.0362 

0.0177 

0.0369 

0.0372 

0.0173 

0.0378 

0.0381 

0.0289 

0.0389 

0.0403 

0.0433 

0.0425 

0.0457 

0.0475 

0.0481 



RPS20 

TCEBI 

LAPTM4B 

RPL30 

KCIP-I 

PABPCI 

EEFID 

TSTA3 

RPL8 

TRAI 

RPL35 

HSPA5 

LCN2 

DPP7 

PFKP 

AKRICI 

PLAU 

DSP 

TALDOI 

SLC22AIL 

TSSC3 

RPL27A 

ST5 

LDHA 

MDK 

DOC-IR 

MMPI2 

HYOUI 

SCNNIA 

LDHB 

KRT7 

KRT5 

KRT6E 

ERBB3 

NACA 

TM4SF3 

NTS 

ASCLI 

TXNRDI 

CKAP4 

COX6AI 

BGN 

RAN 

RPL36A 



PGD 

THBS2 

TRAF4 

SPINTI 

RPLI7 

PKM2 

IDH2 

RPL23A 

MSLN 

UBL2I 

RPS2 

CLDN9 

ARL6IP 

EIF3S8 

TUFM 

ALDOA 

NME4 

GPR56 

CDHI 

NQOI 

SLC7A5 

APRT 

GALNS 

RPLI3 

MCP 



8 
8 
8 
8 
8 
8 
8 
8 
8 
9 
9 
9 
9 
9 
10 
10 
10 
10 
II 
11 
II 
II 
II 
11 
II 
II 
II 
It 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
12 
-44 — 



14 

15 

15 

15 

15 

15 

15 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

16 

17 



56.7 
74.6 
98.5 
98.7 
101 6 
101.78 
144.4 
144.5 
145.6 
117.1 
121. 1 
121.5 
124.4 
133.4 
3.2 
5.1 
75.6 ' 
76.7 
0.434 
2.9 
2.9 
8.7 
8.8 
18.5 
46.4 
67.5 
102.8 
118.9 
6.3 
21.7 
52.3 
52.6 
52.6 
56.2 
56.8 
71.2 
86.2 
103.3 
104.6 
106.6 
120.7 
122.5 
129.88 
— «H 



50.7 

37.5 

38.3 

38.7 

45.26 

70.1 

88.2 

0.377 

0.753 

1.3 

1.95 

3.1 

18.7 
28.3 
28.9 
30.1 
53.6 
57.4 
68.5 
69.5 
87.6 
88.6 
88.6 
89.3 
32.4 



0.0482 

0.0486 

0.0497 

0.0054 

0.0093 

0.0119 

0.0121 

0.0122 

0.0128 

0.0136 

0.0133 

0.0135 

0.0137 

0.0139 

0.0223 

0.0146 

0.0356 

0.0289 

0.0143 

0.0151 

0.0611 

0.0156 

0.0162 

0.0168 

0.0162 

0.0167 

0.0177 

0.0183 

0,0185 

0.0193 

0.0196 

0.0197 

0.0201 

0.0212 

0.0218 

0.0401 

0.0215 

0.0219 

0.0223 

0.0124 

0.0435 

0.0235 

0.0238 

0.02 4 3 — 

0.0248 

0.0251 

0.0253 

0.0254 

0.041 1 

0.0258 

0.0211 

0.0264 

0.0366 

0.0271 

0.0281 

0.0329 

0.0412 

0.0336 

0.0377 

0.038 

0.038 1 

0.0386 

0.0289 

0.0396 

0.0397 

0.041 I 

0.0255 

0.043 1 

0.0465 
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35.11 
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39.8 
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RPL27 
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41.1 


0.0046 


NMQI 
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0.0082 


CO 1.1 At 


17 
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0.0326 
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49.6 


0.01 II 
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17 
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0.01 17 


SMT.HI2 


17 


73.6 


0.01 19 


SYNGR2 


17 


76.6 


0.0122 


LOALS3BP 


17 


77.4 


0.0127 


P4HB 


17 


H0.3 


.0.0126 


PPAP2C 


19 


0.221 


0.0228 


GPI 


19 


39.55 


0.0145 


HPN 


19 


40.2 


0.0129 




\) 


41.4 


0.0131 


SPI NT2 


19 


43.4 


0.0238 


PSMDN 


19 


43.5 


0.0132 


YIFIP 


19 


43.5 


0.0135 


R PS 1 6 


19 


44.6 


0.0144 
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19 


46.9 


0:0145 
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19 


46.9 


0.0143 


GfPR 


19 


50.8 


0.0259 


SNRPD2 


19 


50.9 


0.0413 


K DELHI 


19 


53.6 


0.0152 


RPL2K 


19 


60.6 


0.0156 


RPS5 


19 


63.6 


0.0267 


TRIM 28 


19 


63.7 


0.0158 


DAP 


20 


35.6 


0.0166 


TOPI 


20 


40.3 


0.0172 


UBE2C 


20 


45.1 


0.0174 


RPS2I 


20 


61.6 


0.0268 


EHKI.A2 


20 


62.8 


0.0185 


Th F3 


21 


42.6 


0.0186 


TIT! 


21 


42.7 


0.0192 


CSTB 


21 


44.1 


0.0201 


MIP 


22 


22.6 


0.0202 


xbpi 


22 


27.5 


0.0204 


PRDX4 


X 


22.9 


0.0198 


SYNI 


X 


46.3 


0.0204 


TIMPI 


X 


46.3 


0.0209 


PLP2 


X 


47.8 


0.0212 


M AO Fl)l 


x 




00331 


RPS4X 


X 


71 


0.0124 


SSR4 


X 


152.6 


0.0232 



(Supplementary Figure 4S). In contrast, EEF1A2 and 
KCTP-I protein levels remained unchanged in mock- 
treated control cells and in cells transfected with a 
scrambled siRNA sequence. At 48 h after siRNA 
transfection, the percentage of proliferation of the 
transfected cancer cells was reduced to 15-30% as 
compared with 91-100% of cell proliferation of the 
same cell lines treated with PBS or scrambled siRNA 
(Supplementary Figure 5S). Apoptosis of siRNA- 
transfected cells was 27-34%, whereas only 4% of the 
same cell lines treated with PBS or scrambled siRNA 
showed apoptosis. These results strongly support an 
oncogenic role for the identified genes in lung cancer and 
confirm their potential usefulness as therapeutic targets 
for the disease. 



Amplification and protein expression of KCIP-I and 
EEFIA2 in lung tissue 

To further validate these findings and to assess the 
possible clinical significance of the four potential 
putative oncogenes identified from the cell lines, we first 
applied fluorescence in situ hybridization and immuno- 
histochemical analysis, in parallel, to commercially 
available human lung tissue microarrays (Ambion, 
Austin, TX, USA) to evaluate the status of two of these 
four genes in lung cancer tissue specimens. (Commer- 
cially available antibodies to PRDXI or CALR were 
not suitable for use in immunohistochemical analysis 
when this report was written.) Overexpression of KCIP- 
1 and EEFIA2 protein in the tumors was concordant 
with amplification of the corresponding genes 
(/> = 0.0003 for KCIP-i and /> = 0.0011 for EEF1A2). 
For example, 16 (35%) of the 46 lung adenocarcinomas 
in the microarray showed amplification of KCIP-I, and 
strong cytoplasmic staining for K.C1P-J protein was seen 
in 18 tumors (39%) (Figure 2). We next examined 
whether overexpression of these genes was associated 
with increased cell proliferation by analysing Ki-67 
expre ss ion in contiguous sections of the tissue micro - 
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corresponding increases in both DNA copy number and 
mRNA provides further evidence that these could be 
oncogenes, the activation of which is reflected by 
genomic amplification and consequent increases in 
transcript level in lung adenocarcinoma cell lines. 

Specific inhibirion of EEFIA2 and KCIP-i expression by 
siRNAs led to decreased cell proliferation and induction of 
apo ptosis 

To further prove the oncogenic function of the identified 
genes in lung tumorigenesis. we used siRNAs to inhibit 
the endogenous expression of EEF1A2 and KCIP-1 
protein in four lung cancer cell lines (HI 563, H229, 
H522, and SK-LU). Transfection of the cancer cells with 
specific siRNAs reduced the level of EEFIA2 and 
KCIP-I protein by 70 90% 4Sh after transfection 



arrays. Positive Ki-67 expression was found to correlate 
with positive expression of both KCIP-I (/> = 0.02) and 
EEFIA2 (/> = 0.0l). To extend these findings, we then 
studied 1 1 tissue microarray blocks comprising normal 
and tumor tissue specimens from 113 patients with 
pathologic stage I non-small-cell lung cancer who had 
undergone curative surgery (Wang et aL, 2005). 
Immunohistochemical analysis showed that EEF1A2 
was expressed in 32 cases (28%) and KCIP-I in 29 cases 
(26%). Univariate and multivariate Cox proportional 
hazards models were used to detect possible associations 
between EEFIA2 and KCIP-I expression and clinico- 
pathologic variables. Expression of EEFI A2 or KCIP-I 
was associated with short overall survival lime 
(^ = 0.0012 for EEF1A2 and />=0.0026 for KCIP-I) 
(Supplementary Figure 6S). Age at diagnosis, histologic 
type of cancer, degree of tumor differentiation, and 
smoking history were not associated with survival time. 

Although only two genes were validated in the lung 
tissue microarrays (because available antibodies to 
the other two genes were not suitable for use in 
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Tabic 2 Proteins showing significant ovcrcxpression in cancer cell lines relative to those in normal bronchial epithelial cell lines and their 
_ correlation coelHcients with increased DNA copy number or mRNA values" 
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PRDXI 

iu-;kia2 

CALR 

KCIP-I 

P4HB 

KRTI7 

GSTPI 

IGFBP-3 

EEFIG 

K RTI9 

HSPBI 

PGKI 

TAGLN 

JTK9 

LGALS! 

DPF3 

HAPI 

KRTI8 

KRT8 

HP 

FGR 

NFKBI 

SI00A2 

ZBTBI7 
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MAP4 
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MCM3 

HCK 
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CDC25B 
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CCNHI 

CSF2 
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GTM4 

E1F5A 



48.4/5.4 
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23.2/4.7 
31.6/5.8 

50/6.4 
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22/6.5 
44.5/4.2 
.22.5/4.3 
59.5/6.8 

'6/5.5 
25.8/4.8 
75.5/6.5 

48/5.3 
9.2/4.4 
55.2/6.2 
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50.4/6.3 
10.9/4.6 
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75.2/4.8 
1 1 1/5.4 
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90.9/5.5 
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22.6/5.8 
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21.5/6.8 
49.6/6.5 
47.1/4.3 
16.9/6.3 
25.3/5.2 
54.7/4.3 

25.3/5.0 
10/4.4 
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liukaryoiic translation elongation factor I alpha 2 
Calrcticulin 

Tyrosine 3-monooxygcnasc activation protein, zeta 
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Glutathione ^-transferase pi 
Insulin-like growth-factor binding protein 3 
Eukaryotic translation elongation factor I gamma 
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Phosphoglycerate kinase I 
Transgclin 
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Huntington-associatcd protein I 
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Keratin 8 
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Gardner-Rasheed feline sarcoma viral oncogene homolog 
Nuclear factor of kappa light gene enhancer in B-cclls I 
SI00 calcium-binding protein A2 
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Microtubulc-associated protein 4 
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Minichromosomc maintenance deficient 3 
Hemopoietic cell kinase 
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Cell division cycle 25 B 

Cell division cycle 42 <GTP-binding protein, 25kDa) 
Ras homolog gene family, member A 
Ras-relatcd C3 botulinum toxin substrate I 
Tubulin, beta polypeptide 
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Colony stimulating factor 2 (granulocyte-macrophage) 
,Proteasome (prosome, macropain) subunit, beta type, 6 
Aldehyde dehydrogenase I family, member A I 

Glutathione S-trans (erase M4 

EukarVQtic translation initiation fartnr SA 
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immunohistochemical analysis), these findings are con- 
sistent with those from our cell lines, demonstrating 
again that genomic amplification and consequent 
increases in amounts of transcript may be, at least in 
part, driving the abundance of proteins in these lung 
tumors. The association between expression of these 
genes and that of Ki-67. a known indicator of poor 
prognosis in lung cancer (Martin et ai, 2004), suggests 
that activation of these genes may be an indicator of 
tumor aggressiveness. These results also suggest that 
expression of EEFI A2 and KCIP-1 proteins in stage J 
non-small-cell lung cancer may be useful as a marker for 
distinguishing patients with relatively poor prognosis 
Irom those who might benefit from adjuvant treatment. 



Discussion 

Our current study illustrates the power of integrated 
functional genomic analyses for identifying putative 
oncogenes and for evaluating their potential clinical 
significance. Among the four identified oncogenes, three 
genes (PRDXL CALR, and KCIP-I) have been im- 
plicated in lung lumorigenesis. PRDXI is an antioxidant 
protein involved in regulating cell proliferation, differ- 
entiation, and apoptosis. Kim et al. (2003) found 
PRDXI expression to be elevated in both lung cancer 
and adjacent normal lung tissue, suggesting that 
activation of PRDXI may enhance proliferation in lung 
cancer. CALR has a major role in Ca 2+ binding and the 
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Figure I Confirmation by Southern. Northern, and Western blot 
analyses of increased DNA copies, transcript levels, and protein 
levels in the four genes identified in high-throughput analyses. For 
comparison, wc arbitrarily chose one gene, NFKBt, in which an 
increased protein level did not correlate with genetic changes. The 
Moiling results are consistent with the results from the COM array, 
transcript array, and protcomic analyses. Nor, indicates normal 
bronchial epithelial cell line. All the experiments were repeated at 
least three limes with each cell line. Means of normalized to /?-aciin 
signal intensities on Southern. Northern, and Western blots, along 
with 95% confidence intervals, were calculated (jj-actin signals are 
not shown in the figure; two different normal bronchial epithelial 
cell lines were used in the confirmation and only one normal cell 
line is shown in the figure). 



transcriptional regulation of other genes and was 
- -recently found to be overexpressed in 73% of 40 lung 
adenocarcinomas (Oates and Edwards, 2000). KCIP-1 
belongs to the 14-3-3 family, which participates via the 
MAPK and Wnt signaling pathways in the regulation of 
many cellular processes including cell proliferation and 
differentiation as well as tumorigenesis (Thomas et a/., 
2005). KCIP-1 was recently found to be expressed in all 
!2 lung tumors tested in a single-institution study (Qi 
et aL 2005). Interestingly, EEFJA 2 was originally 
considered a putative oncogene in ovarian cancer on 
the basis of its being amplified in 25% and over- 
expressed in 30% of the same set of ovarian tumors 
(Anand et <//., 2002); functional analyses have estab- 
lished its oncogenic role in cellular transformation (Lee, 
2003). Our discovery that EEFIA2 may be a putative 
oncogene in lung adenocarcinoma demonstrates the 
power of our functional genomic strategy for rapidly 
identifying potential oncogenes. 

Although the main focus of this study was to 
s|xvi!ically identify putaMvv oncogenes, it should be 



noted that 90.7% of the genes showing high protein 
expression did not show corresponding increases 
in both DNA copy number and transcript, a finding 
consistent with that of others that transcriptional, 
translational, and post-translational regulatory mecha- 
nisms can greatly influence the abundance of protein 
in lung tumorigenesis (Chen et aL y 2002). For example, 
NFKB1 is a critical arbiter of immune responses, 
cell survival, and transformation and is often activated 
in several types of tumors (Chen et a/„ 2002). De- 
regulation of NFKB1 is thought to be modulated 
through phosphorylation of Ser337 by protein kinase 
A (Chen et a/., 2002). In our study, 68.8% of the 
genes showing over-representation in the genome 
did not show elevated transcript - levels, implying 
that at least some of these genes are 'passenger' genes 
that are concurrently amplified because of their 
location with respect to arnplicons but lack bio- 
logical relevance in terms of the development of lung 
adenocarcinoma. 

Although the potential oncogenes we identified here 
are likely to be important, certainly other oncogenes 
could be involved in the development of lung adeno- 
carcinoma. The oligo microarray we used consists of 
22000 probes, which represent only about 60% of the 
human genome. Moreover, each probe was designed for 
the 3' region of expressed sequence lags of the selected 
genes. Also, our results were initially derived from 
cancer cell lines, although the findings were later 
confirmed in human tissue samples. Our ongoing study 
using microarrays with information on more genes 
and the development of high-resolution proteomic 
analyses for use with larger numbers of specimens will 
allow more comprehensive analyses of the molecular 
consequences of gene amplifications. Such expanded 
analyses will very likely lead to the identification of 
additional oncogenes. 

Some of the results of our current study were 
comparable to those of other studies of lung cancer. 
For example , genomic- co p y number a nd prote i n leve l s 
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of KCIP-1 were previously found to be amplified and 
overexpressed in primary lung cancers by cDNA clone- 
based CGH array analysis (Jiang et al. y 2004) and 
proteomic analysis (Chen et aL, 2002), respectively. Our 
functional genomic approach, which integrates simulta- 
neous CGH, transcript microarrys, proteomic analyses, 
and siRNA, allows us not only to quickly identify 
potential oncogenes but also to explore their significance 
as diagnostic and therapeutic targets in tumor progres- 
sion - more than could be achieved by any technique 
alone. 

Genes identified in this way may serve as promising 
targets for diagnosis and therapy in lung adenocarci- 
noma. Further research on the clinical implications of 
such genes is needed; experiments now underway in our 
laboratory include overexpression of the genes in 
normal cells, disruption of the function of these genes 
in cancer cells, and investigation of how interactions 
among these genes (or interactions with other known 
oncogenes) may mediate the expression of the trans- 
formed phenotype. 
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Figure 2 EEF1A2 amplification is associated with high EEFIA2 protein expression in lung adenocarcinomas, (a) Cells from a lima 
adenocarcinoma i ample m which EEFIA2 is amplified show more green signals (EEFIA2J than red ^ bS^ 20 
^ow^lrn^^ Wfaiiion, x 400). (b) rmmunohistochemical staining of cells from the same tissue 

shows strong EEF A2 staining ,n the cytoplasm, (c) A lung adenocarcinoma sample with two copies of EEF/A2an6 chromosome" 0 
ccn romcne probe, md.calmg no EEFIA2 amplification (original magnification, x 400). (d) XuLmM^^ 
Irom the Name tissue sample as in panel c shows negative staining for EEF I A2. * 



Materials and methods - 

Cell lines 

Six human lung adenocarcinoma cell lines (H23, H229, HI 792, 
SK-LU-I. H522, and HI 563) were obtained from the 
American Type Culture Collection (Manassas, VA, USA). 

Two norma l bronchial epithe lial cel l li nes we r e obtai n ed f ro m 

Clonlech (Palo Alto, CA, USA). Genomic DNA, mRNA, and 
protein were derived from a single harvest of these cells. 

DNA and RNA profiles by microarray analysis 
Genomic DNA labeling and hybridization were performed as 
described previously (Barrett et af.. 2004) with Agilent's 
Human IA Oligo Microarray (V2) (Agilent Technologies, 
Palo Alto, CA, USA), which- contains 22 000 unique 60-mer 
oligos. Details of the protocol for analysing transcripts are 
available at hltp://\vww.cheni.agilenLcom. Map positions for 
arrayed genes were assigned by identifying the DNA sequence 
represented in the UniGene cluster and matching it with the 
Golden Path genome assembly (http://genome.ucsc.edu/; Mat 
7, 2004 Freeze). Microarray images of DNA copy number and 
expression were analysed by using AgilcntCGH Analytics and 
Feature Extraction software. DNA copy number profiles that 
deviated significantly from background signal ratios (measured 
from normal control cell hybridization, as described elsewhere; 

• Barren et aL 2004) were interpreted as evidence of true 
differences in DNA copy number. The criteria for defining 
genomic over-representation and am pi icons are described 
vlscwhcre (llynmn n .-;/.. ?i><)2): details are given in the 
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Supplementary Information. An increase in mRNA level was 
defined as a twofold increase in signal ratio relative to that of 
the control (log 2 > I). 

Quantitative two-dimensional PAGE and mass spectrometry 
Analysis of proteins by two-dimensional PAGE and their 
identification by mass spectrometry were performed as 
previously described (Shen et aL, 2004). Briefly, protein pellets 
were solubilized in rehydration buffer, after which the first- 
dimension isoelectric focusing was carried out with a Protean 
IEF Cell (Bio-Rad Laboratories) and the second-dimension 
separation was carried out with Bio-Rad's Ready Gel Precast 
Gels and the Bio-Rad Criterion Cell apparatus. Protein spots 
were visualized by silver-based staining, and all gels were 
assessed with Bio-Rad's PDQuest 2D gel image analysis 
software. Selected spots were subjected to in-gel tryptic 
digestion and analysed on a Voyager-DE PRO matrix-assisted 
laser desbrption ionization/time-of-flight mass spectrometer 
(Applied Biosystems, Foster City, CA, USA). The mass list of 
the 20 most intense monoisotopic peaks for each sample was 
entered in the MS-Fit search program (v3.2.l) (http:// 
prospector.ucsf.edu/ucsfhtml4.0/msfit.htm) and searched in 
the National Center for Biotechnology Information protein 
database. 

Southern, Northern, and Western blot analyses 
Southern, Northern, and Western blot hybridizations were 
performed according to standard protocols. cDNA clones for 
the tested genes were purchased from Invitrogen (Carlsbad, 
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CA. USA) and prepared as probes for the blot hybridizations. 
Antibodies used were obtained as follows: PRDXI, CALR, 
NFKBI, KCIP-I. and /7-actin from Santa Cruz Biotechnology 
(Santa Cruz. CA. USA); and EEFIA2 from Upstate Biotech- 
nology (Wultham. MA. USA). 

Fluorescence in silu hybridization and imnnmohistochemical 
analyses of lung tissue microarrays 

Fluorescence in situ hybridizations and immunohistochemical 
analyses or KCIP-I and EKFI A2 were carried out as described 
elsewhere (Jiang ct aL 2002; Wang et <//., 2005) with Lung 
Tissue Microarrays (Amnion, Austin, TX, USA) and II 
homemade microarray blocks containing tissue samples from 
1 1 3 patients with pathologic stage 1 non-small-ccll lung cancer 
(Wang et aL 2005). DNA probes specific for KCIP-I and 
EEFI A 2 were obtained hy screening a Human BAC Clone 
library (Invitrogcn) by polymerase chain reaction as described 
previously (Jiang ct aL. 2O02). The antibodies used for the 
immunohistochemical analyses were the same as those used 
for the Western blotting. Cell proliferation of the lung tissues 
was assessed vviih a Ki-67 monoclonal antibody from Santa 
Cruz Biotechnology. Definitions of the cutoff value for a 
positive result of each antibody are shown in Supplementary 
Information. 

siRNA transection; cellular proliferation assay, and apoptosis 
analysis 

Transactions were carried out by using siPORT Lipid 
Transfcclion Agent (Ambion) with siRNAs targeting KCIP-I 
or EEFIA2 or with a scrambled siRNA duplex (siControl) 
(Dharmacon Inc., Lafayette, CO, USA), with PBS used as a 
negative control (Jiang et aL, 2002). Cells were fixed 24. 48, or 
96 h later and subjected to further tests. AH siRNAs were 
prepared by using a transcription-based method with Silencer 
siRNA according to the manufacturer's instructions (Am- 
bion). Sequences of the individual siRNAs are listed in 
Supplementary Table 4S. Inhibition of cell growth by the 
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siRNAs was determined by MTT staining, and cell growth 
rate was plotted against the percentage of viable cells in the 
saline-treated controls (a value arbitrarily set at 100%) (Jiang 
et aL, 2002). Apoptosis was analysed by fluorescence cell 
cycle analysis of terminal deoxynucleotidyl transferase- 
mediated dUTP nick-end labeling with FITC-labeled dUTP 
(Boehringer Mannheim Biochemicals, Mannheim, Germany) 
(Jiang et aL, 2005). 

Statistical analyses 

Relationships between gene copy number and mRNA level 
were examined as described elsewhere (Hyman et aL. 2002. 
Supplementary Information). Correlations between protein 
abundance and DNA copy number and mRNA expression of 
the corresponding genes were evaluated with the Spearman 
correlation coefficient. Fisher's exact test and £ 2 -tests were 
used to analyse associations between amplification and 
expression of the candidate genes with various histopathologic 
variables of the samples in the tissue microarrays. Univariate 
and multivariate analyses were carried out with Cox's 
proportional hazards model to determine which independent 
factors might have a joint significant influence on survival. A 
lvalue sSO.05 was considered statistically significant; all 
statistical tests were based on a two-sided significance level. 
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Regulation of cytochrome P4501A1 in teleosts: sustained induction of 
CYP1A1 mRNA, protein, and catalytic activity by 2,3,7,8- 
tetrachlorodibenzofuran in the marine fish Stenotomus chrysops. 

Hahn ME, Stegeman JJ . 

Biology Department, Woods Hole Oceanographic Institution, Massachusetts 02543. 

Cytochrome P4501AI (CYP1 Al) is known to play important roles in die activation and 
detoxification of carcinogens and other toxicants in vertebrate animals, including fish. 
Although extensively studied in mammalian systems, the regulation of CYP1 A forms in 
other vertebrates is less well understood. We examined the time course and dose- 
response relationships for induction of CYP I Al rnRNA, protein, and catalytic activity by 
2,3,7,8-tetrachlorodibenzofuran (TCDF) in the marine fish Stenotomus chrysops (scup). 
The time course of CYP1 Al induction was determined following a single ip dose (10 
nmol/kg) of 2,3,7,8-TCDF. Hepatic ethoxyresorufin O-deethylase activity was increased 
after I day, reached a maximum by 8 days, and was still elevated 14 days after treatment. 
The content of immunodetectable CYP I Al protein in liver was elevated on Day I and 
continued to increase through 14 days. CYP1 Al protein content was also strongly 
induced in heart and gill beginning at 2 days after treatment and extending through Day 
14. Hepatic CYP1 Al mRNA was strongly induced by 1 day after dosing and remained 
elevated through 14 days. The sustained induction of CYP1 Al mRNA by 2,3,7,8-TCDF 
contrasts with the transient induction seen previously in fish treated with nonhalogenated 
inducers and most likely reflects differences in persistence of the inducers. Dose-resporise 
studies indicated that induction of CYP 1 Al mRNA, protein, and catalytic activity 
occurred following doses of 2,3,7,8-TCDF as low as 0.4 nmol/kg (120 ng/kg), within the 
range of whole-body contents of this congener measured in fish from contaminated 
environments. The estimated dose producing half-maximal CYP1 Al induction in scup 
was approximately 2-10 nmol/kg, suggesting that the sensitivity of these fish to induction 
may be as great as or greater than that of rats. In contrast to previous results obtained with 
3,3VM , -tetrachlorobiphenyl (TCB) and beta-naphthoflavone, which appear to inhibit or 
inactivate CYP1 Al in fish and other vertebrates, there was a good correlation among 
levels of CYP1 Al mRNA, protein, and catalytic activity in individual fish following 
various doses of 2,3,7,8-TCDF. The difference in response to 2,3,7,8-TCDF versus 
3 > 3 , ,4,4'-TCB may reflect differences in the inducing potencies.of the two compounds 
relative to their similar potencies as inhibitors of CYP1A1 catalytic activity. In additional 
studies to evaluate structure-activity relationships for CYP1 Al induction by chlorinated 
dibenzofurans in fish, scup were treated with 2,3,6,8-tetrachlorodibenzofuran (2,3,6,8- 
TCDF). At 10 or 50 nmol/kg, 2,3,6,8-TCDF was inactive as an inducer of CYP1 Al 
mRNA, protein, or catalytic activity.(ABSTRACT TRUNCATED AT 400 WORDS) 
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Expression of the pS2 gene in breast tissues assessed by pS2-mRNA 
analysis and pS2-protein radioimmunoassay, 

Hahnel E, Robbhas P. Harvey Ji Sterrett G, Hahnel R . 

Department of Pathology, University of Western Australia, Queen Elizabeth II Medical 
Centre, Nedlands. 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-proteiri 
using a radioimmunoassay, and by determining pS2-mRNA using Northern blotting. 
There was a good correlation between the two measurements, indicating that expression 
of the pS2 gene in breast tissues may be assessed by either method. Since 
radioimmunoassay is technically easier and more efficient than Northern blotting, 
radioimmunoassay will be the method of choice in routine applications. 
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Brief communication 

Expression of the pS2 gene in breast tissues assessed by pS2~h)RNA analysis 
and p$2-protein radioimmunoassay 



Eriica Hahnef, Peter Robbins, Jennet Harvey, Gregory Sterrett and RoJand Habnel - 
Department of Pathology, University of Western Australia, Queen Elizabeth II Medical Centre, Nedlands, 
6009, Western Australia 



Key words: breast tissue, pS2-mRNA, pS2 protein, radioimmunoassay 

Summary ... ... 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-prorein us ing a radioimmu- 
noassay, and by determining pS2-rnRNA using Northern blotting. There was a good correlation between the 
two measurements, indicating that expression of the pS2 gene in breast tissues rriay be assessed by either' 
method. Since radioimmunoassay is technically easier and more efficient than Northern blotting, radioim- 
munoassay will be the method of choice in routine applications. 



Introduction 

Expression of the pS2 gene is controlled by estro- 
gen. This was first described in the MCF-7 breast 
cancer cell line [1]. pS2 expression has since been 
reported to be useful as a prognostic indicator f2, 
3], although this was not confirmed in another se- 
ries (4]. 

pS2 expression may be assessed in tissue ho- 
mogenates by ansfysis of pS2-mRNA [5J, by ra- 
dioimmunoassay of the pS2-protein [2], ox by im- 
munpcytochemical detection of the pS2 protein in 
tissue sections [5]. Jt was the aim of this study to 
establish the correlation between pS2-mRNA and 
pS2-protein by radioimmunoassay in a series of 
tissues obtained from mastectomy specimens per- 
formed for carcinoma of the breast. Primary breast 
carcinoma tissue, metastatic carcinoma within ax- 
illary Dodes. and macroscopjcajjy benign breast 
ussue were examined. 



Materials and methods 



Breast tissues 



Tissue specimens from mastectomies performed 
for carcinoma of the breast were examined. 32 pri- 
mary breast carcinomas, 10 axillary lymph nodes 
containing metastatic breast carcinoma, and 20 
samples of uninvolved breast tissue were analyzed 
for pS2 expression, 

. The primary breast carcinomas were histolog- 
ically classified using a conventional subclassifi- 
cation. The presence or absence of primary tumour 
was assessed. The presence of metastatic carcino- 
ma within lymph nodes studied was verified by 
histological examination of the node remnant after 
sampling. 

'Uninvolved* breast tissue was sampled from 
sites well removed from the primary breast tumour 
(usually in another quadrant of the breast), and was 
selected only if the tissue appeared macroscppical- 
)y unremarkable. Tissue sampling occurred irrime- 
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diately upon arrival of the mastectomy specimen in 
the laboratory, with minimal delays between re- 
moval and sampling. 
. • Tissues for j>S2 analysis were snap frozen in liq- 
uid nitrogen and stored at -70° C until processed. 



Extraction of UNA and determination of 
pS2-mRNA • 

Details of the procedure have been described in pur 
previous paper [6 J. Briefly, the deep-frozen tissue 
>vas homogenized in a micrp-dismembrator. The 
homogeneous powder, was extracted with guani- 
diniumjsothiocyanatephenolchloroformiso- 
amylalcobol, and RNA was precipitated with iso- 
propanol. The washed RNA pellet was dissolved in 
SDS and glyoxylated, and the RNA preparation 
loaded onto agarose gel. After electrophoresis the 
gel was capillary blotted onto Zeta-probe mem- 
branes. Membranes were hybridized overnight 
- with cDNA.probes pS2 and 36B4, which were la- 
beled with [a^PJ dCTP by nick translations. 
! Washed membranes were exposed to Kodak X- 
omat AR film. Relative intensities of the mRNA 
bands were assessed visually as not detectable, ve- 
ry weak; weak, medium, strong, and very strong, 
taking the intensities of the nbiguitous 36B4 bands 
into account. 



Radioimmunoassay ofpS2-protein 

Deep frozen specimens were pulverized with a mi- 
crodismembrator. The tissue powder was suspend- 
ed in 10 volumes of pH 7.5 phosphate buffer/The 
homogenate was centrifuged in a refrigerated cen- 
trifuge at 4°C for 60 minutes at 2600xg. The su- 
pernatant was removed with a Pasteur pipette, 
carefully avoiding the fat layer on the top. The pro- 
tein concentration in the supernatant was estimated 
by use of the Coomassie dye-binding method [7J. 
An aliquot of the supernatant was diluted to a pro- 
tein concentration between 1 and 2mg/ml before 
assay of tbe pS2- protein. In one case the protein 
concentration of the supernatant was well beJow 
Ime/ml. 



The estimation of the pS2-protein was perform- 
ed using a solid phase, two-site radioimmunoas- 
say. The kits were bought from CIS Biointema- 
tional, Gif-sur-Yvette, France (ELSA-PS2). In this 
method the molecules of pS2 are sandwiched be- 
tween two monoclonal antibodies; the first one is 
coated on the EJLSA solid phase, the second one is 
radiolabeled with j25-iodme. The radioactivity 
bound to the ELSA is proportional to the concen- 
tration of pS2-protein. Details of tbe procedure are 
supplied with the kit {8), " 

/ Results and discussion 

32 primary , breast carcinomas, metastatic breast 
carcinoma in 10 lymph nodes, and 20 samples of 
benign breast tissue from mastectomies were in- 
vestigated. Two of the carcinomas were of the infil- 
trating lobular type, two were ductal carcinomas in 
situ, one was a multicentric invasive ductal carci- 
noma, all others were invasive ductal carcinomas. . 

Examples of pS2 Northern blots have been 
shown in our previous paper [6] which demon- 
strate that undegraded pS2rmRNA can be isolated 
by the method used. 

The results of the pS2-protein and pS2-mRNA 
assays are shown on Kg. 1. There was a good corre- 
lation betweejube two types of results. When pS2- 
mRNA could not be detected by Northern blot, 
pS2-protein results were usually below ] ng/mg 
protein (22 of 30) ; or between 1 and 3.7ng/mg (6 of 
30). Two were exceptions .(7.7 and 14.6); one of 
them could have been due to the very low protein 
content in the cytosol which would lead to a large 
pS2 value and an associated error. There was no 
explanation for the other high result. Very weak 
pS2-mRNA signals on Northern blots correspond- 
ed to pS2-protein values between 1.1 and 19.2 with 
an average of 6. 6 ng/mg protein (median 5.7); The 
mean arid median pS2-protein concentration in the 
tissue with weak pS2-mRNA signals Were 14.3 and 
10.7 ng/mg protein, respectively. The average pS2- 
protein concentration increased to 32.7 (median 
31.5) ng/mg protein for tissues assessed as medium 
pS2-mRNA intensity, and to 43.3 (median 53.8) 
^g^rog protein for tissues with strong or very strong 
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pS2-mRNA signals. These values should be used 
as an approximate guide only, since the number of 
samples in the various groups was fairly small. 
One-way analysis of variance confirmed that the 
means, of the pS2-protein values in the groups 
made up according to their pS2-mRNA signal in- 
tensity, were significantly different (p < 1 (H). 

If the pS2 gene is expressed, its expression is on 
average greater in breast carcinomas than in unin- 
volved breast tissue. If one takes pS2-protein valr 
ues above 4ng/mg protein as cut-off, the average 
pS2-protein irt J4 breast cancers was 343 (median 
: (35.2), while it was only 18.1 (median 13.8) in 12 
uninvolved breast tissue samples. If the cut-off is 
taken at idng/mg protein, average pS2-protein in 
breast carcinoma is also about twice the level of 
uninvolved tissue. There were not enough lymph 
node metastases which expressed the pS2 gene to 
allow a comparison with carcinoma or uninvolved 
breast tissue. , 

Recent preliminary results of pS2 by radioim- 
munoassay [9] are similar to ours for breast cancer 
but considerably lower than our results for normal 
breast tissue. 

The values of the pS2-protein measured obvj- * 
ously depend on the protein used for calibration. 
We used the pS2-protein standards supplied with 
the CIS kit, which according to the supplier gave 
values from 0 to 740ng/mg protein in a series of 
205 breast cancer cytosols. Previously, a different 
standard had been used for presumably the same 
series of breast carcinomas [2], and a conversion 
factor to current standards is given as 2.8 [8]. 

It was noticed that the correlation between pS2- 
protein and pS2-mRNA was better in breast carci- 
noma specimens than in uninvolved breast tissue. 
This is unexplained, though it could be due to the - 
variable content of cell or tissue types in adjoining 
parts of a specimen, a variation more likely to oc- 
cur in our sampling of non-ma)ignani breast tissue 
compared to sampling of carcinomas. A similar 
variability in breast carcinoma specimens will 
probably have a smaller influence on the pS2 re- 
sults, since the malignant cells - if ihey do express 
the pS2 gene - contain more pS2-prote in than nor- 
mal breast. 
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The role of the epidermal growth factor receptor in sustaining neutrophil 
inflammation in severe asthma. 

Hamilton LM. Tort es-Lozano C, Puddicombe SM Richter A, Kimber I, Dear man 
M Vrugt B , Aalbers R , Holgate ST , Djukanovic R . Wilson SJ, Davies DE . 

Division of Infection, [nflamrnation & Repair, School of Medicine, University of 
Southampton, UK. 

BACKGROUND: The extent of epithelial injury in asthma is reflected by expression of 
the epidermal growth factor receptor (EGFR), which is increased in proportion to disease 
severity and is corticosteroid refractory. Although the EGFR is involved in epithelial 
growth and differentiation, it is unknown whether it also contributes to the inflammatory 
response in asthma. OBJECTIVES: Because severe asthma is characterized by 
neutrophilic inflammation, we investigated the relationship between EGFR activation and 
production of IL-8 and macrophage inhibitory protein-1 alpha (IvfLP-1 alpha) using in 
vitro culture models and examined the association between epithelial expression of IL-8 
and EGFR in bronchial biopsies from asthmatic subjects. METHODS: H292 or primary 
bronchial epithelial cells were exposed to EGF or H202 to achieve I igand-dependent and 
ligand-independent EGFR activation; IL-8 mRNA was measured by real-time PCR and 
IL-8 and MlP-lalpha protein measured by enzyme-linked immunosorbent assay 
(ELISA). Epithelial IL-8 and EGFR expression in bronchial biopsies from asthmatic 
subjects was examined by immunohistochemistry and quantified by image analysis. 
RESULTS: Using H292 cells, EGF and H202 increased IL-8 gene expression and 
release and this was completely suppressed by the EGFR-selective tyrosine kinase 
inhibitor, AG 1 478, but only partially by dexamethasone. MIP-1 alpha release was not 
stimulated by EGF, whereas H202 caused a 1.8-fold increase and this was insensitive to 
AG1478. EGF also significantly stimulated IL-8 release from asthmatic or normal 
primary epithelial cell cultures established from bronchial brashings. In bronchial 
biopsies, epithelial IL-8, MDP-1 alpha, EGFR and submucosal neutrophils were all 
significantly increased in severe compared to mild disease and there was a strong 
correlation between EGFR and IL-8 expression (r = 0.70, P < 0.001) CONCLUSIONS: 
These results suggest that in severe asthma, epithelial damage has the potential to . 
contribute to neutrophilic inflammation through enhanced production of IL-8 via EGFR- 
dependent mechanisms. 
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Human hepatic microsomal epoxide hydrolase: comparative analysis of 
polymorphic expression. 

HassgttC, LinJ, CartyCL, Laurenzana KM , Omiecinski CJ . 

Detriment of Environmental Health, University of Washington, Seattle 98105-6099, 

toemdividual variation in theexpression of human microsomal epoxide hydros 
£nEH) may be an important risk factor for chemically induced toxicities, including 
cancer and teratogenesis. In this study, phenotypic variability and mEH genetic 
polymorphisms were examined in a bank of 40 transplant-quality human liver samples 

S^SS^^S^ r ^ enZymatiC " ^ PotvWSSno 
acids, as well as mEH RNA levels were evaluated in parallel. Enzymatic activity was 

rXTLT! ^^P^-^S-epoxide at 2 substrate c^centrations. The 
t^Z^fr-^Ti 3C : Vlh ^ I ob ! I ained usin S bating substrate levels were highly 
Related (r - 0.85> with results derived from limiting substrate concentrations and • 
exhibu approximately an 8-fold range in activity levels across the panel of 40 liver 

m e ? Zym f 3CtiVity *° dem ° nstra ted strong correlation (r > or = 0.74) with 
an 8.4-fold vanat,on determined for mEH protein content within the same samples 

^nZ'l rr 1 ^^^^^ ^ poorly correlated (r <or = 0.23) with 

activ E ^ °S ^ T d - d excIu * veIv accou «t for variation in enzymatic 
achvity, although this conclusion IS confounded by heterozygousity in the samples These 
dau demonstrate the extent of hepatic mEH functional variability L well-preset 
JTE 311(1 SU ? g6s , t ^ Polymorphism of mEH protein expression is regulated in 

"^^^^^ 
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Restored expression and activity of organic ion transporters rOATl, 
rOAT3 and rOCT2 after hyperuricemia in the rat kidney. 

HabuY, YanoJ, Qkuda fll Fukatsu A. InuiJC 

Department of Pharmacy, Kyoto University Hospital, Faculty of Medicine, Kyoto 
University, Sakyo-lcu, Kyoto 606-8507, Japan. . 

We previously reported that in hyperuricemic rats, renal impairment occurred and organic 
lon fransport act.vity decreased, accompanied with a specific decrease in the expression 
SSr^fT 2111011 tranSporters > rOAT1 ™ d "OAT3, and organic cation transporter 
aSal PreSent W " inVesti 8 ated the reversibility of the organic ion transport 
activity and expression of organic ion transporters (slc22a) during recovery from 
hyperuncern,a, Hyperuricemia was induced by the administration of a chow containing 
unc acid and oxonic acid, an inhibitor of uric acid metabolism. Four days after 

tl^Tr^I th ? Ch ° W * Plasma Uric 3Cid concen ^ion returned to the normal 
nu Z ° t,0nS SUCh 38 clearance and BUN levels were restored 

although the recovery of tubulointerstitial injury was varied in sites of the kidney ' 
Baso ateral uptake of p-aminohippurate (PAH) and tetraethylammonium (TEA) and both 
protein and mRNA levels of rOATI , rOAT3 and rOCT2 in the kidney gradually 
JZwtdlh 7" g H d ? ' ° f reCOVeiy fr ° m hyperuricemia. Basolateral PAH transport 
?rSSi* COlT f latl0n with the P rotein l^el of rOATl (r(2)=0.80) than rOAT3 
m^^^!r^ h " Mp0rt Sh ° Wed a «>-^tion with rOCT2 
KSulS^?SS? P f °,f r ° ne concentration > whldi is a dominant factor in 
hut ZT T ? v. ' W3S gradUa " y rCSt0red durin g the recover y fr om hyperuricemia 
bu the correlauon between the plasma testosterone level and rOCTC protein expression 
m the kidney was not significant. These results suggest that the regulation of oSaTc ion ' 

JZT^r ^ "* ^ by is reversible, an EgaTc 

.on transport activity restores according to the expression levels of these transporters. 
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[Article in Chinese] 
GuoY,XieC. 

measured by ELISA mefhnH • k i ui 5 P n0rmal persons were 

^>ui^ uy ci^a method. Penpherai blood mononuclear cell rPRMn \a ir 
was determined bv flow- ovt nmotw, -n, - w»uoicax ceil (i «MC) MIF expression 

patients as compared with normal persons n> < n nn it, . / *VKi>2> 
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Protein abundancy and mRNA levels of the adipocy tc-type fatty acid 
binding protein correlate in non-invasive and invasive bladder transitional 
cell carcinomas. 

Gromova I , Gromov P . WolfjH^ Cgiisjnfr 

Department of Medical Biochemistry and Danish Centre for Human Genome Research 
I he University of Aarhus, Aarhus C, Denmark. 

The adipocyte type fatty acid-binding protein (A-FABP) is a small molecular weight fatty 
acid-binding protein whose expression correlates both with the grade of atypia and the 

sage of bladder transitional cell carcinomas (TCCs). To determine if the protein 
abundancy correlates with die mRNA levels in non-invasive and invasive lesions we 
have ana ysed fresh TCCs (grade II, Ta; grade II,, T2-4) by two-dimensional 
polyacrylamide gel electrophoresis (2D-PAGE) and measured the mRNA levels using the 
reverse transcription linked polymerase chain reaction (RT-PCR) Overall the results 
showed a good correlation between protein abundancy and mRNA levels, indicating that 
the lack of expression of the protein observed in some lesions reflects low levels of 
transection of the A-FABP gene rather than translation^ regulation. In addition our 
studies showed that the loss of A-FABP protein observed in some tumors is not ' 
compensated by an increase in the skin fatty acid-binding-protein PA-FABP as is the 
case in the A-FABP knockout mice. ' 
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Figure 3. Levels of A-FABP and PA-FABP protein in grade II; Ta tumors 
resected from the same patient. The two upper panels show the 2D gel 
autoradiograms of \ yy S}- methionine labeled proteins from TCCs (grade 11, 
Ta) resected from the same patient. A, TCC 192-9 tumor 4 and B„- TCC 
192-9 tumor I. Only the relevant area of the autoradiograms are shown. The 
low panel shows the RT-PCR analysis of A-FABP mRNA expression in the 
same tumors (see also legend to Fig. 2). 



which differ significantly in iheir levels of A-FABP protein 
and mRNA (Fig. 3, low panel). As shown in Fig. 3, the 
decrease in A-FABP observed in TCC 129-4, T, is not 
accompanied by an increase in the PA-FABP protein (Fig. IE). 

Discussion 

Of the TCC progression markers identified to dale, A-FABP 
is perhaps one of (he most interesting as its presence correlates 
both with the grade of alypia (p=0.0006) and the stage of the 
disease (p=0.0269) (3). A-FABP is a low molecular weight 
protein belonging to a cytosolic multigene family of lipid-, 
binding proteins that include heart, liver, intestinal, muscle, 
brain, skin and epithelial isoforms (20). Members of the 
FABP family are highly expressed in differentiated cells and 
show narrow tissue distribution. Their precise function is at 
present unknown, although there is evidence suggesting that 
they may play roles in intracellular lipid transporl and 
metabolism, signal transduction (21,22) as well as growth 
control and differentiation (23). The role in signal transduction 
has been inferred from the fact that long-chain fatty acids and 
their metabolites can act as primary and second messengers 
in specific signalling pathways (2a). Recenily, it has been 
shown lhat A-FABP may play a central rote in the pathway 
that links obesity with insulin resistance, most likely by 
connecting the fatly acid metabolism with ihe expression of 
TNF-a (18). Furthermore, there is evidence indicating lhat 
the A-FABP gene contains sequence information necessary 
for differentiation-dependcni expression in adipocytes (25). 
Our own data derived from the study of TCCs and normal 
uroiheliurn suggest that A- FA BP may be required for normal 
uroiheliurn differentiation (1), as may be (he case for PA- 
FABP in ihe skin ( 19) 



Considering Ihe potential prognostic. value : of A-FABP 
protein and/or mRNA in TCC progression it was important to 
determine if I be levels of both type of rnacrpmoleculcs 
correlated both in the non-invasive and the invasive lesions 
expressing and lacking; A-FABP- The need for such correlation 
was underlined by recent studies of Anderson and Seilhamer 
(15), who reported a lack of overall correlation between the 
mRNA and protein levels of. 45 rat proteins analysed by 2D 
PAGE in combination with cDNA arrays. Tfieiir data yielded a 
correlation coefficient of 0.45 which is half way between weak 
and perfect correlation. Gearly, our data showed a very*good 
correlation between ihe protein and mRNA levels of A-FABP 
in all tumors analysed indicating that the loss of A-FABP 
protein observed in some tumors is not due to posi- 
transcriptional regulation. 

Recenily, knockout mice carrying a null mutation in the 
aP2 gene encoding for A-FABP was produced (18). These 
animals do not show an obvious morphological or metabolic 
phenotype, but exhibit a 20- fold increase in the levels of the 
keralihocyte type FABP (mall\ which may compensate for 
the loss of the deleted gene (18). The human homologue of 
the mat J gene, PA-FABP, was cloned in our laboratory and 
has been shown lo be highly upregulated in psoriatic skin as 
well as in abnormally differentiated primary keratinocytes 
(19). PA-FABP is expressed in normal uroiheliurn together 
with A-FABP (3), and ongoing studies in the laboratory have 
shown lhat its level decreases significantly as rumors progress, 
interestingly, the studies reported in this article did not revealed 
a compensatory up- or down-regulation of PA-FABP in the 
TCCs analysed, supporting the contention that PA-FABP may 
also play a role in cell growth and differentiation (19). . 
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FiEure 1 IEF 2D gels of whole cellular extracts horn non-invasive and 
invasive TCCs. A. TCC 532- 1 ; B. TCC-692- 1 ; C TCC 763- 1 ; D. TCC 709- 1 
and E. TCC 711- 1. Only the relevant area of the gels are shown. 
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assessed by monitoring for ihe absence of vimentin 
(contamination with connective tissue) and desmin 
(contamination with smooth muscle cells), as well as 
polypeptide resolution. In addition, reasonable amounts of 
these tumors were available for mRNA preparation. 

Table I shows the levels of A-FABP proiein expression in 
the 10 tumors analysed by 2D PAGE. The data were scored 
entirely based on the visual analysis of Coomassie Brilliant 
Blue stained gels and represent an average estimate of at 
least two different runs. Tumors scored' as positive differed 
significantly with respect to the levels of this protein, and 
therefore are indicated with either four (very high), three 
(high), two (medium) and one cross (low). Representative 
examples of Coomassie stained 2D gels of tumors exhibiting 
very high (TCC 532-1, Fig. I A), high (TCC 692-1; Fig. IB), 
medium (TCC 763-1. Fig. 1C) and undetectable levels (TCC 
709-1 and TCC 711-1 Fig. ID E) of A-FABP are shown in 
Fig. 1 (only the relevant area of the gels arc shown). 

A-FABP mRNA levels in no n- invasive grade II, Ta TCCs: 
Since in many instances only a limited amount of fresh tumor 
was available, we used RT PCR to determine the levels of A- 
FABP mRNA in the ten TCCs analysed by 2D PACE (Fig. I). 
Following amplification, the PCR products were analysed by 
conventional 1.5% agmose gel electrophoresis and visualised 
by eihidium bromide staining as shown in Fig. 2. The amount 
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Figure 2. RT-PCR analysis of A-FABP mRNA expression in non-invasive 
(Gill. Ta) and invasive TCCs (Ci 111. T r TJi For RT-PCR analysis, the ss c- 
DNA was synthesized by Reverse Transcriptase using total RNA. and used 
for RT-PCR amplification. The PCR products were resolved on 1.5% 
agaiose gels and visualised under UV light following eihidium bromide 
staining. The A-FABP panels show the results of amplifications where the 
pair of gene specific primers was used to generate ihe 220 bp DNA 
fragment. Amplification of A-FABP was obtained after 30 cycles of POL 
The 6-aclin panels represent the amplification of the B-actin gene, which 
was used as an internal control to confirm that equal amounts of c-DNA were 
used in each reaction. ' = • 



of cDNA in each lane was normalised using several house- 
keeping genes so as to achieve a more accurate assessment of 
the expression of the A-FABP mRNA. As shown in Fig. 2, 
TCC 532- 1 exhibited the highest amount of A-FABP mRNA, 
followed by TCCs 692-1, 763-1. 616-1. 581-1. 154-1, 166-5 
and 533-1. Undetectable levels of A-FABP mRNA were 
observed in the case of TCCs 607-1 and 709-1 (Fig. 2). 
Relative mRNA levels for the ten TCCs are given in Table 1. 

A-FABP protein and mRNA levels in invasive grade 1U. 
TCCs. Of the invasive TCCs (grade 111, T 2 . 4 ) analysed by 2D 
PACE only six yielded reasonable proiein profiles for further 
study As shown in Table K none of these lesions expressed 
detectable levels of A-FABP as determined by Coomassie 
Brilliant Blue staimng (Fig. JE, TCC 711-1). In line with these 
results, the RT-PCR analysis of these tumors also revealed 
undetectable level of A-FABP mRNA (Fig. 2, Gr IB T 2 -I 4 , 
Table 1). 

Loss of A-FABP protein is no! compensated by on increase in 
PA-FABP Recent studies of A-FABP knockout mice have 
shown that the loss of A T FABP in fat tissue is "mpcnsaied 
by an increase in the skin fatly acid-binding protein malM W 
Our studies, however, indicated that the human homologue 
of malt PA-FABP (19). did not compensate for the loss ol 
A-FABP either in ihe non-invasive or the invasive tumors 
analysed in tins study (Fig. ID and E). In addition. F>*. -> 
shows 2D gels ol pSj-rnethionine labeled proteins from I wo 
orade 11. Ta TCCs < 192-4. T,; Fig. 3A and 192-4, T : : F,e. 3B). 



expression profiles of these lesions as fingerprints to define, 
their grade of atypia and eventually their stage (3,14). So far. 
more than 400 tumors of various grades and stages have been 
analysed by two-dimensional polyacrylamide gel 
electrophoresis (2D PAGE), and preliminary experiments 
have shown that even though the overall protein expression 
profiles of tumors of the same grade and stage are very similar, 
there are important differences suggesting that morpho- 
logically 'identical' TCCs may be further subdivided (I). Of 
the biomarkers of TCC progression identified so far, the 
adipocyte-type fatty acid binding protein (A-FABP) is 
perhaps one of the most interesting as the levels of this 
polypeptide have been shown to correlate both with the grade 
of aiypia as well as with the stage of the disease (3). Given 
the putative importance of A-FABP as a progression marker, 
and since Anderson and Seilheimer ( 15) recently showed that 
post-transcriptional regulation of gene expression is a frequent 
phenomena in higher organisms, we have compared the levels 
of A-FABP mRNA and protein in non-invasive and invasive 
bladder TCCs expressing and lacking this protein. 

Materials and methods 

Tumors. Fresh bladder tumors were obtained immediately 
after transurethral resection,. The grade and clinical stage of 
the tumors were determined by the pathologist at the Aarhus 
Municipal hospital. Clean tumors devoid of blood clots were 
divided into small pieces for 2D PAGE and DNA, and RNA 
preparation. The latter were immediately frozen in liquid 
nitrogen and store at -80* C until use. 

l n S]-methionine labeling and 2D-PAGE. In a few cases, 
small tumor pieces were labeled with p J S]-methionine as 
previously described (3). 2D- PAGE was performed 
according to published procedures (16; see also http: 
7/biobase.dk/cgi-bin/celis). 

RT-PCR. Frozen tumor samples were ground to powder in 
liquid nitrogen and total RNA was isolated using the acid 
guanidium isothiocyanaie/phenol chloroform extraction 
procedure (17). The samples were treated with RNase-free 
DNases I (Pharmacia) to eliminate contaminating genomic 
DNA using the protocols recommended by the supplier. 
Poly(A)* RNA was prepared using Poly (A) f Quick columns 
according to the manufacturer's instructions (Stratagene). The 
synthesis of cDNA for RT-PCR reactions were carried out 
using the Gibco BRJL Superscript Kit. Two pg of total RNAs 
was mixed with oligo -dTll primer. PCR buffer, MgCU 
(25 mM). 0.1 M DDT and 10 mM dNTP. The mixture was 
incubated at 42'C for 5 min followed by the addition of 
Superscript ]) reverse transcriptase and further incubation at 
42 r C for 50 min. The reaction was terminated by raising the 
temperature to 70"C for 15 min, followed by additional 
incubation at 37*C for 20 min in the presence of RNase H to 
deplete the RNA. Primers for known and cloned genes were 
purchase from Pharmacia as follows: A-FABP, Upper (from 
186-208 bp) 5'-GATCaTCAGTGTGAATGGGGAT-37 
lower (from 374-39-7 bp) 5- CATCCTCTCGTITTCTCTTT 
ATG-3"; 6-actin upper 5 -GAGGTTGGCTCTGACTGTACC 
AC-37lower 5 : -CTCA1TCaGCTCTCGGAACATCTCG-3\ 



Tabic I. Expression of A-FABP in non-invasive and invasive 
bladder TCCs: 



TCC 


Grade/ 


Level of 


Level of 




stage 


A-FABP protein 


A-FABP mRNA h 
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Grll/Ta 
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+i 


166-5 


Grll/Ta 




' -f ■ 


532-1 


Grll/Ta 


+++-* . 




533-1 


GrII/Ta 






607-1 


GrII/Ta 




- 


692-1 


GrII/Ta 


- +++ . 


+++ 


709-1 . 


Grll/Ta 


- 
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763-1 


GrII/Ta 




++ 




nrll/Tn 

vJl t Mi X *l 


T 
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616-1 


GrII/Ta 


+■ . 


++ 


428-5 


GrlII/T r T 4 






570-2 


GrIjI/r r T 4 






612-3 


Grlil/T 2 -T 4 






71M - 


Griiinyx, 






712-1 


GrIIl/T r T 4 






727-1 


GrIII/T 2 -T 4 







■The levels of a A-FABP protein were determined based on the 
visual analysis of Copmassie Brilliant Blue stained gels and 
represent the average estimate of at least two different runs. Tumors 
scored as positive differed significantly with respect to the levels of 
the protein, and therefore are indicated with either four (very high), 
three (High), two (medium) and one (low) cross (see also Fig. I); 
'The mRNA levels were determined based on the intensity of 
Ethidium Bromide stained cDNA bands separated on agarose gel 
using the Bto-Rad Gel Doc 1000 System and represent the average 
estimate of at least three independent experiments. Corresponding 
mRNA levels are represented by crosses as described above. 



PCR reactions were performed in a Progene thermal cycler 
using the Advantage Klen Tag Polimerase (Clontech). The 
cycling parameters consisted of 30 sec of denaturalion at 
94X, with annealing of 30 sec at 60*C for B-actin or at 
64"C for A-FABP. The extension lime was for 2 min at 68"C 
for 29-40 cycles with the final extension of 7 min at 68*C. The 
PCR products were separated on 1.5% agarose gel 
electrophoresis followed by ethidium bromide staining and 
photography under UV light. 

Results 

A-FABP protein levels in non-invasive and invasive TCCs. 
One hundred suspected TCCs removed at the Department of 
Urology, Skejby Hospital, were analysed by high resolution 
2D PAGE and Coomassie Brilliant Blue staining. Of these, 
10 grade Jl. Ta TCCs (Table 1) were chosen to correlate A- 
PABP protein and mRNA levels as these lesions yielded 
acceptable protein profiles both in terms of their purity, as 
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Abstract. The adipocyte type fatty acid-binding protein (A- 
FABP) is a small moleqular weight fatty acid-binding protein 
whose expression correlates both with the grade of atypia and 
the stage of bladder transitional cell carcinomas (TCCs). To 
determine if the protein abundancy correlates with the 
mRNA levels in non-invasive and invasive lesions, we have 
analysed fresh TCCs (grade II, Ta; grade III. J 2 J by two- 
dimensional polyacrylamide gel electrophoresis (2D-PAGE) 
and measured the mRNA levels using the reverse transcription 
linked polymerase chain reaction (RT-PCR). Overall, the 
results showed a good correlation between protein abundancy 
and mRNA levels, indicating that the lack of expression of 
the protein observed in some lesions reflects low levels of 
transcription of the A-FABP gene rather than translational 
regulation- In addition, our studies showed thai the loss of A- 
FABP protein observed in some tumors is not compensated 
by an increase in the skin fatty acid-binding protein PA- 
FABP. as is the case in the A-FABP knockout mice. 

Introduction . 

Bladder cancer accounts for 4.7% of nil human cancers 
diagnosed. The spectrum of bladder tumors is broad with 
various histological types that include transitional cell 
carcinomas (TCCs), squamous cell carcinomas (SCCs), and 
adenocarcinomas (1-3). TCCs are by far the most prevalent 
tumors as they represent nearly 95% of all bladder cancers m 
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the Western Hemisphere. At first presentation, about 70% of 
the urinary bladder TCCs are diagnosed as differenliaied 
superficial lesions that are confined either to the mucosa 
(Ta), or to the underlying connective tissue (Tj). the rest 
correspond to highly invasive, poorly differentiated tumors. 

Non-invasive TCCs occur as two distinct growth patterns, 
papillary and non-papillary (Oat) lesions (1,2); that display 
significant differences in their malignant- potential and. that 
are believed to originate from different genetic alterations 
(4-6). Papillary carcinomas usually correspond to low-grade 
lesions which frequently recur multiple times. These tumors 
begin as areas of hyperplasia thai later undergo a process of 
dedifferentiation (grades 1-1 V). Invasive tumors may arise 
from these lesions, but often are derived from non-papillary 
carcinoma in situ that usually is of high grade at presentation - 
and tend to invade and progress to muscle invasion and 
metastatic disease. . 

To date, many attempts have been made to pinpoint genetic 
changes that underly progression of bladder cancer as well as 
io identify molecular markers that correlate with tumor 
progression. Cytogenetic studies and molecular genetic data 
have shown that chromosomes 3p, 4p. 4q, 5q. 8p. 9p, 9q, 
Hp. I3q, !4q, I7p and 18q are frequently altered in bladder 
urothelial tumors (4,5 and refs. therein), and as a whole they 
have supported the notion that bladder cancer is a mulustep 
process. Recently, Spruck et at (6) showed that chromosome 
9 alterations occur early during development, while p53 
mutations appear later in the process and confer invasive 
properties. The situation however is reverse in the case of 
Cis, as a large fraction of these lesions contain p53 mutations 
(5,6,8,9). Besides pointing towards two divergent pathways of 
bladder tumor progression, these studies implied that the order 
in which the genetic changes occur is important in determining 
the outcome of the lesion. 

Assessment of bladder cancer is based on a thorough 
pathological examination of biopsy material which establishes 
the histological type of the tumor, its degree of differenuauon 
(o r ade), and depth of invasion of the bladder wall (staging) 
00-12). In spite of strict criteria for the pathological assessment 
of these lesions, there exist a significant inter-pathologist 
variation a fact thai emphasises the need for objective 
markers that may aid their class.ficalion. With this .n mind 
we are exploring the possibility of using proieome (13) 
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. Figure 6—3 Genes can be expressed 
with different efficiencies. Gene A is • 
transcribed and translated much more 
efficiently than gene B.Thts allows the 
amount of protein A in the cell to be 
much greater than that of protein B. 



®®®®® 



FROM D'NATO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many ideniica) protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 

Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in rending out a needed pan of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
KNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by pbosphodiester bonds (Figure 6-<l). It differs from 
DNA chemically in two respects: (J) the nucleotides • in RNA are 
ribonucleotides— that is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (Q, it contains. the base uracil |U) 
instead of the thymine (T) in DNA. Since U, likeT, can base-pair by hydrogen 
bonding with A (Figure 6-5). the complementary base- pairing properties 
described for DNA in Chapters *J and 5 ripply also to RNA (irr RNA, (J pahs -with 
C, and A pairs with U). It is not uncommon, however, to find other types ol base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up info a 
variety of shapes, just as a polypeptide chain folds up to form ihe final shape of 
a protein (Figure 6-6). As we see later in ihis chapter, the ability to fold info com- 
plex tbree : dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process ol DNA replication discussed in Chapter 5. 
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Figure 6-90 The production of a 
protein by a eucaryotic cell. The final 
level 6f each protein in a eucaryotic cell 
depends upon the efficiency of each step 
depicted 
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lire 6-90) could be regulated by the cell for each in i livid ual protein. However, as 
we shut) .see in Chapter 7, (he initiation t>f transcription is the most common 
point for a cell to regulate (he expression oJ each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block (he very firs l step — (he transcription of its UNA sequence into an RNA 
molecule. 

Summary 

The tianslation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonncleoprotein assembly called n rihosome. The 
amino acids used for protein synthesis are first attached to a family of t UNA 
molecules, each of which recognizes, by complementary base- pair interactions, par- 
ticular sets of three nucleotides in thenrUNA (cottons). The sequence of nucleotides in 
the ntJRNA is then read from one etui to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal snbnnit binds to the in UNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tttNA molecule. A 
large r i boson in t snbunit hi nils to complete the ribosome ami begin the elongation 
phase of protein synthesis. During this phase, nminoneyl t ftXAs—raeh bearing a 
specific amino acid bind sequentially to the appropriate cotton in mllNA by f aiming 
complementary base pairs with the tRNA anticodon. T.ach amino acid is added to the 
C- terminal rml of the growing polypeptide by means of it Cycle of three seqttentiftl 
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be 
controlled. Controls that operate at 
steps I through 5 are discussed in this 
chapter. Step 6, the regulation of protein 
activity, includes reversible activation or 
inactivation by protein phosphorylation 
(discussed in Chapter 3) as well as 
irreversible inac Ovation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on ihc partic- 
ular genes that 1 he cells express, ai what level is the control of gene expression 
exercised? As we saw in the last chapter, 'there are many steps in the pathway 
leading from DNA to protein, and all ol them can in principle be regulated. Thus 
3 cell can control the proteins it mokes by (1) controlling when and. how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
iranscript is spliced or otherwise processed (RNA processing control), Kt) 
selecting which completed mKNAs in the cell nucleus are exportetl to the cylosol 
and determining where in the cylosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (Iranslational control). (5) 'selectively destabilizing certain mRNA 
molecules in the cytoplasm fmRNA degradation control), or (G) selectively acti- 
vating,, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (figure 7-5). 

For most genes transcriptional controls ore paramount. Ibis makes sense 
because, of all the possible control points illustrated in f'igure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous, inter me' 
cliates. In the following sections we discuss 1 the DNA and protein components 
itiat perform this function by regulating rhe initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
repression. 



nummary 

the genome of a cell contains in itsDNA sequence the information to make tunny 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction of its genes, and the different types of cells in mnlticellnla r orgtinisms arise, 
h-icause different sets of genes are expressed. Moreover, cells can change the pattern 
.-.'/ genes they express in response to changes in their environment, such os sigmds 
>?m other cells. Although all of the steps involved in expressing a gene can in prin- 
-.-.pie be regulated, for most genes the initiation of RNA transcription is the most 
-port ant point of control. 



'tow does a cell deirrmine which of its thousands ol genes to transcribe':' As 
>! --ntioned briefly in Chapters -t and (>. the nanseiiprion of each gene i.s con 
i. ■i)ed by a regulatory legion of DNA relatively near die the where nanso ipiion 
' *j. ; »s- -Some icgulatojy regions ate simple and act as switches that are thrown 
'•■ single signal. Many otheis are complex and :m,i as tiny microprocessors. 
: ponding to a variety of signals thai du-y inter ptet and integrate to switch the 
'i' :;.-hboring gene on or off. Whether cornpl*.w or simple, these swiiehini- devices 
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Selective apoptosis of natural killer-cell tumours by 1-asparaginase. 

Ando M , Sugimoto K , IGtoh T , Sasaki M , Mukai K , Ando J , Egashira M , Schuster 
SM , Oshimi K . 

Department of Haematology, Junteudo University School of Medicine, Tokyo, Japan. 

We examined the effectiveness of various anti-tumour agents to natural killer (NK)-cell 
tumour cell lines and samples, which are generally resistant to chemotherapy, using flow 
cytometric terminal deoxynucleotidyl transferase-medialed dUTP-biotin nick end- 
labelling (TUNEL) assay. Although NK-YS and NK-92 were highly resistant to various 
anti-tumour agents, 1-asparaginase induced apoptosis in these two NBC-cell lines. NK-cell 
leukaemia/lymphoma and acute lymphoblastic leukaemia (ALL) samples, were . 
selectively sensitive to 1-asparaginase and to doxorubicin (DXR) respectively. Samples of 
chronic NK lymphocytosis, an NK-cell disorder with an indolent clinical course, were . 
resistant to both drugs. Our study clearly separated two major categories of NK-cell 
. disorders and ALL according to the sensitivity to DXR and 1-asparaginase. We examined 
asparagine synthetase levels by real-time quantitative polymerase chain reaction (RQ- 
PCR) and immimostaining in these samples. At least in nasal-type NK-cell lymphoma, 
there was a good correlation among asparagine synthetase expression, in vitro sensitivity 
and clinical response to 1-asparaginase. In aggressive NK-cell leukaemia, although 
asparagine synthetase expression was high at both mRNA and protein levels, l- 
asparaginase induced considerable apoptosis. Furthermore, samples of each disease entity 
occupied a distinct area in two-dimensional plotting with asparagine synthetase mRNA 
level (RQ-PCR) and in vitrol-asparaginase sensitivity (TUNEL assay). We confirmed 
rather specific anti-tumour activity of 1-asparaginase against NK-cell tumours in vitro, 
which provides an experimental background to the clinical use of 1-asparaginase for NK- 
cell tumours. 
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Human thyroid carcinoma cell lines and normal thyrocytes: expression 
and regulation of ma trix metaUoproteinase-1 and tissue matrix 
metaUoproteinase inhibitor-1 messenger-RNA and protein. 

Aust G, Hofmann A, Laue S, Rosf A . Kohler T, Scherbaum WA . 

Institut of Anatomy, University of Leipzig, Germany. 

Matrix metaUoproteinase- 1 (MMP-1) and tissue matrix metaUoproteinase inhibitor 1 
(TIMP-1) play an important role in remodeling the extracellular matrix in normal and 
pathological processes. The effect of phorbol-myristate acetate (PMA), interleukin-1 (1L- 
. 1), and tumor necrosis factor-alpha (TNF-alpha) on MMP-1 and TIMP-1 expression was 
studied on highly purified thyrocytes and undifferentiated 8505 C, C 643, HTh 74, SW 
1736 thyroid carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA 
(mRNA) levels were monitored by competitive semiquantitative reverse trail scriptase 
polymerase chain reaction (RT-PCR) after 24 hours. Culture supernatants were assayed 
for free and/or complexed MMP-1 and TIMP-1 after 48 hours using enzyme-linked 
immunosorbent assay (ELI S A) systems (detection limit: <2 ng/mL). MMP-1 andTlMP-1 
mRNA were present in all cell types, although thyrocytes showed MMP-1 mRNA levels 
near the detection limit. 8505 C expressed MMP-1 mRNA levels of up to 10(6) times 
those of the other cells analyzed. PMA and IL-1 increased MMP-1 mRNA in most cell . 
types. TLMP-1 mRNA increased after treatment with PMA in all cells except 8505 C, 
whereas only slight effects were shown after IL-1 stimulation. MMP-1 protein was- 
undetectable in normal thyrocyte cultures, but was secreted spontaneously by all cell 
lines ([ng/mL]; C 643: 15+/-7; HTh 74: 81+/-1; SW 1736; 13+/-2; 8.505 C: 2097+/-320). 
There was a strong correlation between levels of MMP-1 mRNA and protein (r = 0.99, p 
< .0001). PMA and IL-1 increased MMP-1 secretion in all cell types after 48 hours. 
Fibroblasts ([ng/mL] 517+/-55) and the cell lines (C 643: 142+/-48; HTh 74: 1 15+/-13; 
SW 1736: 202+/-14; 8505C: 120+/-19) secreted TIMP-1 in imstimulated cultures, 
whereas only a trace amount was detected in thyrocyte cultures, even after PMA 
treatment. IL-1 upregulated TIMP-I secretion after 48 hours in SW 1736, HTh 74, and C 
643 cells. Our data suggest that in contrast to normal thyrocytes, dedifferentiated thyroid 
carcinoma cell lines are potential producers of MMP-1 as well as TIMP-1. High MMP-1 
or MMP-I/TJMP-] expression may play a role in tissue invasion of undifferentiated 
thyroid cancer cells. 
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ABSTRACT 

Matrix metailoproteinasc-1 (MMP-1) and tissue matrix metalloproteinase inhibitor 1 (T1MP-1) play an impor- 
tant role in remodeling the extracellular matrix in normal and pathological processes. The effect of phorbol-myris- 
tate acetate (PMA), interleukin-1 (IL-1), and tumor necrosis factor-o* (TNF-or) on MMP-1 and TIMP-1 expres- 
sion was studied on highly purified thyrocytes and undifferentiated 8505 C, C 643, HTh 74, SW 1736 thyToid 
carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA (mRNA) levels were monitored by 
competitive semiquantitative reverse transcriptase polymerase chain reaction (RT-PCR) after 24 hours. Culture 
supernatants were assayed for free and/or complexed MMP-1 and T7MP-1 after 48 hours using emyme-linked 
immunosorbent assay (ELISA) systems (detection limit: <2 ng/mL). MMP-1 and T1MP-1 mRNA were present in 
all cell types, although thyrocytes showed MMP-1 mRNA levels near the detection limit. 8505 C expressed MMP- 
1 mRNA levels of up to 10 6 rimes those of the other cells analyzed. PMA and. IL-1 increased MMP-1 mRNA in 
most cell types. T1MP-1 mRNA increased after treatment with PMA in all cells except 8505 C, whereas only 
slight effects were shown after IL-1 stimulation. MMP-1 protein was undetectable in normal thyrocyte cultures, 
but was secreted spontaneously by all cell lines ([ng/mL]; C 643: 15 ± 7; HTh 74: 81 +.1; SW 1736: 13 ± 2; 
8505C: 2097 ± 320). There was a strong correlation between levels of MMP-1 mRNA and protein (r= 0.9SV 
p < .0001). PMA and IL-1 increased MMP-1 secretion in all cell types after 48 hours. Fibroblasts ([ng/mL] 517 ± 
55) and the cell lines (C 643: 142 ± 48; HTh 74: 115 ± 13; SW 1736: 202 ± 14; 8505 C: 120 ± 19) secreted 
ITMFrl in unstimulated cultures, whereas only a trace amount was detected in thyrocyte cultures, even after PMA 
treatment. IL-1 upregulated T1MP-1 secretion after 48 hours in SW 1736, HTh 74, and C 643 cells. Our data 
suggest that in contrast to normal thyrocytes, dedifferentiated thyroid carcinoma cell lines are potential produc- 
ers of MMP-1 as well as T1MP-1. High MMP-1 or MMP-1/T1MP-1 expression may play a role in tissue inva- 
sion of undifferentiated thyroid cancer cells. 



INTRODUCTION suppressed on external stimulation, ie, with cytokines, 

Mphorbol 12-myristate 13-acetate (PMA), lipopolysaccaride 

ATRK mftalloprotefnasev (MMPs) constitute a (LPS), or rctinoic acid (2,3). After secretion at post-tran- 

m r of slruclma,| y relatcd Proteolytic enzymes re- scriptiona) level, latent MMP proenzymes are regulated by 

sponsible for the proteolytic degradation of extracellular proteolytic activation and interaction with tissue inhibitors 

matrix (ECM) components. They are important partici- of matrix metalloproteinase (TIMPs), their specific in- 

pants in norma) tissue remodeling and contribute to rhe hibitors. Any imbalance berween the proteolytic MMPs ac- 

phenorypc of several pathological conditions that are as^ nvhies and the TlMPs that could be influenced and caused 

soaated with progressive ECM degradation. MMPs are by cytokines could potentially lead to pathological condi- 

highly regulated at different levels (!). At the transcrip- tions (4). 

nonal level, MMP expression can be directly induced or MMP-1, although known as an interstitial collagenase, 
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ls the only enzyme active at neutral pH that can degrade 
extracellular fibers comprised of collagen types I, II, and 
III- With this initial step, MMP-1 provides the cleavage 
products to other collagenase types (5). The major specific 
inhibitor of MMP-1 is TIMP-1, a 2 8.5-kd glycoprotein, 
which forms 1:1 stoichiometric complexes with the pro- 
lease (6). Cytokines and growth factors have been shown 
to regulate the expression of both MMP-1 "and TIMP-1 
. U>7,8). 

Although the participation of MMP-1 as the initial col- 
lagenase in tissue breakdown during tumor development is 
well documented (9-11), only one study has described the 
expression (12) but no study has as yet investigated the reg-: 
ulation of this enzyme in different thyroid tumors. Few stud- 
ies have been published investigating the role of other 
MMPs in normal and pathological thyroid tissue by in situ 
hybridization and immunohistochemistry (13-16). Fur- 
thermore, tissue remodeling includes both the action of 
MMPs and their inhibitors; thus, these enzymes could be 
involved in autoimmune and other nonauroimmune thyroid 
diseases during morphological changes (17,18). It is still un- 
known whether or not thyrocytes are able to express MMPs 
and TIMPs. Although rype IV collagenases (MMP-2 and 
MMP-9) were detected in various human epithelial cells of 
different tissue origin (19,20), only one study described the 
secretion of MMP-1 by epithelial cells (21). 

Highly purified normal thyrocytes and four thyroid car- 
cinoma cell lines were included in this study to investigate 
the involvement of these cells in MMP-1 and TIMP-1 pro- 
duction during thyroid tissue remodeling processes and in 
malignant thyroid neoplasms! MMP-1 and TIMP-1 ex- 
pression were studied at both the mRNA and protein level 
by semiquantitative RT-PCR and ELISA measurement, re- 
spectively. 

In unstimulated carcinoma cell lines both MMP-1 and 
TIMP-1 mRNA were expressed, partly at a high level, fol- 
lowed by the spontaneous secretion of the proteins. The 
various conditions for the stimulation of the different cell 
lines by cytokines and PMA were defined. In contrast to 
the cell lines, normal thyrocytes did not secrete MMP-1 
and only trace amount of TIMP-1, even after stimulation 
with PMA. 



MATERIALS AND METHODS 

Preparation of tissues, thyroid-derived cells, and 
cell lines 

Thyrocytes were prepared from surgical thyroid speci- 
mens from 3 patients (1 Graves* disease, 2 nontoxic goi- 
ter; mean age 54.3 ± 5.0 years). Fibroblasts were separated 
from thyroid tissue of 5 other patients (3 Graves' disease, 
2 nontoxic goiter; mean age 43.6 ± 6.4 years). Graves 1 dis- 
ease and nontoxic goiter were diagnosed on the strength 
of clinical, biochemical, and immunologic features as well 
as thyroid scintiscans. 

Thyroid tissue was trimmed of fat 3nd connective tissue 
immediately after surgery. Thyroidrderived cells were en- 
riched aircr gradual enzymatic digestion of tissue and cul- 
tured over a period of 16 hours as described. Thyrocytes 
were obtained from the adherent fraction by incubating 



the cell monolayer with phosphate buffered saline (PBS) 
without Ca 2 ~/Mg 2+ for 45 minutes (22). Residual fibro- 
blasts were removed after subsequent incubation of the 
cells with the fibroblast-specific mab FibASOl (22> and 
goat-anti-mouse IgG-DYNABEADS® M450 (DYNAL, 
Hamburg, Germany) according to the manufacturer's pro- 
tocol.. 

Thyroid-derived fibroblasts were obtained after culturing 
small pieces of thyroid tissue in Dulbeccos's Modified Ea- 
gle's Medium (DMEM) with 10% fetal calf serum (FCS) 
and harvested in the 5th to 7th passage. The purity of the 
thyrocytes and fibroblasts was determined by using indirect 
immunofluorescence technique on a FACS-Scan (Becton 
Dickinson GmbH, Heidelberg, Germany) as described {22), 

The following human anaplastic thyroid carcinoma cell 
lines were cultured in DMEM with 10% FCS: C 643 (23); 
SW 1736 (23); and HTh 74 (24). The cell line 8505 C (25) 
was purchased from the German Collection of Microorr. 
ganisms and Animal Cell Cultures (DSM ACC219). This 
cell line was established from a primary thyroid tumor 
characterized histologically as a undifferentiated carcinoma 
that was partially composed of poorly differentiated pap- 
illary cells (25). This is a feature of a subgroup of anaplas- 
tic carcinoma (26). The majority of these coexistent better 
differentiated carcinoma foci in anaplastic carcinoma were 
papillary (26); . 

In vitro cultures 

Using 24- well plates, 1 X 10 3 cells were cultured for 24 
hours. The medium was aspirated and replaced with 500 
ftL GPT1-MEM (G1BCO BRL, Grand Island, NY) without 
FCS to eliminate possible stimulation of MMP-1 and 
TIMP-1 production by FCS. The medium contained the 
desired concentration of human IL-la (10 U/m); Pepro 
Tech EC Ltd., London, UK), TNF-ot (100 U/mL; Pepro 
Tech EC Ltd.), inrerferon-y (IFN-?) (500 U/mL; Pepro 
Tech EC Ltd.), or 10 ng/mL PMA (SIGMAK 

Triplicate cultures of each stimulator were analyzed af- 
ter 3, 6, and 24 hours at the mRNA and after 24 and 48 
hours at the protein level. The supcrnatants were removed 
and stored at -80°C for further use, First,.a collagenolytic 
assay based on the digestion of type 1 collagen was per- 
formed. This method showed direct evidence of free pro- 
MMJM enzyme in the cell culture supernatants of un- 
stimulated and IL-la stimulated 8505 C, HTh 74, and 
C634 cells (data not shown). However, the method does 
not allow quantitation of MMP-1 emyme activiry. Thus, 
ihe cell culture supernatants were assayed for MMP-1, 
TIMP-1, and MMPl/HMP-l complex by EL1SA (Amer- 
sham Life Sciences, Braunschweig, Germany). The MMP- 
1 assay (sensitivity: 1.7 ng/mL) delected only total human 
MMP-1, ie, free MMP-1 and MMP-I completed with in- 
hibitors such as TJMP-1. It did not delect MMP-1 bound 
by the nonspecific protease inhibitor a 2 -macroglobulin. 
The MMP-1/T1MP-1 assay (sensitivity: 1.5 ng/mL) de- 
tected MMP-l/TJMP-1 complex, ie, .activated MMP-1 that 
has been subsequently complexed with the specific MMP- 
1 inhibitor TIMP-1. It did not detect free actrve MMP-1, 
free TIMP-1, or pro-MMP-L There was no cross-ieactiv- 
iry with active MMP-1 bound by the nonspecific protease 
inhibitor a^-macroglobulin. The TIMP-1 assay (sensitivity: 



MMP-1 AND TTMP-1 

1.25 ng/mL) detected total human TIMP-1, ie, free TIMP- 
1 and that completed with MMPs. The assay did fully 
cross-react with TIMP-1 in complexes with other MMP. 
It did not cross-react with TIMP-2. 

RNA extraction and cDNA synthesis 

For gene expression studies, 5 mL RNAzol™ B (Biotex 
Laboratories Inc., Houston, TX) was added to the cell cul- 
ture wells. The content of three wells of any cell type was 
pooled and then stored frozen for further mRNA analysis 
in liquid nitrogen. Total cellular RNA (cDNA) was iso- 
lated from the probes according to the manufacturer's pro- 
tocol. RNA was fractionated on a denaturing 1.0% agarose 
gel and stained with ethidium bromide to confirm that 
spectrophotometric measurements were accurate arid that 
the RNA had not been degraded. Five micrograms total 
RNA was reverse-transcribed to cDNA using the First- 
strand cDNA synthesis kit of Pharmacia (Uppsala, Swe- 
den), in a total reaction volume of 15 /*L. 

mRNA analysis by competitive RT-PCR 

To correct for variations across different cDNA prepa- 
rations; all samples were first adjusted to contain equal in- 
put glyceralderyde-3-phosphate dehydrogenase (GAPDH) 
cDNA concentrations. Semi-quantitative GAPDH RT-PCR 
was used with a heterologeous synthetic competitor frag- 
ment. The generation of the specific PCR products from 
the competitor and the cDNA with the GAPDH primers 
were published earlier (22,27). 

We then estimated the MMP-1 and TLMP-1 cDNA in 
these adjusted samples. The primers were selected using the 
DNAsis computer program (Hitachi Software Engineering 
Co, Yokohama, Japan). The primer pairs span one or more 
introns to allow unambiguous discrimination between 
cDNA and unwelcome contaminating genomic DNA. In 
quantitating MMP-1 and TIMP-1 cDNA, a rapid one-step 
method was introduced to synthesize an internal homolo- 
geous competitor (plan diagram of procedure: Fig. 1, ex- 
emplary for MMP-1 (28). A hybrid primer was synthesized 
(MMP-1 hy) that consisted of two segments (segj, scg2). h 



MMPtl 



5' 



seg, 



seg 7 . 



PCR 



478 bp 



MMPlhy . MMPlr 

: 82 bp 



MMPl internal .homologeous compelitor 



FIG. 1. General scheme for generating homologous com- 
petitors used for quantitative PCR. 
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had a length of 40 nucleotides, in which 20 nucleotides (seg,) 
at the 3' end corresponded to the opposite strand of the tar- 
get sequence a predetermined distance from primer MMP- 
lf; and 20 nucleotides at the 5' end (seg2 = MMP-lr) that 
corresponded to the target sequence upstream from the seg- 
ment seg,. Amplification with the primers MMP-lf and 
MMP-lhy from the cDNA resulted in a 478-base pair (bp) 
(polymerase chain reaction (PCR) product. It was freed from 
excess primers and deoxynucleoside-triphosphates (dNTPs) 
using the Qiaquick Gel Extraction Kit (Qiagen GmbH, 
Hilden, Germany) and quantified. A known number of 
copies of the competitor was introduced in the GAPDH-ad- 
justed samples and amplified with the primers MMP-lf and 
MMP-lr. With this approach, two products were generated, 
one derived from the cDNA {560 bp) and another, 82 bp 
smaller in size derived from the internal competitor (Pig. 1). 
PCR. products were resolved by gel electrophoresis (1.5% 
agarose gel). The relative amounts of sample cDNA and 
competitor were quantified by measuring the intensity of 
ethidium fluorescence with a CCD image sensor and ana- 
lyzing the data with the EASY program (Herolab, Wiesloch, 
Germany). The initial amounts of sample cDNA and com- 
petitor were assumed to be equal in those reactions where 
the ratio of the two products was judged to be equal. This 
was expressed in arbitrary units (AU) (22,29). One AU was 
defined as the lowest concentration of competitor yielding 
a detectable amplification product when added to PCR 
alone. For example, if equivalence between sample cDNA 
and competitor was reached using a 100-fold concentrated 
competitor the relative sample cDNA concentration was 100 
AU. Thyrocytes and the cell lines were analyzed for the ex- 
pression of thyroid-specific and cytokine receptor mRNAs 
in a simple RT-PCR. The sequences of the TPO and cy- 
tokine receptor primer pairs have been published by Wat- 
son et al. (30) and Tada et al. (31) and gave the following 
product sizes: TPO: 506 bp; 1L-1R type I (p80):300 bp; 1L- 
1R type D ( P 68): 392 bp; TNF-oR (p75): 324 bp; TNF-aR 
(p55): 587 bp and IFN-yR: %99 bp. The thyroglobulin (Tg) 
and thyroid stimulating hormone receptor (TSH-R) primer 
pairs were selected according to the published sequences us- 
ing the DNAsis program (Table 1). 

Each 25-/iL amplification reaction contained 2.5 pL 
10 X concentrated PCR buffer (15 mM MgCI 2 , Boehringer 
Mannheim, Germany), 03 U Taq DNA polymerase 
(Boehringer Mannheim, Germany), 100 jiM dNTPs 
(Perkin Elmer, Weiterstadt, Germany), 0.1 }iM of each 
primer (1MB, Jena, Germany), and 1 /xL cDNA and com- 
petitor in adjusted dilution. Furthermore, restriction map- 
ping (restriction enzymes: Boehringer Mannheim GmbH, 
Germany) was carried out to confirm the originality of the 
PCR product (Fig. 1, Table 1). 

Statistics 

Protein levels of thyrocyte or fibroblast cultures from the 
different patients and of the thyroid carcinoma cell lines ob- 
tained from three separate experiments were presented as 
mean ± SEM values. Statistical comparisons between un- 
stimulated and stimulated cell cultures were performed by 
the alternate (Welch) t-test. The correlation between basal 
mRNA levels and the unstimulated protein secretion in all 
cell types was calculated according to the Spearman method. 
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Table J. Primers, Length of Amplified Templates, Restriction Mapping and Assay Conditions for RT-PCR 
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cONA (bp) 


. competitor (bp) . 
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Tg 


forward 


GCACATCTTACTGAGTGGCT 




4)6 






35 


reverse 


TGTCAGCACAGTGGCAATAC 












TSH-R 


forward 


ACTTGCTGCAGCTGGTGCT 




354 




65 


35 


" exons 1-4 


reverse 


TGAGGGCATCAGGGTCTATG 












TSH-R 


forward 


GAAATTCGGAATACCAGGAACTTA ACT 


m 




:,- 53 . ... 


35 


exons 4-10 


reverse 


AACTCATCGGACiTGGGGGTACA 












MMP-1 


forward 
reverse - 


TGGGAGCAAACACATCTGAC 
ATCACITCTCCGCGAATCGT 




560 


478 - 


64 ... 


33 




hybrid 


ATCACTTCTCCCCGAATCGT 














CCATATATGGGTTCjGATGCC 












TIMP-1 


forward 
reverse 
hybrid 


CTTAGGGGATGCCGCTGACA 
GGCAGGCAGGCAAGGTGACG 
GGCAGGCAGGCAAGGTGACG 
GGATGGATAAACAGGGAAAC 




351 


* 274. 


64 


30 



Tg indicates ihyroglobuline; TSH-R, thyroid stimulating hormone receptor, MMP-1, matrix rnetalloproitinase-I; TIMP-I, tissue inhibitor of 
mcla!loprottin25e-l; bp, base pair. 



RESULTS 

Thyroid specific and. cytokine receptor 
mRNA expression 

Isolated thyrocytes as well as 8505 C cells expressed Tg 
and thy roper oxidase (TPO) mRNA, whereas transcripts of 
the TSH-R (exons 1-4, 354 bp, exons 4-10, $96 bp) were 
present only in the thyrocytes. The three* anaplastic thy- 
roid carcinoma cell lines SW 1736\ C 634, and HTh 74 
were completely negative for the Tg, TPO, and TSH-R rnR- 
NAs (Fig. 2). All cell lines 3nd thyrocytes expressed 1L-1R 
{type I and. type II), TNF-aR (p75 and p55) and IFN-yR 
mRNA (Fig. 2). 

Basal MMP-1 and TIMP-1 niRNA and 
protein expression 

In most stimulation experiments, mRNA levels did not 
increase until 24 hours of incubation. The 24-hour mRNA 
levels are shown in Figures 3 and 4. The 3- and 6-hour 
levels are demonstrated in those experiments where the 
mRNA levels reached their peak before 24 hours of stim- 
ulation. If not otherwise indicated, the MMP-1 levels were 
measured using the ELISA system, which recognizes 
free/complexed MMP-1. 

MMP-1 and TIMP-1 mRNA were found during un- 
stimulated culture in at! investigated cell types, although 
the mRNA levels varied over a gre3t range. 8505 C showed 
a basal MMP-1 mRNA level 20 times as high as those of 
the HTh 74 cells, 6 X JO 4 times as high as C 643, and 2 X 
10 6 times as high 3S SW 1736 cells. In thyrocytes, MMP- 
1 mRNA levels were found near the detection limit (Figs. 
3 and 4). 

Generally, when analyzing the noted cell types, the mea- 
sured basal MMP-J or TIMP-1 mRNA levels correlated 
well with the basal protein expression (MMP-1: r = 0.99 y 
p < ,0001; TIMP-1: r = 0.98, p < .002). Corresponding to 
the high MMP-1 mRNA level, 8505 C cells secreted ex- 
tremely high levels of MMP-J. No MMP-J or T1MP- 1 was 



detected in unstimulated 'thyrocyte cultures, at any time- 
point examined. All other cell types showed a spontaneous 
MMP-1 and TIMP-1 secretion (Figs. 5 and 6). thyroid- 
derived fibroblasts produced, basal TIMP-1 levels of up to 
4 times higher in the four carcinoma cell lines, which se- 
creted nearly the same amounts of basal TIMP-1 protein. 
Nevertheless, TIMP-1 secretion of fibroblasts was found 
at lower levels than expected after TIMP-1 mRNA mea- 
surement in 4 of 5 analyzed patients. The results of the fi- 
broblast cultures from patient five showing a higher T1MP- 
1 expression than those from the 4 other patients (basal 
24 hour: 50 ± 2; PMA 24 hour! 90 ± 6 ng/mL TIMP-1) 
was omitted in Figure 6. 

Comparing the basal amount of free/complexed and 
TIMP-1 complexed MMP-1 after 24 hours of stimulation, 
a significant level of MMP-1 was not complexed with 
TIMP-1 in 8505 C cultures, whereas in fibroblast cultures 
most of the MMP-1 activity was inhibited by TIMP-1. The 
anaplastic carcinoma cell line HTh 74 did not show such 
a great discrepancy between free/complexed and TIMP-1 
complexed MMP-1 level as 8505 C cells (Fig. 7). 

Effects of IL-la on MMP-1 and TIMP-1 mRNA and 
protein expression 

Experiments were performed to determine whether hu- 
man thyroid epithelial cells and thyroid carcinoma cell lines 
could produce or increase basal MMP-1 and TIMP-1 se- 
cretion after exposure to various stimuli. The results from 
these stimulation experiments are summarized in Figures 
3 through 5. Generally, there was a delay in protein se- 
cretion level in comparison to the mRNA expression level.. 
At the protein level, the cytokine-mediated stimulating or 
inhibiting effect is more distinct after 48 hours compared 
with 24 hours, even when the mRNA level had already de- 
creased after 6 hours. 

IL-J upregulated MMP-1 mRNA in SW 1736 cells up 
to 100 times and, in thyioid-derived fibroblasts, up to 12 
times after 24 hours of incubation (Fig. 3). This increased 
mR.NA level was accompanied by a significantly enhanced 
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FIG. 2, Amplification of thyroid specific and imerJeukin-receptor mRNA in thyrocytes (1), SW 1736 (2), C 643 (3), HTh 
74 (4) and 8505 C (5) cells using RT-PCR; M f = 100-bp ladder (GJBCO). 



MMP-1 secretion after 48 hours. Furthermore, IL-la in- 
creased MMP-1 mRNA expression in thyrocytes up to 
seven times after 6 hours, but no MMP-1 protein could be 
detected in thyrocyte cultures. IL-1 had no stimulatory ef- 
fect on MMP-1 mRNA expression in C 643, HTh 74, and 
8505 C cells after 24 hours, although a significant increase 
of MMP-1 secretion was found in HTh-74 and SW 3736 
cells after 48 hours of incubation {Fig. 5). This discrepancy 
may be explained by a possible increase in MMP-1 mRNA 
level after 24 hours of stimulation. The same effect could 
also be observed in the IJL-1 stimulated T1MP-1 at the 
mRNA as well 3s the protein level: the only slight effect 
of IL-1 on T1MP-1 mRNA expression in carcinoma cell 
lines after 24 hours was accompanied by a significant in- 
crease of "TIMP-1 secretion in 8505 C and HTh 74 cells 
after 48 hours (Figs. 4 and 6). 

Effects of TNF-a on both MMP-1/T1MP-1 mRNA 
and protein expression 

In contrast to IL-1, TNF-a did not stimulate the MMP- 
I arid TIMP-1 mRNA 3nd protein levels in al! carcinoma 
cell lines and thyrocytes. Only thyroid-derived fibroblasts 
responded with a slight upregulation of MMP-1 and TIMP- 
1 mRNA expression after TNF-a stimulation, which was 
not accompanied by an increase of MMP-1 and TIMP-1 
secretion. 



Effects of PMA, and lFN-y on MMP-1 and TIMP-1 
mRNA and protein expression 

PMA was included in our study as a positive control 
because it is known to upregutateor induce both MMP- 
1 and TIMP-1 secretion in various cell types (1,32). In- 
deed, PMA was able to induce or enhance MMP-1 
mRNA levels in all cell types investigated, although the 
detected levels varied to a large extent {Fig. 3). This re- . 
suit is in good correlatron with the significantly increased 
MMP-1 protein levels that were already detectable after 
24 hours of stimulation {Fig. 5). PMA upregulated T1MP- 

1 rnRNA levels by up to 20 times in C 643, and up to 

2 times in SW 1736 and HTh 74 cells, fibroblasts and 
thyrocytes, but it did not change tbeTiMP-1 mRNA con- 
tent in 8505 cells {Fig. 4). At the protein level, we found 
a significant stimulation of TIMP-1 secretion in C 643 
and HTh 74 cells, as well as in thyroid-derived fibrob- 
lasts {Fig. 6). 

In contrast to PMA, IFN-y was without effect on stim- 
ulation or downregulation of MMP-1 and TIMP-1 mRNA 
or protein in any of the cell types investigated (Figs 5 and 

6). 

The main inhibitor of MMP-1 isflMP-1, which forms 
. 1:1 stoichiometric complexes with MMP-1, although some 
other inhibitors can also bind MMP-1. On the other band, 
TIMP-1 can bind other MMP types. 
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FIG. 3. Representative samples of competitive amplified MMP-1 mRNA of thyrocytes, thyroid-derived fibroblasts and thy- 
roid carcinoma cell lines without stimulation (control) and after stimulation with J0 U/mL IL-1o and 100 U/mL TNF-a and 
10 ng/mL PMA after 24 hours. Serial dilutions of known amounts of the competitor fragment were coamplifjed with identi- 
cal aliquots of cDNA. The 560-bp (cDNA) and 478-bp (competitor) PCR products were visualized by agarose gel electrophoresis 
and ethidiumbromide staining. The relative concentration of the added competitor was given in arbitrary units (AU) in the 
figure. One AU was defined as the lowest. concentration of the competitor yielding a detectable amplification for MMP-1 
mRNA. The ratio of competitor to cDNA fragments was determined by measuring the intensify of ethidium fluorescence with 
a CCD image sensor and analysis of data. Measured cDNA concentration can expressed in AU. 




DISCUSSION 

Our findings demonstrate for the first time that thyroid 
carcinoma cell lines are able to express MMP-1 and TLMP- 
1 mRNA and protein at significant levels in vitro. The ob- 
servation of spontaneous release of MMP-1 and TIMP-1 
corresponds well with earlier studies covering the secretion 
of these proteins by several carcinoma cell lines (33,34). 

However, in contrast to its clear physiological function 
in extracellular matrix breakdown, the role of MMP-J in 
tumor growth and metastases is still controversial 
(9-11,35). Recenrly, Murray et al. (3 0) demonstrated that 
MMP-1 is associated with poor prognosis in colorectal can- 
cer, and has a prognostic value independent of the Dukes 
singe. Therefore, MMP-1 could be a target for therapeu- 
tic intervention in such tumors. Furthermore, the hypoth- 
esis of whether or not cancer cells themselves arc able to 



produce MMP,' or whether cancer cells stimulate the sur- 
rounding stromal cells to secrete MMP in vivo y is disputed. 
MMP-1 mRNA and protein were detected by both in situ 
hybridization and immun ©histochemistry in stromal as 
well as tumor cells of head, neck, gastric, colorectal, and 
mammary carcinomas (9,10,36,37). In contrast, 
Kameyama (12) demonstrated by in situ hybridization that 
the MrVlP-1 mRNA was not expressed in the cancer cells 
but in the surrounding fibrous capsules of strongly differ- 
entiated papillary thyroid carcinoma tissue. Highly differ- 
entiated follicular carcinomas and follicular adenomas 
were depleted for MMP-1 transcripts. Undifferentiated fol- 
hculary, papillary, and aggressive anaplastic carcinomas 
that showed poor prognosis and strong tumor invasive and 
metastatic potential and thai can be compared in their 
morphological, genetic and growth features with undiffer- 
entiated ihvroid catcinorna cell lines were not included this 
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study. However, the missing expression of MMP-1 by nor- 
mal thyrocytes and the spontaneous secretion of this pro- 
tein by highly malignant thyroid carcinoma cell lines, as 
demonstrated in our study, indicate the involvement of 
MMP-1 secretion of transformed thyrocytes in aggressive 
thyroid tumors. 

Although all cell lines analyzed in our study sponta- 
neously secreted MMP-1, we observed marked differences 
in the basal secretion capacity. The highest MMP-1 levels 
were determined in cultures of 8505 C cells. Only 8505 C 
cells expressed TPO and Tg mRNA that may be put down 
to residual differentiated components in the cell line (see 
Materials). However, none of the anajyzed cell lines ex- 
pressed TSH-R mRNA. The cell population doubling times 
were Jess than 40 hours. All cell lines had accumulations 
of multiple genetic events. These facts indicate the undif- 
ferentiated pathology of the studied lines. It is well known 
rhat anaplastic carcinoma cell lines well retain the malig- 
nant characterises of their parental tumors (38—10). 

Furthermore, we found a distorted proportion between 
MMP-1 and T1MP-J mRNA/protein for carcinoma cell 
lines but not for normal thyroid-derived fibroblasts. The 
most disadvantageous constellation between MMP-1 and 
TIMP-1 was found in 8505 C cells. Similar to other stud- 
ies (41), rhese results suggest the influence of an altered 
'MMP/T1MP relation on lurnor progression. However, it 



should be mentioned that most studies, including the pre- 
sent one, do not take into consideration that a number of 
inhibitors distinct from TIMP-1 may regulate MMP-1 ac- 
tivity. Taking into account that the balance of active en- 
zyme and TIMP-1 concentration strongly influence the ex- 
tent of local matrix degradation, a number of studies 
showed unexpectedly high levels of TIMP-1 in malignant 
neoplasms (9,42,43). There is a great discussion as to 
whether the overall expression of MMP-1 and TIMP-1 or 
the amount of noncomplexed MMP-1 could be critical in 
aggressive tumor development. This fact underlines the na- 
ture of tissue breakdown, reflecting the complicated net- 
work of selective and coordinated production of individ- 
ual proteinases . and inhibitors under normal and 
pathophysiological conditions. Thus, the invasive and 
metastatic potential of thyroid tumors depends on the lo- 
cal net level of active MMPs. 

The synthesis of MMP-1 and TIMP-1 is influenced by a 
variety of biochemical stimuli. The recent findings on 
MMP-] and TIMP-1 gene promoters are useful in under- 
standing the complex mechanisms implied in the regula- 
tion of MMP synthesis modulated by cytokines ar\d tumor 
promoters (34,44,45). The promoter regions contain tu- 
mor promoter responsive elements (TRE) and binding mo- 
tifs for the transcription factor PEA-3, which are recog- 
nized by proto-oncogenic transcription factors, such as the 
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FIG. 7. Comparison between (i) free/complex and (ii) T1MP- 
1 complexed MMP-1 levels, and (in) TIMP-1 levels in super- 
natants of unstimulated 8505 C and HTh 74 cells, and thy- 
roid-derived fibroblasts after 24 hours using a (i) MMP-1 
ELISA that recognizes total MMP-1 (see Figure 5). The (ii) 
MMP-l/TT.MP-1 assay recognizes MMP-1/T1MP-1 complexes, 
ie, activated MMP-1 that has subsequently been complexed 
with the specific MMP inhibitor T1MP-1. The (iiij TIMP-1 
EUSA recognizes total TIMP-1, ie, free T1MP-1 and that com- 
plexed with MMPs. 



fos and jun family (45-47). IL-1, TNF-a, and PMA up- 
regulate proto-oncogenes like fos and jun, resulting in the 
stimulation of MMP-1 and T1MP-1 (45,48). The action of 
the cytokines is mediated by their specific receptors. In our 
study, 1L-1R (type 1 and type II), TNF-aR (p75 and p55) 
and JFN-yR mRNAs were demonstrated in all investigated 
cell types. PMA and JL-1 were shown to elevate MMP-1 
and TIMP-1 in nearly all cell rypes investigated, thus con- 
firming the results of several studies on other epithelial cells 
(reviewed in refs. 1,7,49). Jn the majority of experiments, 
we found a concordant expression of MMP-1 and TIMP- 
1 after stimulation, possibly achieved by the coordinated 
actions of the nuclear transcription factors, although MMP- 
1 and TIMP-1 expression can also be independently or even 
reciprocally regulated (3). The effect of TNF-a was not as 
distinct as in the case of PMA and IL-1, although several 
investigators found a pronounced effect of TNF-a particu- 
larly on TIMP-1 secretion (4,34). In contrast to studies per- 
formed with other cell types (863,864,819), IFN-y did not 
influence MMP-1 and TIMP-1 expression in thyroid carci- 
noma cell lines. In summary, the involvement of the in- 
trathyroidal physiological and pathological cytokine mi- 
eroenvironmem in the regulation of MMP- 1 and TIMP-1 
induction activation and inhibition is strongly suggested. 

Furthermore, the data demonstrate that regular human 
thyrocytes did not produce MMP-1, even after powerful 
stimulation with PMA Investigating other mammalian ep- 
ithelial cells, only one study revealtd the production of 



MMP-1. by rabbit corneal cells (21). It b yet not clear 
whether the MMP-1 mRNA detected in thyrocytes is due 
to. a low level of constitutive transcription of the MMP-1 
gene (illegitimate transcription), an existing pool of stable 
MMP-1 mRNA, or. in vitro induction of MMP-1 mRNA. 
But it seems more likely that residual fibroblasts contained 
in the purified thyrocyte preparation (<0.2%) arc respon- 
sible for the slightly positive RT-PCR results. Another ex- 
planation could be that thyrocytes are indeed MMP-1 pro- 
ducers, but the ELISA detection system used was not 
sensitive enough to measure extremely low MMP-1 secre- 
tion levels. Furthermore, the discrepancy between elevated 
HMP-1 mRNA levels of thyrocytes and the extremely low 
TIMP-1 protein secretion by these cells is difficult to ex- 
plain. Post-transcriptional regulatory events may be re-, 
sponsible for this confounding result. 

Taken together, the present study suggests that the in- 
trathyroidal cytokine microenvironment is involved in the 
regulation of MMP-1 and its inhibitor TIMP-1 in the thy- 
roid, and that both proteins may be secreted by dediffer- 
entiated thyroid carcinoma cells and involved in aggressive 
thyroid tumors in vivo. 
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Expression of embryonic fibrouectin isoform EULA parallels alpha-smooth 
muscle actin in maturing and diseased kidney. 

Ba rncs_VJLj IVTusa J , Mitchell RJf , Darncs JL . 

Department of Medicine, Division of Nephrology, University of Texas Health Science 
Center, San Antonio, Texas, USA. 

In this study we examined if an association exists between expression of an alternatively 
spliced "embryonic" fibronectin isofonn EIIIA (Fn-EIIIA) and alpha-smooth muscle 
actin (alpha-SMA) in the maturing and adult rat kidney and in two unrelated models of 
glomerular disease, passive accelerated anti-glomerular basement membrane (GBM) 
nephritis and Habu venom (HV)-induced proliferative glomerulonephritis, using 
immunoliistochemistry and in situ hybridization. Fn-EIIIA and alpha-SMA proteins were 
abundantly expressed in mesangiurn and. in periglomerular and peritubular, interstitium of 
20-day embryonic and 7-day (D-7) postnatal kidneys in regions of tubule and glomerular 
development Staining was markedly reduced in these structures in maturing juvenile (D- 
14) kidney and was largely lost in adult kidney. Expression of Fn-EttlA and alpha T SMA 
was reinitiated in the mesangiurn and the periglomerular and peritubular interstitium in 
both models and was also observed in glomerular crescents in anti-GBM nephritis. 
Increased expression of Fn-EIIIA mRNA by in situ hybridization corresponded to the 
localization of protein staining. Dual labeling experiments verified co-localization of Fn- 
EIIIA and alpha-SMA, showing a strong correlation of staining between location and 
staining intensity during kidney development, maturation, and disease. Expression of 
EIIIA mRNA corresponded to protein expression in developing and diseased kidneys and 
was lost in adult kidney. These studies show a recapitulation of the co-expression of Fn- 
EIIIA and alpha-SMA in anti-GBM disease and suggest a functional link for these two 
proteins. 
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BMI-1 geuc amplification and overexpressiou in hematological 
malignancies occur mainly in mantle cell lymphomas. 

Bca S T Tort F, Ptnyof M Puiu X , Hernandez L . Hernandez S . Fernandez PL. van 
Lohuizcn M Colomer D , Campo E . 

The Hematopathology Section, Laboratory of Anatomic Pathology, Hospital Clinic, 
Institut d'Investigacions Biomediques August Pi i Sunyer (IDIBAPS), University of 
Barcelona, Spain, 

The BMI-I gene is a putative oncogene belonging to the Polycomb group family that 
cooperates with c-myc in the generation of mouse lymphomas and seems to participate in 
cell cycle regulation and senescence by acting as a transcriptional repressor of the 
rNfC4a/ARF locus. Tlie BMI-1 gene has been located on chromosome I0pl3, a region 
involved in chromosomal translocations in infant leukemias, and amplified in occasional 
non-Hodgkin's lymphomas (NHLs) and solid tumors. To determine the possible 
alterations of this gene in human malignancies, we have examined 160 
lymphoproliferative disorders, 1 3 myeloid leukemias, and 89 carcinomas by Southern 
blot analysis and detected BMM gene amplification (3- to 7-fold) in 4 of 36 (11%) 
mantle cell lymphomas (MCLs) with no alterations in the INK>la/ARF locus. BMI-l and 
pl6INKL4a mRNA and protein expression were also studied by real-time quantitative 
reverse rranscription-PCR and Western blot, respectively, in a subset of NHLs. BMI-1 
expression was significantly higher in chronic lymphocytic leukemia and MCL than in 
follicular lymphoma and large B cell lymphoma. The four tumors with gene 
amplification showed significantly higher mRNA levels than other MCLs and NHLs with 
the BMI-l gene in gerrnline configuration. Five additional MCLs also showed very high 
mRNA levels without gene amplification. A good correlation between BMI-1 mRNA 
levels and protein expression was observed in all types of lymphomas. No relationship 
was detected between BMI-1 and pl6INK4a mRNA levels. These findings suggest that 
BMI-l gene alterations in human neoplasms are uncommon, but they may contribute to 
die pathogenesis in a subset of malignant lymphomas, particularly of mantle cell type. 
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Abstract 

• The BMI-J gene is a putative oncogene belonging to the Polycomb 
groop family that cooperates with c-myc in the generation of mouse 
■ lymphomas and seems to participate in eel) cycle regulation and senes- 
cence by acting as a transcriptional repressor of the INK4a/ABF Jocus. 
The BMI-l gene has been located on chromosome 10pJ3, a region involved 
in chromosomal translocations in infant leukemias, and. amplified in. 
occasional non-1) odgkin's lymphomas (NHLs) and solid tumors. To de- 
termine the possible alterations of this gene in human malignancies, we 
have examined J 60 lymphoprohferative disorders, 13, myeloid leukemias, 
and 89 carcinomas by Southern blot analysis and detected BMI-J gene 
amplification (3- to 7-fo!d) in 4 of 36 (13%) mantle ce)l lymphomas 
(MCLs) with no alterations in the JIsK4n/ARF locus. BMI-J and pJ6 ,NK4 * 
mRNA and protein expression were also studied by real-time quantitative 
reverse transcriplioo-PCR and Western blot, respectively, in a subset of 
NWLj. BMJ-1 expression was significantly higher in chronic lymphocytic 
leukemia and MCL than in follicular lymphoma and large B cell lym- 
phoma. The four tumors with gene amplification showed significantly 
. higher mflNA levels than other MCJLs and NHLs with the BMJ-1 gene in 
gcrmlinc configuration. Five additional MCLs also showed very high 
mRNA levels without gene amplification. A good correlation between 
BMJ-1 mRNA levels and protein expression was observed in all types of 
• lymphomas. No relationship was detected between BMJ-1 and pl6 ,NK ** 
mRNA levels. These findings suggest thai BMi-i gene alterations in 
human neoplasms are uncommon, but they may contribute to the patho- 
genesis in a subset of malignant lymphomas, particularly of mantle cell 
type. 

Introduction 

The BMJ~J y gene is a putative oncogene of the Polycomb group 
originally identified by retroviral msertional mutagenesis in E/i-c- 
myc transgenic mice infected with the Moloney murine leukemia 
vims (1, 2). These animals had a rapid development of pre- B cell 
lymphomas showing frequent proviVal insertions near the BMJ-1 gene. 
This integration resulted in BMJ-1 overexpression suggesting a coop- 
erative effect between C-MYC mdBMJ-J genes in the development of 
these tumors (3, <l). Recent smdies have indicated that the BMI-l gene 
may also participate in cell cycle control and senescence through the 



Received 1 0/1 WOO; accepted l.WQl. 

The costs of publication of ifm article wcie defrayed in pan by ihe payment of p3«e 
ch.n pes. This aniclc must therefore be hereby marked advertisement in accordance with 
18 U.S.C. Section 1734 solely to indicate this fart, 

1 Supported by Grant SAF from Comi.siwi Jnlrrminislcnal dc Ciencia y Tccno- ' 
logta. European Union Contract QLGI-CT-2000-68^ the Asociaciorj Espanola "contra cl 
Cancer, and Generality dc CDialunya 98SGR2I. S. B. and F. T. were fellows supported by 
Spanish Minisieiio dc t: due ac ion y Culture, and S. H. was supported by the Asociacion 
Espanola contra cl Cancer f»nd the Fundacio Rius i Viigili. 

2 To whom requests for reprints should be addressed, at the Department of Pathology. 
Hospital Clinic. University of Barcelona, Vijlarrocl 170, 08036- Barcelona. Spain. Phone: 
34 93 227 5450; Fa*: 34 93 ??7 5572: E-mail: campo<£mcdicina.ub.c5. 

3 The abbreviations used arc: liMI-I. fi ccll-sperific Moloney murine leukemia virus 
integration site i; NHL. non-iioduldn's lymphoma; CLL, chrome lymphocytic leukemia: 
FL. follicular lymphoma: LO., brec B cell lymphoma; MCL, mantle cell lymphoma; 
RT-PCK, tt-vmc-ltansniption-rCR; R U. retain c units. 



INK4a/ARF. locus by acting as an upstream negative regulator of 
p!6 JNK4a and pl<i/pl9 ARF gene expression (5). the human BMI-I 
gene has been mapped to chromosome 10pl3 (6), a region involved.in 
chromosomal translocations in infant leukemias (7) and rearrange- 
ments in malignant T cell lymphomas (8, 9). More recently, higMeve) 
DNA amplifications of this region have been' found by comparative 
genomic hybridization in NHLs and solid tumors (10, 11). However, . 
the possible implication of the BMI-J gene iri these alterations and ils 
role in the pathogenesis of human rumors is not known. The aim of 
this study was to analyze- the possible BMI-J gene alterations and 
expression in a large series of human neoplasms and to determine the 
relationship with INK4a/AHF locus aberrations. 

Materials and Methods 

Case Selection. A series of 262 human tumors, including 173 hematolog- 
ical malignancies and 89 carcinomas (Table 1), matched normal tissues from ' 
all carcinomas, 11 samples of normal peripheral- mononuclear cells, and 5 . 
reactive lymph nodes and tonsils, were selected based on the availability of 
frozen sampJes-for molecular analysis. 

DNA Extraction and Southern Blot Analysis. Genomic DNA was ob- 
tained using Proteinase K/RNase treatment.' 15 fig were digested with £coRI 
and Hind U I restriction enzymes (Life Technologies, Inc., Caithersburg, MD),' ■ 
for Southern blot analysis and hybridized with a L5-kb Pst\ fragment of the 
partial BM1- 1 cDN A '(6). 

JRNA Extraction and FteaJ-time Quantitative KT-PCR. Total RNA was 
obtained from 67 lymphoid neoplasms (10 CLLs, 27 MCLs. 8 FLs, and 22 
LCLs) using guanidine/isothiocyanate extraction and cesium/chloride gradient 
centrifugation. One /ig of total RNA was transcribed into cDNA using 
MMLV- reverse transcriptase (Life Technologies, Inc.) and random hexamers, 
following manufacturer's directions. Sequences of. the BMI-J and the pI6 
detection probes and primers were designed using the Primer Express program 
(Applied Biosystems, Foster City) as follows: BMI-J sense, 5 '-CTGGTTGC.- 
CCATTGACACC-3'; .BMI-J antisense, 5 '- CA G A A A ATGA ATGCG A G- 
CCA-3'; P 16 sense, 5'-CAACGCACCCAATAGTTACGG-3'; pI6 antisense, ■ 
5'-AACTrCGTCCTCCACACTCGC-3\ The probes BMI-J, 5'-CAGCTC- 
GC1TCA AG ATGGCCGC 3', and P J6, 5'-CGGAGGCCGATCCAGGTGG- 
GTA-3\ were labeled with 6-caiboxy-fluorescein as the reporter dye. The 
TaqMan-GAPDM Control Reagents (Applied Biosystcms) were used lo am- 
plify and detect the CAPOJJ gene, as recommended by the manufacturer. The 
quantitative assay amplified I u,ml of cDNA in two to four replicates using the 
primers and probes described above and the standard master mix (Applied 
Biosysterns). All reactions were performed in an AB1 PRISM 7700 Sequence 
De lector System (Applied Biosysterns). GAPDM. BMI-l, and pl6 ,NK4 * ex- 
pression was related lo a standard curve derived from serial dilutions of Raji 
cDNA. The RUs of BMJ-1 and p]6 niK4 * expression were defined as the 
mRNA levels of these genes normalized to the GADPH expression level in 
each case. 

Protein Analysis Whole-cell protein extracts were obtained from addi- 
tional ficttrri tissue available in 31 cases (7 CLLs, 12 MCLs, 8 FLs, and A 
LCLs), loaded onto a 10% SDS- poly aery lamide gel. and clectroblotted to a 
nitrocellulose membrane (Amersham). Blocked membranes were incubated 
sequentially with the monoclonal antibody BMJ-F6 (12), aniirnouse conju- 
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Tabic I Hematological malignancies and solid tumor samples analyzed for BMJ-J 
gene alterations 



Tissue samples 


No. of cases 


Hematological malignancies 




Hodgkin's disease 


2 


B ceJI lymphoproliferalive disorders 




B-Acurc lymphoblastic leukemia 


14 


CLL 


29 


Hairy cell leukemia 


. 4 


FL 


15 


MCL 


36 


LCL 


40 


T cell rymphoproliferative disorders 




T-Acuic lymphoblastic leukemia 


8 


Large granular cell leukemia 


4 


Peripheral T-cell lymphoma 


8 


Myeloproliferative disorders 




Acute myeloid leukemia 


7 


Chronic myeloid leukemia 




Solid tumors 




Colon carcinoma 


26 


Breast carcinoma 


29 


Laryngeal squamous cell carcinoma 


34" 


Total 


262 



gated to horseradish peroxidase (Amersham), and detected by enhanced cbemi- 
lurmnescence (Amersham) according to the manufacturer's recommendations. 
Statistics) Analysis. Because of the non-normal distribution of the samples 
. and the small. size of some subsets of rumors, the statistical evaluation was 
performed using nonparametric tests (SPSS, version 9.0). Comparison between 
mRNA expression levels in the different groups of NHLs was performed using 
the Kruskal-Wallis Test, with a P for significance set al 0.05. For differences 
between particular groups, the conservative BonfefTOni procedure was per- 
formed, and the P was set at 0.005: The remaining statistical analyses were 
carried out using the Mann- Whitney nonparametric U test (significance, P 
<0.05). The comparison between BMM and p]6 SHK ** quantitative mRNA 
levels was also performed using the Pearson's correlation coefficient. 

Results 

BMJ-J Gene Amplification. The BMI-J gene was examined by 
Southern blot in a large series of human rumors and normal samples 
(Table 1). The cDNA probe used in the study detected three EcoR\ 
fragments of 7.3, 3:8, and 2.6 kb and three BiwS\W fragments of 6.2, 
4, and 3.5 kb. BMJ-J gene amplification (3- to 7- fold) was delected in 
4 of 36 (11%) MCLs (Fig; I). The amplifications were confirmed with 
bolh restriction enzymes. The amplified MCLs were two blastoid and 
two typical variants. No amplifications were observed in any of the 
solid tumors when compared with tbeir respective matched non- 
neoplastic mucosa. No BMI-J gene rearrangements were observed in 
any of the samples examined; 

BMI-J mRNA Expression. To determine the BMM expression 
pattern in NHL we analyzed BMM mRNA levels by real-time quan- 
titative RT-PCR in 67 lymphomas (10 CLLs, 27 MCLs, 8 FLs, and 22 
LCLs), including the four tumors with gene amplification. A distinct 
BMM mRNA expression pattern was observed in the different types 
of lymphomas (Fig. 2; K rusk al Wall is Test; P < 0.001). The BM1 
mRNA levels in CLLs (mean, 2.2 RU; SD, 1.3) and MCLs with no 
BMJ-J gene amplification (mean, 2.5 RU; SD, 2.3) were significantly 
higher than in FLs (mean, 0.9 RU; SD, 0.8) and LCLs (mean, 0.6 RU; 
SD, 0.4;, Mann- Whitney nonparametric U lest; P < 0.01). The 4 
MCLs with BMJ-J gene amplification showed significantly higher 
levels of expression than all other groups of rumors (mean, 5.1 RU; 
SD, 1.6; P < 0.005). In addition, five typical MCLs with no structural 
alterations of the gene also showed very high levels of BMJ- 1 mRNA 
expression ranging from 4 to 9.8 RU, similar to cases with gene 
amplification (Fig. 2,4). 

BMM Protein 'Expression. BMM protein expression was exam- 
ined by Western blot in 31 tumors (7 CLLs; 12 MCLs, including two 
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cases with BMJ-J gene amplification and 4 cases with mRNA over- 
expression and no structural alteration of the gene; 8 FLs, and 4 LCLs) 
in which additional frozen tissue was available. The monoclonal 
antibody against BMM detected three closely migrating proteins of 
M r 45.000-48,000 (2). The two more slowly migrating bands prob- 
ably represent phosphorylated isoforms of the protein (12). The two 
MCLs with gene amplification and three of four cases with mRNA 
ovei expression without amplification of the gene showed very high 
levels of protein expression. The remaining MCLs and CLLs showed 
intermediate levels of protein expression, whereas low- or no-expres- 
sion signals were detected in the LCLs and FLs included in the study 
(Fig. 3). These results indicate that BMJ-J protein expression in NHL 
is concordant with the mRNA levels observed by real-time quantita- 
tive RT-PCR. 

Relationship between BMJ-J and pJ6 ,NKj,a Gene Alterations. 
The INK4a/ARF locus has been recently identified as a downstream 
target of the transcriptional repressing activity of the BMJ-J gene, 
suggesting that this gene may contribute to. human neoplasias with 
wild type JNK4/ARF (5). Most of the lymphoproliferative disorders 
analyzed in the present study, including the four cases with BMJ-J 
gene amplification, had been previously examined forp53 gene mu- 
tations and INK4a/ARF locus alterations, including gene deletions, 
mutations, hypermethylation, and expression (13, 14). The four MCLs 
with BMJ- J gene amplification and mRNA overexpression and the. 
five tumors. with BMJ-J mRNA overexpression with no structural 
alterations of the gene showed a wild- type configuration of the 
INK4a/ARF locus (13). However, one case BMJ-J gene ampli- 
fication and one case with niRNA overexpression with no alteration of 
the gene showed p53 gene mutations associated with allelic deletions. 

To determine the possible relationship between BMJ-J and 
p!6 1NK4a mRNA expression, pl6 1NK4a mRNA levels were evaluated 
by real-time quantitative RT-PCR in 50 tumors (10 CLLs, 27 MCLs, 
and 13 LCLs), including 6 cases with alterations in the JNK4o/ARF 
locus (2 MCLs and 1 LCL with pJ6 }>IK4 * gene deletion, 2 LCLs with 
pi6 promoter hypermethylation. and I CLL with p76 1NK4a gene 
mutation), and the 4 lymphomas with BMUJ amplification. Negative 
or negligible levels of pl6 ,>,K/,:> were observed in the 6 tumors with 
INK4a/ARF locus alterations. These cases were not included in the 
comparisons between BMM and p J ( > mKA7i mRNA expression. The 
pl£iNK<*a e xp ress j on levels were relatively similar in the different 
types of tumors. Only LCLs tended to have lower levels of expression, 
but the differences did not reach statistical significance (Fig. 2B\ No 
differences were observed in the pJ6 INK " a mRNA levels between 
rumors with BMI-J .gent amplification and overexpression and lym- 
phomas with germline configuration of the gene. 



CLL MCL yj CIS LCL FL 




Fig. I. Southern bloi aimlysij of BMI-J fenc. Ftwi MCLs (MCL*) showed BMJ-J gcrtc 
atop! ilk ai >on (3- iu 7-fotd) ii«r»> f >:ireil with non- neoplastic i issues {N} and vihcr NHLs. No 
amplifications 01 L*cnc rearrangement were ilcrirtrd in the icniaining NHLs and card- 
nurna.s included in ihc ihidv. 
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CLL MCL MCL* LCL 

Fig. 2. A, quantitative BM1-I mRNA transcript analysis (median and range) using 
real-time RT-PCR in a series of NHLs. MCLs with BMJ-J gene amplification (ilfCL*) 
revealed significantly higher overall BMM mRNA levels than all other types of NHLs, 
including MCLs wiib no structural alterations of the gene (P < 0.005). MCLs and CLLs 
expressed significantly higher levels than FLs and LCLs (P < 0.00 J). Results are depicted 
ds the ratio of absolute BMM:GADPH mRNA transcript numbers (RU). Bars, SD. B, 
quantitative pl6 ,NKJ, • mRNA transcript analysis (median and range) using real-time 
RT-PCR in a series of NHLs. Expression levels were relatively similar in the different 
types of rumors. Results are depicted as (he ratio of absolute pl6 ,NK ***:GADPH mRNA 
tiansrripi numbers (RU). Bars, SD. 



Discussion 

)n Ibe present study, we have examined a large series of human 
rumors for the presence of gene alterations and mRNA expression of 
the BMJ-J gene. Gene amplification was identified in four MCLs. 
These rumors showed significantly higher levels of mRNA and pro- 
tein expression compared with other lymphomas with BMJ-J in germ- 
line configuration. BM1-) expression levels were also highly up- 
rcgulated in a subset of MCLs with no apparent structural alterations 
of the gene. No alterations were detected in any of the different types 
of carcinomas included in the study. BMJ-J is considered an oncogene 
belonging to the Polycomb group family of genes. These proteins 
mainly act as transcriptional regulators, controlling specific target 
genes involved in development, cell differentiation, proliferation, and 
senescence. Different studies have shown the implication of BMM 
oveiexpiession in the development of lymphomas in murine and 
feline animal models (3. 4). The findings of the present study indicate 

Ml 



for the first time that BMJ-J gene alterations in human neoplasms are 
an uncommon phenomenon, but they seem to occur mainly in a subset 
of NHLs, particularly of mantle cell type. 

The human BW-J gene has been mapped to chromosome )0p)3. 
High-level DNA amplifications and gains in this region have been 
identified by comparative genomic hybridization in occasional solid 
tumors and NHLs (iG\ ]J). Different chromosomal translocations 
involving the 10pI3 region have also been identified in infant leuke- 
mias and T cell lymphoproliferative disorders (7, 8, 15). Most acute 
Jeukemias with this chromosomal alteration occur in children <J2 
months of age, whereas it seems to be extremely rare in adults. JOp 
translocations in T-cell lymphoproliferative disorders have been ob- 
served mainly in adult T cell Jeukemia/lymphomas and occasional 
cutaneous T cell lymphomas. In our study, we did not observe BMJ-J 
rearrangements or amplifications in any of the acute Jeukemias or T 
cell lymphomas. However, all of the acute leukemics in this study 
were diagnosed in patients over 1 6 years, and no adult T cell leuke- • 
mia/lymphomas or cutaneous lymphomas could be included in the 
series. Similarly, high-level DNA amplifications at the 10pl3 region 
have been detected in head and neck carcinomas and other solid 
tumors. Although we found no. evidence for BMJ-J gene rearrange- 
ments or amplifications in a substantial set of carcinomas, this does 
not exclude the possibility of increased gene expression or protein 
levels in these tumors. Additional studies are required to elucidate the 
possible involvement of BMJ-J in these particular groups of human 
neoplasms. 

In human hematopoietic cells, BMM is preferentially expressed in 
primitive CD34+ bone marrow cells, whereas it is negative or very 
low in more mature CD34 - cells ( 1 6). Jn peripheral lymphocytes, and 
particularly in follicular D cells, BMI-J protein expression has been 
detected in resting cells of the mantle zone, whereas it is down- 
regulated in proliferating germinal center cells (17, 18). These obser- 
vations indicate that BMM expression in normal hematopoietic cells 
is tightly regulated in relation with cell differentiation in bone marrow 
and anligeo-spectfic response in peripheral lymphocytes. BMM ex- 
pression in human tumors has not been examined previously. In this 
study, we have demonstrated thai BM1- 1 mRNA and protein expres- 
sion show a distinct pattern in different types of lymphomas. Thus, 
BMM levels were low in LCLs and FLs and significantly higher in 
MCLs and CLLs. These findings suggest that BMM expression 
patterns in B cell lymphomas maintain in part the expression profile 
of their normal cell counterparts; because FLs and at least a subgroup . 
of LCLs are considered lymphomas derived from follicular germinal 
center cells, whereas MCLs and CLLs are tumors mainly derived from . 
naive pregerminal center cells. However, the four MCLs with BMJ-J 
gene amplificaiion expressed significantly higher mRNA levels than 
all other tumors. In addition, five MCLs with no structural alterations' 
of the gene showed high mRNA levels similar to those observed in 
tumors with BMJ-J gene amplification, suggesting that other mecha- 
nisms may be involved in up-regulation of the gene in these lympho- 
mas. Different studies using animal models have shown a dose- 
dependent effect of BMJ-J gene expression on skeleion development 
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and lymphomagenesis (K 3). These observations suggest that (he high 
mRNA and protein levels detected in a subset of MCLs may play a 
role in the pathogenesis of these neoplasms. 

Recent studies have identified the JNK4/ARF locus as a down- 
stream target of the BMM transcriptional repressor activity, suggest- 
ing that BMM overexpression may contribute to human neoplasias 
that retain the wild-type )NK4a/ARF Jpcus (5). Interestingly, in our 
study, BMJ-J amplification and overexpression appeared in rumors 
with no alterations in p)6 n * K4 * and pJ4 AKF genes. However, we could 
not detect differences in the expression levels of pl6 ] * 4K4i ' in rumors 
with arid without BMJ-J gene alterations. The reasons for this apparent 
discrepancy with experimental observations are not clear. One possi- 
bility may be that genes other than JNK4a/ARF zit the main targets of 
BMM repressor activity in these rumors. Particularly, different genes 
of the HOX family are regulated by BMM and may also be involved 
in lymphomagenesis (19, 20). 

In conclusion, the findings of this study indicate that BMJ-J gene 
. expression is differentially regulated in B cell lymphomas. Alterations 
of the gene seem to be an uncommon phenomenon in human neo- 
plasms, but they may contribute to the pathogenesis in. a subset of 
MCLs. Although, BMJ-J gene alterations occurred in rumors with 
wild-type JNK4a/ARF locus, the possible cooperation between these 
genes and the oncogenic mechanisms of BMM in human neoplasms 
Tequire additional analysis. 
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Histopathology is insufficient to predict disease progression and clinical outcome in lung adeno- 
carcinoma. Here we show that gene-expression profiles based on microarray analysis can be 
used to predict patient survival in early-stage lung adenocarcinomas. Genes most related to sur- 
vival were identified with univariate Cox analysis. Using either two equivalent but independent 
training and testing sets, or 'leave-one-out' cross-validation analysis with all tumors, a risk index 
based on the top 50 genes identified low-risk and high-risk stage I lung adenocarcinomas, which 
differed significantly with respect to survival. This risk index was then validated using an inde- 
pendent sample of lung adenocarcinomas that predicted high- and low-risk groups. This index 
included genes not previously associated with survival. The identification of a set of genes that 
predict survival in early-stage lung adenocarcinoma allows delineation of a high-risk group that 
may benefit from adjuvant therapy. 



Lung cancer remains the leading cause of cancer death in indus- 
trialized countries. Most patients with non-small cell lung can- 
cer (NSCLC) present with advanced disease, and despite recent 
advances in multi-modality therapy, the overall 10-yeaj survival 
rate remains a dismal 8- 10%'. However, a significant minority of 
patients (-25-30%) with NSCLC have stage I disease and receive 
surgical intervention alone. Although 35-50% of patients with 
stage I disease will relapse within 5 years 2 '*, it is not currently 
possible to identify specific high-risk patients. 

Adenocarcinoma is currently the predominant histological 
subtype of NSCLC (refs. 1,5,6). Although morphological assess- 
ment of lung carcinomas can roughly stratify patients, there is a 
need to identify patients at high risk for recurrent oi metastatic 
disease. Preoperative variables that affect survival of patients 
ivith NSCLC have been identified 7 " 10 . Tumor size, vascular inva- 
sion, poor differentiation, high tumor-proJi/erative index and 
several genetic alterations, including K-ras (refs. 11,12) and p53 
(refs. 10,13) mutations, have prognostic significance. Multiple 
independently assessed genes or gene products have also been 
investigated to bet ter pi edict patient prognosis in lung can- 
cer 14 ' '\ Technologies that simultaneously analyze the expression 
of thousands ol genes" can be* used to correlate gene-expression 
patterns with numerous clinical parameters— including patient 
outcome— to better predict tumor behavior in individual pa- 
tients 70 . Analyses of lung cancers using ajray technologies have 
identified subgroups ol tumors that differ according to tumor 
type and histological subclasses and, to a lesser extent, survival 
among adenocarcinoma patients 1 ' 1 ". Here we correlated gene- 
expiossion profiles wjih clinical outcome in a cohort of patients 
with lung adenocarcinoma and identified specific genes that 



predict survival among patients with stage I disease. For further 
validation, we also show that the risk index predicted survival in 
an independent cohort of stage 1 lung adenocarcinomas. 

Hierarchical profile clustering yields three tumor subsets 
Using oligonucleotide arrays, we generated gene-expression pro- 
files for 86 primary lung adenocarcinomas, including 67 stage 1 
and 19 stage 111 tumors, as well as 10 non neoplastic lung sam- 
ples. Selected sample replicates showed high correlation among 
coefficients and reliable reproducibility. We determined tran- 
script abundance using 3 custom algorithm and the data set was 
trimmed of genes expressed at extremely low levels, that is, 
genes were excluded if the measure of their 75th percentile value 
was less than 100. Although potentially resulting in the loss ol 
some information, trimming in this manner decreased the possi- 
bility that the clustering algorithm would be strongly inf luenced 
by genes with little or no expression in these samples. 
Hierarchical clustering with the resulting '1,966 genes yielded 3 
clusteis of tumoi5 (Fig. 1). All 10 non-nebplastic samples clus- 
tered tightly together within Cluster 1 (data not shown). We ex- 
amined the relationships between cluster and patient and tumoi 
characteristics (Fig. 1 and Supplementary Figure A online). There 
were associations between cluster and stage (P = 0.030) and be- 
tween cluster and differentiation [P- 0.01). Cluster 1 contained 
the greatest percentage (42.8%) of well differentiated tumors, 
followed by Cluster 2 (27%) and Cluster 3 (4.7%). Cluster 3 con- 
tained the highest percentage of both poorly differentiated 
(■J 7. 6%) and stage 111 tumors (42.8%), yet contained 3 (M.3%) 
moderately differentiated and 1 (5%) well differentiated stage ) 
tumor. Notably, 1 1 stage J tumors were present in Cluster 3, sug- 
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gesting a common gene-expression profile for 
this subset of stage I and stage HI tumors. 

For patients with stage) and stage II I tumors, 
the average ages were 68. 3 and 64.5 years and 
the percentage of smokers was 88.9% and 
89.5%, respectively. Marginally significant as- 
sociations between cluster and smoking his- 
tory were observed {P ~ 0.06). A significant 
relationship between histopathological classifi- 
cation and cluster was only discernible for 
bronchioloalveolar adenocarcinomas (BAs), 
which were only present in Clusters 1 and 2 
(P = 0.0055) and comprised 35.7% and 12.3% 
of tumors for Clusters 1 and 2, respectively. 

We examined the heterogeneity in gene-ex- 
piession profiles based on the trimmed data. set among normal 
lung samples and stage I and stage III adenocarcinomas by calcu- 
lating correlation coefficients between all pairs of samples. In 
contrast to normal lung samples that displayed highly similar 
gene-expression profiles (median correlation, 0.9), both stage I 
and 111 lung tumors demonstrated much greater heterogeneity in 
their expression profiles with lower correlation coefficients (me- 
dian values, 0.82 and 0.79, respectively). 

Northern-blot and immunohistocherrmtry analyses 
Of ihe 4,966 genes examined, 967 differed significantly between 
stage I and HI adenocarcinomas, a number in excess of that ex- 
pected by chance alone (248 at alpha level (a) = 0.05). Three 
genes were arbitrarily selected to verify the microairay expression 
data. The mRNA from 20 of the normal Jung and tumor samples 
was examined by northein-blot hybridization with probes for in- 
sulin- like growth factor-binding protein 3 {JGFBP3), cyst at in C 



Cluster I 



Ouster 2 



Ouster 3 



Fig. 1 Unsupervised classification analysts of lung adenocarcinomas. 3 classes of tumors identi- 
fied by agglomerative hierarchical clustering of gene-expression profiles using the 4 ; 966 expressed 
genes. Patient and histopathological information for each lung adenocarcinoma case by cluster 
designation and methods for K-ros 12/1 3th-codon mutational status and nuclear p53 protein ac-. 
cumulation are provided (Supplementary Figure A online). TN classification denotes information 
regarding patient tumor size and nodal involvement. Associations between cluster membership 
and patient or histopathological variables are indicated at significance level (P£ 0.05). 



arid lactate dehydrogenase A (LDH-A) (Fig. 2a). Two gene probes 
not represented on the microarrays were used as controls, includ- 
ing histone H4, a potential index of overall cell proliferation, and 
28S ribosomal RNA, a control for sample loading and transfer. 
The relative amounts of 1GFBP3, cystatin C and.lDtf-yt mRNA 
strongly correlated with microarray-based measurements* (Fig. 
2b). In both assays, JGFBP3 and LDH-A mRNA levels increased 
from stage 1 to stage HI adenocarcinomas and were higher than 
those in normal lung. Cystatin C mRNA levels were more variable 
but relatively greater in normal lung than tumors. These results 
suggest that the oligonucleotide microarrays provided reliable 
measures of gene expression. The tumors showed slightly greater 
histone H4 expression than the normal lung, likely reflecting in- 
creased proliferation of tumor cells. 

Immunohistochemistry was performed for IGFBP3, cystatin C 
and HSP-70 to determine whether mRNA overexpression was re- 
flected by an increase of their corresponding proteins in tumors. 
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I ig. 2 Validation analyses ol gene-egres- 
sion profiling. «, Northern- blot analysis of i'P 
selected candidate genes tor verification' of n - 0 D> t0 ,- 4 ;r ? , , 0 ^ 

data obtained fiom oligonucleotide arrays. K'crthcm 
The same sample RNA lor the 4 uninvolved 
lung, 8 stage I and 8 stage 1)1 tumors was 

used lor the northern-blot and oligonucleotide array analyses. 
b. Correlation analysis of quantitative data obtained trom oligonucleotide 
arrays 3nd northern blols measured by integrated phosphor irnager-based 
signals lor the IGIBP3 and IDH-A genes. Ihe ratio ol IGIBP3, cystatin C 
and IDH A mRNA to 78 j rRNA was determined, "the relative values lot 
ea<h gene from each sample aie shown, n, non- neoplastic normal tuny; 
1, stage I tumors; 3, stage III tumors, c, Immunobistochemic at analysis ot 
lC'fiP-3, MM'- 70 and r ystatin C in lung and tuny adenocarcinomas. 
Cytoplasmic ICIBP-3 irnmunoieac tivity in a neoplastic gland (tumor I??) 
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IGFBP3 



HSP-70 




Cystatin C 



Cystatin C 



with prominent apical staining (blue reattant staining, arrow, upper" lett). 
Diffuse cytoplasmic HSP-70 immunoienciivity (tumor L?7), yet stromal el- 
ements show no leactivity (upper right). Normal lung parenchyma (tower 
felt) shows cytoplasmic cystatin C irnmunoreactivity in alveolar pneumo- 
( ytes (arrow) and intra- alveolar mat lophages but tumor (190) shows dit- 
tos e cytoplasmic (ystatin C irnmunoreactivity with prominent apical 
staining (lower light). Magnification, >?00 
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J mmtinoie activity for both JGFBP-3 and HSP- 70 (Fig. 2 r) was de- 
tected in the cytoplasm of the adenocarcinomas, with little de- 
tectable reactivity in the stromal oi inflammatory cells. Cystatin 
C was detected in alveolar pneumocytes and intra-alveolar 
macrophages in non neoplastic lung parenchyma and also con- 
sistently in the cytoplasm of neoplastic cells. 

Gene-expression profiles piedkt survival 

As expected, Kaplan-Meier survival curves (Fig. 3a) and log-rank 
tests indicated poorer survival among stage 111 compared with 
stage I adenocarcinomas [P = <0.000l). Two statistical ap- 
proaches were used to determine whether gene-expression pro- 
files could predict survival using the data set of 4,966 genes. In 
one approach, equal numbers of randomly assigned stage 1 and 
stage 111 tumors constituted training [n = 43) and testing {n = 43) 
sets. In the training set, the top 3 0, 20, 50 or 75 genes were used 
to create risk indices that were evaluated for their association 
ivjth survival using the '60th, 60th or 70tb percentile cutoff 
points to categorize patients into high or low groups. The results 
were similar across cut of I points but the SO- gene risk index had 
the best overall association with survival in the training set. 



Fig. 3 Gene-expression profiles and patient survival, a. Relationship be- 
tween tumor stage and patient survival (stage 1 and stage 3 differ signifi- 
cantly, P< 0.0001). b, Relationship between the survival. in the 43 test 
samples and their risk assignments based on the 50-gene risk index esti- 
mated in the 43 training samples. The high- and low- risk groups differ sig- 
nificant^ {P- 0.024). c, Relationship between patient survival and the risk 
assignments in test samples (in b) conditional for tumor stage. The high- - 
and low-risk stage I groups differ significantly (P = 0.028), whereas stage III 
low- and high-risk groups did not (/>= 0.634). rf, Relationship between sur- 
. vival in the test cases and their risk assignments based on the.86 leave-one- 
out' cross-validation of the 50-gene risk index. The high- and low-risk 
groups differ significantly {P- 0.0006). e, Relationship between test case's 
risk assignment and survival (in d) conditional on tumor stage. The high- 
and low- risk stage I lung adenocarcinoma groups differ significantly from 
each other (P = 0.003), whereas low- and high-risk stage III tumors do not. 
f, Relationship between tumor class identified by hierarchical clustering and 
patient survival. Survival for patients in Cluster 3 differed relative to the tu- 
mors in Cluster 2 (P= 0.037) and approached significance for Cluster t and 
2 combined (P- 0.06). g, Analysis of the Michigan- based risk index using 
top cross-validated survival genes identify a low- and high-risk group in an 
independent cohort of 84 Massachusetts- based hing adenocarcinomas that 
are signifkanth/ different (P= 0.003). h, Among the 62 stage I lung adeno- 
carcinomas in the Massachusetts sample, the high- and low-risk groups dif- 
fered significantly {P - 0.006). 



After conservatively choosing the 60th percentile cutoff point 
from the training set, we then applied this risk index and cutoff 
point to the testing set. The risk index of the top 50 genes cor- 
rectly identified low- and high-iisk individuals within the inde- 
pendent testing set (P = 0.024) (Fig. 3b and Supplementary 
Methods online). Notably, 11 stage 1 tumors were included in 
the high-risk subgroup. When this risk assignment was then 
conditionally examined for stage progiession^Fig. 3c), low- and 
high-risk groups among stage I tumors were found to differ (P = 
0.028) in their sui vival. 

Identification of a robust set of survival genes 
Although predictive of patient survival, a single training-testing 
set may not provide the most robust set of genes due to random 
sampling issues. Therefore, a 'leave-one-oul' cross-validation ap- 
proach was used to identify genes associated with survival from 
all 86- tumor samples. We first developed a 50-gene risk index in 
each training set, and then applied the risk index to the test case 
held out from the full set of tumors and assigned the held out 
tumor to the high- or low-risk groups (Fig. 3r/). The high and 
low-risk subgroups determined in the test cases differed signifi- 
cantly in their overall survival (P = O.00O6). Among the larger 
group of stage I lung adenocarcinomas, the low-risk (n = 46) and 
high-risk (n = 21). groups had markedly different survival (P = 
0.003) (Fig. 3e). Table 1 lists selected examples of the cumulative 
top 3 00 genes derived from this cross-validation procedure 
(complete list in Supplementary Table A online). 

It was also noted that many of the stage I patients in the high- 
risk subgroup (Fig. 3c) were present in Cluster 3 (Fig. 1). 
Kaplan-Meier analysis (Fig. 3f) demonstrated a significantly 
worse survival (p= 0.037) for patients in Cluster 3 relative to pa- 
tients in Cluster 2 and approaching significance for Cluster 1 
and 2 combined {P = 0.06). This further indicates the important 
relationship between gene- expression profiles and patient sur- 
vival, independent of disease stage. 

Consistent with previous analyses ol lung adenocarcinomas^, 
40% oi stage 1 and S7.S% of stage 111 lumors had I2ih or 13ih 
codon K-rijs gene mutations. Those patients with tumors con- 
taining K ir.s mutations showed a trend of poorer survival, but 
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Table 1 Selected examples of the top 100 genes from cross-validation 
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Cell adhesion and structure 


KRT7 


8.02E-08 


126% 
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Cell cycle and growth regulators 
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Growth factor receptor-bound protein 7 
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Vascular endothelial growth (actor 


WNT10B 


0.05 


31%' 


0.48 


20% . 


0.0022 


Wingless- type MMTV integration site family. 
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Heat-shock 70 kD protein 8 
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FXYD domain-containing ion transport 
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Eniymes, cellular metabolism 
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Cytochrome P450, subfamily XXIV 
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Steroidogenic acute legulatory protein related 


PDE7A 
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Phosphodiesterase 7A 
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Transcription and translation 
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Core promoter element binding protein 
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v-c»k avian sarcoma virus O 10 oncogene 
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v-rel avian relkuloendotheliosis viral 
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0.02 


45% 


0.0010 


KIAA0005 gene product 


MOB! 


0.27 
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0.33 


459% 


0.0018 


Mammoglobin 1 



Bolcied gcnei were also jignilk anl \ot survival in 43 lumoi (raining set (J ig. 3b). 



Table 1 5 elected e>amp)es of the cumulative top 100 genes identified using 
training-testing, cross-validation of all 86 lung tumor samples. "Ihe peitent 
change, as well as the direc lion, for the average values of the 1 0 non- neoplastic 
lung to all tumors, and for the 67 stage I to the 19 stage 1)1 tumors ar e shown. A 
positive coefficient |i value is indicative of a relationship ol cjene oprrssion to a 



poorer patient outcome. TTie genes aie listed in potential functional categories. 
Genes that were also present in the top SO survival genes m'tng the 4 3- tumor 
I joining set (Fig. 3b) are indicated in bold type. Complete listing ol the gene 
probe sets and annotated gene and uninene identifiers tan be found in the 
Supplementary Methods. 
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fig. 4 Gene expression patterns ol top survival genes o, Gene- expression patterns de- 
termined using agglomerative hieraichital clustering ol the 86 lung adenot arc inomas 
against the 100 survival-related genes (7 able 1) identified by the training- testing, tross- 
vatidaiion analysis. Substantially elevated (red) or decreased (gree/>) e>presston ol the 
genes is observed in individual turnois. 5orne tumors (bla(V ;hiow and expanded area) 
show extremely elevated expression ol specific genes. 6, An out tier gene-e>pression pat- 
tern (>5 times the interquartile range among all samples) i.\ observed lor the e/f;B2 and 
Reg! A genes (lop Iclt and right, respectively). The 3/ 00V and (rk genes (bottom lell and 
right, respectively) show a graded pattern ot expression related lo patient survival. O, 
olive; dead (also in c). i, "the number ol outliers pf person ideniitied'in the top 100 
genes plotted by survival distribution. 
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this difference did not reach statistical significance among all 
patients (P= 0.25), between patients within tumor clusters {P = 
. 0-41) or when analyzed separately among stage I {P = 0.22) and 
stage III (P = 0.53) patients. Nuclear accumulation of p53 was de- 
tected in 1 7.9% stage 1 and in 22.2% stage II] tumors. No signifi- 
cant, relationship was observed for p53 staining and patient 
survival, cluster or tumor stage. 

Confirmation using an independent set of adenocarcinomas 
The robustness of our 50-gene lisJc index in predicting survival in 
lung adenocarcinomas was tested using oligonucleotide gene-ex- 
piession data Obtained from a completely independent 
(Massachusetts- based) sample of 84 lung adenocarcinomas (62 
stage J, 14 stage II and 8 stage III; ref. 21, and dataset A at 
www.genome.wi.mit.edu/MPR/Iung). To ensure equivalent 
power for testing and comparability of samples, the criteria for 
including tumors in the analysis were 40% or greater tumor cellu- 
laiity, no mixed histology (that is, adenosouamous) and patient 
survival information. To obtain comparative gene- express ion 
measures between the two data sets, gene sequences present on 
the U95A and HuGeneFL array were examined, and expression 
data for our top 50 cross-validation genes for all 84 Massachusetts 
samples were obtained and processed 24 (see. also Supplementary 
Methods online ). When we examined the risk assignment of 
these 84 samples, employing the identical cutoff point used lor 
the 86 Michigan- based lung samples, we observed low- and high- 
risk groups (Fig. 3g;P = 0.003). Notably, among the 62 stage I tu- 
mors, high- and low- risk groups were observed that differed 
significantly (P= 0.006) in their survival (Fig. 3Ji).. 

Survival gene* had graded and outlier expression patterns 

A statistical and graphical analysis of lhe 100 survival-iejated 



genes (Table 1) clustered against all 86 tumors revealed individ- 
ual rumors with substantially elevated expression in both a lim- 
ited and larger number of genes (Fig. 4a). Among these genes, we 
observed two distinct patterns of expression related to patient 
survival. One pattern, designated 'outlier', included genes show- 
ing substantially elevated expression (greater than five times the 
interquartile range among all samples), whereas the other pat- . 
tern, designated 'graded', was characterized by continuously dis- 
tributed expression with patient survival (Fig. 4b). The abB2 and 
Reg] A genes are examples of outlier expression patterns and 
SWOP and ak genes of graded patterns. The number of outliers 
per person in the top 100 genes was identified and plotted ac- 
cording to survival times and events (Fig. 4c). Both stage I and 
stage 111 lung adenocarcinomas showed outlier gene patterns 
and 10 tumors contained 3 or more outlier genes. 

Because gene amplification may result in increased gene ex- 
pression, the nine genes with outlier expression patterns (erbBZ, 
SIC 1 AG, Wnt 1, 'MGB\, IfctflA, AKAPW, PACE, CYPZ4. KYNU) 
and one gene with a graded expression pattern (KJcTT 8) were ex- 
amined using quantitative genomic PCR to evaluate genomic 
copy number (Fig. 5a). Gene amplification of crbB2 (17ql2) was 
detected in tumor L94, which had the highest erbB2 mRNA ex- 
pression (Fig. 4a). Gene amplification was not detected for any 
of lhe other seven tested genes in tumor L94, as well as in other 
tumors. The two genes most frequently demonstrating the out- 
lier pattern in these lung adenocarcinomas were KYNU and 
CYP24, and were present in 10 and 9 tumors, respectively. 
CYPZ4 has been described as a gene amplified and overexpressed 
in breast cancer 25 , and. these results indicate elevaled expression 
in lung adenocarcinoma. 

To determine whether the graded or outlier gene-expiession 
patterns also occur at the protein-expression level, 10 of the 100 
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fig, 5 Gene amplification ;>nt) protein expression of survival- related genes, 
o. Analysis ol potential gene amplification (or 9 genes showing outlier expres- 
sion patterns in the lung tumors (etbR2, JiOA6, Wnt 1, MCfll, ZfeylA, 
AKAPM, PACE, CVP24 and KYNU) and examined using quantitative genomic 
PCR. A gene showing graded expression pattern (K#7T8), and one gene 
(PAC(A) with a sirniloi chromosome location as PACl, were used as controls. 
Only frbB2 and Rec/IA are shown. An esophageal adenoc aicinoma with 
known high-level genomic arnplilic alion of etb&2 was used as a positive con- 
trol and normal esophagus ONA was used as a negative control (Ctl)- t'C R 
fragments sizes were 343 bp tor CAPOH, 166 bp for cvbB? and 1?6 bp loi 



fteglA. DNA is from normal lung (N) and tumoifT) horn each patient (tor ex- 
ample 1 37). b, Ir nmunohis toe hemic al analysis of survival related genes with 
lung adenocarcinoma microarrays using the lurnors from this study. The 
transmembrane erbB2 protein (top left) expression is substantially increased 
in tumor 194 containing the amplified etbR7 gene (tig. 4o and b). Expression 
of VI Q (lop right) and S100P (bottom left) was located within the neoplas- 
tic cells and the pattern ol inirnuhoreactivity was consistent with the graded 
expression pattern demonstrated by their mRNA profiles. I.x press ion of the 
oncogene ctk (bottom right) was abundantly e> pressed in neoplaMic It my 
(( lis. Magnitic alion x400(erbB?); >?0V (Via, MOOT* and ok). 
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top survival genes (Table 1) for which specific antibodies were 
available were chosen for immunohistochemieal analysis using 
lung-tumor arrays from this study (Fig. Sb). Expression of mem- 
brane erbB2 protein was substantially increased in the erfcB2-am- 
plified tumor L94 and very low levels of expression were present 
in other tumors, consistent with mRNA-expression measure- 
ments (Fig. 4a and b). CDC6 protein expression was also sub- 
stantially higher in tumor L94; consistent with mRNA levels 
(data not shown). Expression of vascular endothelial growth fac- 
tor (VEGF) and Si OOP (Fig. Sb), as well as cytokeratin 18 (KRT18), 
cytokeratin 7 (KRT7) and fas-associated death domain (FADD) 
protein (data not shown), was located within the lung tumor 
cells and consistent with the graded expression pattern of the 
mRNA profiles. The oncogene ok showed both graded mRNA as 
well as a graded protein- express) on pattern with survival, and 
was abundantly expressed in the tumor cells (Fig. Sb). These re- 
sults indicate that many survival-associated genes are expressed 
at the protein level and demonstrate similar mRNA and protein- 
expression patterns. 

Discussion 

We used several approaches for the analysis of gene-expression 
data related to clinicopathological variables and patient sur- 
vival. One approach, hierarchical clustering, was used to exam- 
ine similarities among lung adenocarcinomas in their patterns 
of gene expression. Previous studies of lung tumors" " have also 
used this method to describe subclasses of lung tumors. Here, 
we found three clusters that showed significant differences with 
respect to tumor stage and tumor differentiation. This suggests, 
as expected, that tumors with similar histological features of 
differentiation demonstrate similarities in gene expression. 
This feature also partly underlies the observed statistical associ- 
ation of tumor stage and cluster, as many of the higher-stage tu- 
mors, often poorly differentiated and previously associated 
with a leduced survival'*' 0 , were located in Cluster 3. Although 
this cluster contained the highest percentage of stage 111 tu- 
mors, it also contained a nearly equal mixture of stage I and 
stage III tumors and not all tumors were pooily differentiated. 
This indicates that a subset of stage I lung adenocarcinomas 
share gene-expression profiles with higher-stage tumors. 
Notably, 10 of the 1 1 stage I tumors found in Clustei 3'weie the 
high-risk stage I tumors identified using the risk index in the 
'leave-one-out' cross-validation. 

In contrast to previous analyses of lung adenocarcinomas 71 ", 
we validated the expression data fiom the arrays. The strong cor- 
relation of northern-blot analysis and oligonucleotide-array data 
for gene expression in the same samples (Fig. 2b) indicates that 
these studies provide lobust gene-expression estimates- 
Immunohistocbemistiy using the same tumor samples in tissue 
arrays demonstrates piotein expression within the lung tumor 
cells. Together, these studies indicate that many ol the genes 
identified using gene-expression profiles are likely relevant to 
lung adenocarcinoma. For example, JGFBP3 gene expiession is 
increased in lung adenocarcinomas (Fig. 2c). 1GFBF3 protein 
modulates the autocrine or paracrine efJerls of insulin-like 
growth factors, elevated IGFBP3 expression is observed in colon 
cancer*, and increased serum JGf-'BP3 is associated with progies- 
sion in breast fancer 7 . Heat-shock protein 70 (HSP-70) is in- 
creased in lung adenocarcinomas of smokers 7 * and is associated 
with increased metastatic potential in breast cancer' 5 . Increased' 
serum lactate dehydrogenase is correlated with tumor Marje and 
tumor burden 3 ", and cyst at in C, a cysteine protease inhibitor ex- 



pressed in human lung cancers 31 , is prognostic in some cancers 32 . 
The decreased expression of this protease inhibitor may affect 
the invasive properties of the tumor cell. 

. The cross-validation analytical strategy we used is particularly 
informative for these types of gene- express* on analyses for dis- . 
ease outcome 33 / 3 *, and identification of ooss-validated genes with 
a larger tumor cohort may help refine this ijsk index for use in a 
clinical setting. The gene-expression data also provide opportune 
ties to observe overarching patterns that advance our under- 
standing of associations between genes and disease. Foi example, 
the top 100 survival genes include those involved in signaling, 
cell cycle and growth,, transcription, translation and metabolism. 
Expression of many of these genes is likely a function of increased 
proliferation and metabolism in the more aggressive tumors. 
Some genes, such as erbBZ and ReglA (Fig. 4o and b), were highly 
overexpressed in a few patients having poor survival. In one 
tumor, the erbB2 gene was amplified (Fig. 5a), demonstrating that 
genomic changes may underlie the over expression of a subset of 
these outlier genes. Immun ©histochemistry confirmed protein 
overexpression in this patient's tumor (Fig. Sb). Notably, seven of 
the eight outlier genes were not amplified, indicating that other 
mechanisms underlie the increased mRNA expression of these 
survival-related genes. 

Most genes showed a graded relationship betiveen expression 
and patient survival. Genes such as that encoding VEGF, known 
to be strongly associated with survival in lung cancer 3 *- 3 * were 
identified as related to patient survival in our study. VEGF . 
demonstrated a graded expression pattern, as did the 5100P and 
ak oncogene (Fig. Sb). SI OOP is a calcium-regulated protein not 
previously reported in lung cancer. The crk gene, the cellular ho- 
molog of the v crk oncogene, is a member of a family of adaptor 
proteins. involved in signal transduction and interacts directly 
with c-jun N-terminal kinase ] tJNKl) 37 . Although ak has not 
been shown to have a role lung cancer, its role in the MAP-ki- 
nase pathway, which leads to activation ol matrix metallopro- 
teinase secretion and cell invasion 38 , indicates potential 
involvement in the the tumor cell invasion or metastasis of 
some lung adenocarcinomas. Among the many genes identified 
in this study, like ak, that may be causally involved in lung can- 
cer progression (Table 1), some were related to survival in many 
patients, and others in only smaller subsets of patients. This re- 
sult is consistent with the complex molecular aichitecture ol tu- 
mors in general, the heterogeneity of lung adenocarcinomas in 
particular and the multiple mechanisms underlying tumor-cell 
survival, invasion and metastasis 39 . 

Our results demonstrate that a gene-expression risk profile- 
based on the genes most associated with patient survival — can 
distinguish stage 1 lung adenocarcinomas and differentiate prog- 
noses. The particular genes that define the clusters, or are associ- 
ated with survival, likely reflect the characteristics of the 
particular tumors included in the analysis. Current therapy lor 
patients with stage 1 disease usually consists of suigical resection 
without adjuvant treatment 23 . Clearly, the identification of a 
high-risk group among patients with stage 1 disease would lead 
to consideration of additional therapeutic intervention loi this 
group, possibly leading to improved survival ol these patients. 

Methods 

Patient population. Sequential patients seen at the Univrisity ol Michigan 
Hospital between May 1994 and My ?000 tor stage I or stage III lung ade- 
notaitinoma weie evaluated (or (his study. Consent was received and the 
project was approved by the locat Institutional Review Board. Primary tu- 
mors and adjacent non neoplastic king tissue were obtained at the lime ol 
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surgery. Peripheral portions of resected long carcinomas were sectioned, 
evaluated by a study pathologist and compared with routine H&E sections 
of the same tumors, and utilized for mRNA isolation. Regions chosen for 
analysis contained a tumor. ceMularity greater than 70%, no mixed histol- 
ogy, potential metastatic origin, extensive lymphocytic infiltration or fibro- 
sis. Tumors were histopathologic ally divided into two categories based on 
their growth pattern: bronchial- derived, if they exhibited invasive features 
with architectural destruction, and bronchiploarveplar, if they exhibited 
preservation of the lung architecture. AH stage I patients received only sur- 
gical resection with intra-tnoracic nodal sampling and no other treatments. 
Stage III patients received surgical resection plus chemotherapy and radio- 
therapy. 

Gene-expression profiling and K-fos mutation analysis. RNA isolation, 
cRNA synthesis and gene-expression profiling were performed as de- . 
scribed". Details of gene annotation and K-ros mutation analysis are pro- 
vided in supplementary information. 

Northern- blot analysis. Total cellular RNA (1 0 pg) was separated in 1 .2% 
agarose- formaldehyde gels and vacuum- transferred to Gene Screen Plus 
(NEN Life Science Products, Boston, Massachusetts). Hybridization condi- 
tions and probe labeling were as described 40 . Individual sequence-validated 
cDNA image clones for human ICFBP1 (clone 1407750), LDH-A (clone 
2420241), cyslatin C (OS3; clone 949938) were from Research Genetics 
(Huntsville, Alabama). The human histone H4 cDNA and the 2SS ribosomal 
RNA 26-mer oligonucleotide probe were prepared and labeled as de- 
scribed* 0 . 

Gene-amplification analysis. 1 1 genes were selected for the analysis of ge- 
nomic alterations. Primers were designed using Prime/Select 4.05 Windows 
32 software (DNASTAR, Madison, Wisconsin), avoiding pseudogenes or po- 
tential homologous regions. Forward and reverse primers for the genes are 
provided (Supplementary Methods online). Quantitative genomic-PCR was 
then applied and analyzed as described* 1 . 

Immunohistothemical staining. The H&E-stained slides of all primary 
lung tumors were used to identify the most representative regions ol each 
tumor and a (issue mkioarray (7 MA) block was constructed as described 41 . 
Irnmunohiitochemrstry (IHC) was periormed using both routine and sec- 
tions from the TMA block as described 3 '. Detailed methods and the con- 
centrations used for all antibodies are provided in the Supplementary 
Methods. 

Statistical methods, ( tests were used to identify differences in mean gene- 
expression levels between comparison groups. Agglomerative hierarchical 
clustering" was applied using the average linkage method to investigate 
whether there was evidence for natural groupings of tumor samples based 
on correlations between gene-expression profiles. To investigate the ro- 
bustness of the clustering inference, gene- expression values were per- 
turbed by adding random .Gaussian error of magnitude obtained from a 
duplicate sample to each data point and then rectu:tered to determine con- 
cordance in the tumor's class membership. Peaoon, y J and fisher's e>act 
tests were used to assess whether cluster membership was associated with 
physical and genetic characteristics ol the tumors. 

To determine whether gene-expression profiles were associated with 
variability in survival times, 2 sepaiate but complementary approaches 
were used. In the first approach, the 86 tumors were randomly aligned to 
equivalent training and testing sets consisting of equal numbers ol stage I 
and Ml tumors in order to validate a novel risk-index function that captured 
the effect of many genes at once ; In the second approach, cross-validation" 
was used to more robustly identity the genes associated with survival. 
Briefly, a 'leave-one-out' cross-validation procedure in which 85 of the 86 
tumors (the training set) was used to identify genes that were univarialely 
associated with survival. The risk index was defined as a linear combination 
ol the gene-e^ pression values loi the top qenes identified by univariate Cox 
proportional- hazard regression modeling* 1 , weighted by their estimated re- 
gression coefficients. Kaplan- Meier survival plots and log- rank tests wen- 
then used to assess whether the risk- index argument to high/low cate- 
gories wai validated rn the test set A more detailed description is provided 
(Supplemental)' Methods online). 
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Rapid quantitation of proinflammatory and chemoattractaut cytokine 
expression in small tissue samples and uionocyte-derived dendritic cells: 
validation of a new real-time RT-PCR technology, 

Blaschkc V , Reich 1C Blasclike S . Zipprich S , Neumann C . 

Department of Dermatology, von-Siebold-Scr. 3, D-37075, Goettingen, Germany, 
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The analysis of cytokine profiles plays a central part in the characterization of disease- 
related inflammatory pathways and the identification of functional properties of immune 
cell subpopulations. Because tissue biopsy samples are too small to allow the detection of 
cytokine protein, the detection of mRNA by RT-PCR analysis is often used to investigate 
the cytokine milieu in inflammatory lesions. RT-PCR itselfis a qualitative method, 
indicating the presence or absence of specific transcripts. With the use of internal or 
external standards it may also serve as a quantitative method. The most widely accepted 
method is quantitative competitive RT-PCR, based on internal shortened standards. 
Recently, online real-time PCR has been introduced (LightCycler), which allows 
quantitation in less than 30 min. Here, we have tested its use for the analysis of cytokine . 
gene expression in different experimental in vitro and ex vivo settings. First, we 
compared quantitative competitive RT-PCR with real-tirne RT-PCR in the quantitation of 
transcription levels of the CD4(+) cell-specific chemoattractaut Interleukin- 1 6 during the 
maturation of monocyte-derived dendritic cells, and found a good correlation between 
both methods. Second, differences in the amounts of IL-16 mRNA in synovial tissue 
from patients with rheumatoid arthritis and osteoarthritis as assessed by real-time RT- . 
PCR paralleled differences in the level of IL-16 protein in the synovial fluid. Finally, we 
employed real-time RT-PCR to study the cutaneous expression of several cytokines 
during experimental immunomodulatory therapy of psoriasis by Interleukin- 10, and 
demonstrate that the technique is suitable for pharmacogenomic monitoring. In summary, 
real-time RT-PCR is a sensitive and rapid tool for quantifying mRNA expression even 
with small quantities of tissue. The results obtained do not differ from those generated hy . 
quantitative competitive RT-PCR. 

Publication Types: 

• Evaluation Studies 
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Buty rate-induced reversal of dexamethasoue resistauce in autonomous rat 
Nh2 lymphoma cells. 

Buckley AR, Krumenacker JS . Buckley DJ , Leff MA . Magnusou NS , Reed JC . 
Miyashitst T, de Jong G , Gout PW . 

Department of Pharmacology and Toxicology, University of North Dakota School of 
Medicine and Health Sciences, Grand Forks 58202-9037, USA. 
abuckIey@mail.med.und.nodak.edu 

The parental rat Nb2 lymphoma is a prolactin (PRL)-dependent T cell line. Exposure of a 
PRL-independent subline,- Nb2-SFJCD1, to sodium butyrate (NaBT) causes transient 
reversal of their growth factor-independent proliferation in association with constitutive 
expression of protooncogenes pirn- 1 and c-myc. In the present study, we investigated the 
effect of NaBT treatment on the sensitivity of Nb2-SFJCD1 cells to dexamethasone 
(DEX)-induced apoptosis. Pretreatment with NaBT (2 mM, 72 h) partially reversed 
resistance to apoptosis in Nb2-SFJCD1 cells exposed to DEX (100 nM) for 12 h, 
assessed by flow cytometric analyses of D.N A fragmentation. However, tire cytolytic 
effect of DEX was abrogated by PRL in a time- and concentration-dependent manner. 
Evaluation of apoptosis-associated gene expression in NaBT-pre-treated cultures 
incubated with DEX or DEX+PRL indicated that the apoptosis resistance did not stem 
from altered bcl-2 or bax expression. However, there was a strong correlation between 
; the resistance to DEX-activaled apoptosis and their enhanced expression of pim-1 mRNA 
and protein. The results show that it is possible to reverse DEX-induced apoptosis of Nb2 
pre-T cells and suggest the pim-1 gene product has an important role as a suppressor of 
this process, perhaps functioning as a mediator of PRL action. 
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Alterations iu neuropeptide Y levels and Yl binding sites in the Flinders 
Sensitive Liue rats, a genetic animal model of depression. 

Caberlotto L , Jimenez P. Overstreet PH . Kurd YL . Mathe AA , Fuxe K . 

Department of Neurosci ence, Karolinska Institute, Stockholm, Sweden. 

Previously, .we observed specific alterations of neuropeptide Y (NPY) and Yl receptor 
mRNA expression in discrete regions of the Flinders Sensitive Line rats (FSL), an animal 
model of depression. In order to clarify the correlation between mRNA expression and 
protein content, radioimmunoassay and receptor autoradiography were currently 
performed. In the FSL rats, NPY-Iike immunoreactivity (NP Y-LI) was decreased in the 
. liippocampal CA region, while Yl binding sites were increased; NPY-Ll was increased 
in the arcuate nucleus. Fluoxetine treatment elevated NPY- LI in the arcuate and anterior 
cingulatc cortex and increased Yl binding sites in the medial amygdala and occipital 
cortex in both strains. No differences were found regarding the Y2 binding sites. The 
results demonstrate a good correlation between NPY peptide and mRNA expression, and 
sustain the possible involvement of NPY and Y 1 receptors in depression. 



PMID: 10327163 [PubMed - indexed for MEDLINE] 



105: Eur J ^ Neurosci. 2003 May; 1 7(9): 1 736-46. Related Articles, Links 




Neurokinin 1 receptor arid relative abundance of the short and long 
isoforrns in the human brain. 

Caberloito L , Hurd YL t Murdock P , Walilin JP , Mclolto S , Gorsi M t CarlcUi R . 

Department of Biology, Psychiatry CEDD, GlaxoSmithKIine Medicine Research Centre, 
Verona, Italy. Laura. L.Caberldtto@gsLcom 

Substance P exerts its various biochemical effects mainly via interactions through 
neurokinin-1 receptors (NKl). Recently, the NKl receptor has attracted considerable 
interest for its possible role in a variety of psychiatric disorders including depression and 
anxiety. However, little is known regarding the anatomical distribution of NKl in the 
human central nervous system (CNS). Riboprobe in situ hybridization, quantitative PCR 
and in vitro autoradiography were performed. Highest NKl mRNA levels were localized 
in the locus coeruleus and ventral striatum, while moderate hybridization signals were 
observed in the cerebral cortex (most abundant in the visual cortex), hippocampus and 
different amygdaloid nuclei. Very low levels of the NKl mRNA were detected in the 
cerebellum and thalamus. In view of the existence of a long and short isoform of the NKl 
receptor, it was.of interest to assess whether there was a differential distribution of the 
two splice variants in the human CNS and peripheral tissues. A quantitative TaqMan 
PCR analysis showed that the long NKl isoform was the most prevalent throughout the 
human brain, while in peripheral tissues the truncated form was the most represented. 
3H-Substance P. autoradiography revealed a good correlation between receptor binding 
sites and NKl mRNA expression throughout the brain, with the highest levels of binding 
in the locus coeruleus. These results provide the anatomical evidence that the NKl 
receptors have a strong association with neuronal systems relevant to mood regulation 
and stress in the human brain, but do not suggest a region-specific role of the two 
isoforrns in the CNS. 
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Characterization of cyclin D2 expression io human endometrium. 
ChoiD, YoonS, LeeE, Hwang S . Song S . Kim J . Yoon BK. Lcc JH . 

. Department of Obstetrics and Gynecology, Samsung Medical Center, SungkyunJcwan 
University School of Medicine, Seoul, South Korea. dschoi@smc.samsung.co.kr 

OBJECTIVE: This study was undertaken to investigate cyclin D2 mRNA and protein 
expression in human endometrium during the menstrual cycle. METHODS: Endometrial 
samples were obtained from I S premenopausal nonpregnant women who had 
hysterectomies for benign gynecologic reasons. They were divided into the following five 
groups according to histologic dating: early proliferative (n = 3), mid to late proliferative 
(n - 3), early secretory (n = 3), mid secretory (n = 3), and late secretory (n = 3). Cyclin 
D2 mRNA and protein expression were analyzed using reverse transcriptase-polymerase 
chain reaction, Western blotting, and trhmunohistochemistry. RESULTS: Cyclin D2 
mRNA and protein were expressed in human endometrial tissue throughout the menstrual 
cycle. Cyclin D2 mRNA and protein expression of proliferative phase endometrium were 
significantly higher than those of secretory phase endometrium (P <.05). The staining 
intensity of cyclin D2 in proliferative phase endometrium was higher than that in 
secretory phase (P <.05). Cyclin D2 mRNA level showed good correlation with cyclin 
D2 protein level (R = 0.579, P <.03), and cyclin D2 protein also showed good correlation 
with immunohistochemical staining intensity (R = 0.562, P <03). CONCLUSION: 
Cyclin D2 was expressed in human endometrium throughout the menstrual cycle. Cyclin 
D2 mRNA and protein were expressed at high levels in proliferative phase endometrium 
especially in the early proliferative phase, and then decreased in die secretory phase. 
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Human chorionic gonadotroph^ beta expression in malignant Barrett's 
oesophagus. 

Couvclard A , Paraf F , Vidaud D , Dubois S , Vidaud M Flcjou JF , Degott C . 

Service d'Anatomie Pathologique, Hopital Beaujon, 921 18 Clichy cedex, France,. 
amexouvelard@bjn.ap4iop-paris.fr 

BACKGROUND: Human chorionic gonadotroph^ beta (hCGbeta) is expressed in 
several non-trophoblastic tumours, and this is usually associated with aggressive 
behaviour. Little is known about hCGbeta expression in Barrett's adenocarcinoma. 
MATERIALS AND METHODS: We determined the hCGbeta profile in a large series of 
surgically resected Barrett's adenocarcinoma (a) at mRNA level using real-time 
quantitative reverse-transcription polymerase chain reaction analysis and (b) at protein 
level using urununohistochemistry with a polyclonal antibody and with a monoclonal . 
antibody specific for free hCGbeta. We then sought links between the hCGbeta protein 
expression pattern and clinical and pathological parameters, including patient outcome as 
well as vascular endothelial growth factor (VEGF) expression. RESULTS: hCGbeta 
protein expression was observed in 43 of 76 (57%) Barrett's adenocarcinomas. We 
showed a strong correlation between hCGbeta protein abundance and CGB mRNA level. 
We observed a statistical link between hCGbeta protein expression and infiltrative 
tumour type ( P=0.023), perineural neoplastic invasion ( P=0.007) and VEGF protein 
expression ( P=0.0 1 6). hCGbeta expression tended to be associated with a poor outcome 
(16% versus 36% survival 8 years after resection). CONCLUSION: Expression of 
hCGbeta correlates with specific infiltrative characteristics and is associated with higher 
VEGF expression. Both molecules may play a co-ordinated role in the development of 
Barrett's adenocarcinomas. 
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Dowtiiegulation of ENaC activity arid expression by TNF-alpha in alveolar 
epithelial cells. 

Dagcnais A , Frechette R, Yamagata Y . Yamagata T . Carmel JF . Clermont ME . 
Brochiero E , Masse C. Berthiaumc Y . 

Centre de recherche, CHUM-Hotel-Dieu, 3850 St-Urbain, Montreal, Quebec, Canada 
H2W 1T7. andre.dagenaisxhum@ssss.gouv.qcca 

Sodium absorption by an amiloride-sensitive channel is the main driving force of lung 
liquid clearance at biith and lung edema clearance in adulthood, m this study, we tested 
whether tumor necrosis factor-alpha (TNF-alpha), a proinflammatory cytokine involved 
m several lung pathologies, could modulate sodium absorption in cultured alveolar 
epithelial cells. We found that TNF-alpha decreased the expression of the alpha-, beta-, 
and gamma-subunits of epithelial sodium channel (ENaC) mRNA to 36, 43, andl6% of 
the controls after 24-h treatment and reduced to 50% the amount of alpha-ENaC protein 
in these cells. There was no impact, however, on alpha(l) and beta(l) Na(+)-K(+)- 
ATPase mRNA expression. Amiloride-sensitive current and ouabain-sensitive Rb(+) 
uptake were reduced, respectively, to 28 and 39% of the controls. A strong correlation 
was found at different TNF-alpha concentrations between the decrease of amiloride- 
sensitive current and alpha-ENaC mRNA expression. All these data show that TNF- 
alpha, a proinflammatory cytokine present during lung infection, has a profound 
influence on the capacity of alveolar epithelial cells to transport sodium. 
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Involvement of the CCNDl gene iu hairy cell leukemia. 

dc Boer CJ . Kluln-Nelcmaris JC . Dreef E , Kester MG , Khun PM, Schuming van 
Krieken JH . 

Department of Pathology, University of Leiden, The Netherlands. 

BACKGROUND: Previous results suggested increased mRNA expression of CCMDl in 
hairy cell leukemia (HCL). The CCND1 gene is involved in the t(l I;l4)(ql3;q32) 
chromosomal rearrangement, a characteristic abnormality in mantle cell lymphoma 
(MCL). We and others reported that, in contrast to other B-cell lymphomas, almost all 
MCL have over-expression of the CCND1 gene with a good correlation between RN A 
and protein analysis. Recent studies showed that overexpression of the cycl in D I protein 
can be easily detected by tmmunohistochemistry (IHC) on formalin- fixed, paraffin 
embedded tissues. PATIENTS AND METHODS: To investigate whether the CCNDl 
gene is involved in HCL, we performed IHC on a series of 22 cases using formalin-fixed 
paraffin embedded splenectomy specimens. For IHC the sections were boiled in citrate 
buffer. The presence of rearrangements within the BCL-1 locus and the CCND 1 gene 
was analyzed in 13 of 22 cases by Southern blot analysis using all available break-point 
probes. Expression of CCNDl was analyzed at the mRNA level (Northern blot) and 
protein level (IHC). RESULTS: Overexpression of the cyclin Dl protein using IHC was 
observed in all cases, with strong expression in 5 cases. Pre-existing B- and T-cell areas 
of the spleen did not express significant levels of the cyclin Dl protein. Seven of 9 cases 
analyzed by both IHC and Northern blotting showed overexpression of the CCNDl gene 
with both methods. No genomic abnormalities were observed in any of the 13 cases 
studied by Soudiem blot analysis. Additionally, no I lql3 abnormalities were detected by 
banding analysis of 19 of 22 cases. CONCLUSIONS: The elevated levels of CCNDl 
mRNA and protein in conjunction with the absence of overt rearrangements within the 
BCL-1 locus distinguish HCL from MCL and other B-cell malignancies. This suggests 
that activation of the CCNDl gene in HCL is due to mechanisms other than chromosomal 
rearrangement. 
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luhibiu aud activin production aad subunit expression iu human placental 
cells cultured in vitro. 

Dcbieve F ; Pampfcr S , Thomas K . 

Department of Obstetrics and Gynecological Endocrinology, Universite Catholique de 
Louvain, 1 200 Brussels, Belgium. 

Inhibins and activins are dimeric proteins, with each subuiiit being one of three related 
protein subunits (alpha, betaA or betaB). The mRNA levels of these subunits. were 
studied quantitatively during in-vitro differentiation of human cytotrophoblast cells into 
syncytium, using Northern blot analysis and semi-quantitative reverse transcription- 
polymerase chain reaction (RT-PCR) analysis. The corresponding protein concentrations 
were determined by specific enzyme-linked immunosorbent assays for inhibin A, B, pro 
alphaC and activin A in cellular protein extracts and culture medium (n = 5). 
Immunofluorescence studies showed syncytium formation after 48 h. The alpha subunit 
was present before plating and increased at 48 h (P<0.001) while the betaA subunit was 
weak before plating and increased at 24 h. The betaB subunit was not detected. With 
respect to corresponding protein synthesis, inhibin A (alpha + betaA) had risen after 48 h 
in cellular protein extract and after 72 h in culture medium, while activin A (betaA + 
betaB) was detected after 24 h, with no significant variations in culture medium. There 
was a good correlation between inhibin A and alpha subunit expression (r = 0.736, 
PO.001), as well as between activin A and betaA subunit expression (r = 0.755, 
PO.001). Tliis study showed that mRNA expression parallels protein synthesis of inhibin 
and activin in trophoblast cells; Inhibin A synthesis appears to be dependent on alpha 
subunit mRNA expression, rather than on the betaA subunit which controls activin A 
synthesis. This study has also shown that isolated cytotrophoblast cells do not produce 
dimeric inhibin. However, during the transformation of cytotrophoblast cells into 
syncytium, betaA subunit mRNA expression may be an indicator of cell aggregation, 
while alpha subunit mRNA expression may be an indicator of cell fusion. 
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Expression of membrane-type matrix metalloproleinases 4, 5, and 6 in 
mouse corneas infected with P. aeruginosa. 

Dong Z , KatarM , Alousi S , Berk RS . 

Department of Immunology and Microbiology, Wayne State University School of . 
Medicine, 540 E. Canfield, Detroit, MI 48201, USA; 

PURPOSE: To investigate the expression and regulation of membrane-type matrix 
metalloproteinases (MT-MMPs) 4, 5, and 6 in the mouse corneas infected with 
Pseudomonas aeruginosa. METHODS: G57BL/6J mice were intracorneally infected with 
P. aeruginosa. The expression of MT4-, MT5-, and MT6-MMP was detected at both the 
mRNA and protein levels by RT-PCR and immunoblot analysis. Immunohistochemica! 
staining was performed to.localize the expression of MT4- and MT5-MMP in the mouse 
corneas. RESULTS; Expression of MT4- and MT5-MMP was detected in the normal ' 
(uninfected) cornea by RTjPCR and immunoblot analysis. When infected with P. 
aeruginosa,, the corneas showedsignificant induction of each MT-MMP. Localization of 
MT4- and MT5-MMP revealed that the expression of MT5-MMP was restricted to the 
epithelial tissue in the normal cornea, whereas the induced expression of MT4- and MT5- 
MMP was predominantly in the substantia propria, which contained most of the 
infiltrating cells. MT6-MMP expression was not detected in the uninfected cornea but 
was upregulated in the infected corneas. CONCLUSIONS: Expression of MT4-, MT5-, 
and MT6-MMP was induced in corneas infected with P. aeruginosa. 
Irnmunohistochemistry showed predominant immunoreactivity of MT4- and MT5-MMP 
in the substantia propria. Previous histologic studies have revealed different patterns of 
inflammatory cell infiltration with an increased number of polymorphonuclear 
neutrophils (PMNs) during the early stage of inflammation and increased macrophages 
during the late stage. These results indicate a good correlation between the 
overexpression of the MT-MMPs in the infected corneas and the inflammatory response- 
that is, leukocyte infiltration-indicating that inflammatory cells such as macrophages and 
PMNs may play a role in the upregulation of MT-MMPs during corneal infection, which 
in turn can cause the destruction of corneal tissue. 
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Assessment of proliferative activity in colorectal carcinomas by 
quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). 

Puchrow M , Hascmcycr S , Broil R , Bruch Ui\ VVindhovei U . 

Surgical Research Laboratory, Surgical Clinic, Medical University of Lubeck, 
Ratzeburger Allee 160, D-23538 Lubeck, Germany. 

The monoclonal antibody fCi-67 and the isospecific monoclonal antibody M1B-1 are 
routinely used in oncology to assess the proliferation index of tumor cells. A more 
objective and sensitive method is the determination of the of Ki-67 protein-specific 
mRNA by quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). In 25 
resected colorectal adenocarcinomas of different stages and grades we determined 
between 0.2 and 4.4 amol (I0(-18) mol) Ki-67 protein-specific mRNA per microgram 
total RNA (median = 0.88 amol). The corresponding KJ-67 indices (expressing the 
percentage of Ki-67/MIB-I positive tumor cells) ranged from 41 to 81% (median = 61%). 
We found a good correlation between Ki-67 index and mRNA expression (r = 0.75), a 
significant correlation between both data and tumor stage (primary tumor, regional nodes, 
metastasis [pTNM] staging classification) (p < 0.001), but not between both data and 
tumor grade. Both Ki-67 indices (p = 0.05) and mRNA levels (p = 0.014) correlated 
significantly to the patients' survival. These results demonstrate that the Ki-67 protein- 
specific quantitative RT-PCR is a useful method for the characterization of tumor cell 
proliferation. 
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Molecular characterisation of carbohydrate digestion and absorption in 
equine small intestine. 

Dyer J, Fci nandcz-Castano MercdLz; E , Salmon KS , Proudman CJ , Edwards GB , 
Shirazi-Becchcy SP . 

Department of Veterinary Preclinical Sciences, University of Liverpool, UK.. 

Dietary carbohydrates, when digested and absorbed in the small intestine of the horse, 
provide a substantial fraction of metabolisable energy. However, if levels in diets exceed 
the capacity of the equine small intestine to digest and absorb them, they reach the 
hindgut, cause alterations in microbial populations and the metabolite products and 
predispose the horse to gastrointestinal diseases. We set out to determine, at the 
molecular level, the mechanisms, properties and the site of expression of carbohydrate 
digestive and absorptive functions of the equine small intestinal brush-border membrane. 
We have demonstrated that the disaccharidases sucrase, lactase and maltase are expressed 
diversely along the length of the intestine and D-glucose is transported across the equine 
intestinal brush-border membrane by a high affinity, low capacity, Na+-/glucose 
cotransporter type I isoform (SGLT1). The highest rate of transport is in duodenum > 
- jejunum > ileum. We have cloned and sequenced the cDNA encoding equine SGLT1 and 
alignment with SGLTl of other species indicates 85-89% homology at the nucleotide and 
84-87% identity at the amino acid levels. We have shown that there is a good correlation 
between levels of functional SGLTi protein and SGLTl iriRNA abundance along the 
length of the small intestine. This indicates that the major site of glucose absorption in 
horses maintained on conventional grass-based diets is in the proximal intestine, and the 
expression of equine intestinal SGLTl along the proximal to distal axis of the intestine is 
regulated at the level of mRNA abundance. The data presented in this paper are the first 
to provide information on the capacity of the equine intestine to digest and absorb soluble 
carbohydrates and has implications for a better feed management, pharmaceutical 
intervention and for dietary supplementation in horses following intestinal resection. 
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Suppressors of cytokine signaling proteins arc differentially expressed in 
Till and TIi2 cells: implications for Tb cell lineage commitment and 
maintenance. 

Egwuagu CE , YiiCR, Zhang M , Mahdi RJM Kim SJ , Gcry I . 

Laboratory of Immunology, National Eye Institute, National Institutes of Health, 
Bethesda, MD 20892, USA. emeka@helix.rtih.gov 

Positive regulatory factors induced by IL- 1 2/STAT4 and IL-4/STAT6 signaling during T 
cell development contribute to polarized patterns of cytokine expression manifested by 
differentiated Th cells. These two critical and antagonistic signaling pathways are under 
negative feedback regulation by a multimember family of intracellular proteins called 
suppressor of cytokine signaling (SOCS). However, it is not known whether these 
negative regulatory factors also modulate Thl/Th2 lineage commitment and maintenance. 
We show here that CD4(» naive T cells constitutively express low levels of SQCS I , 
SOCS2, and SOCS3 rnRNAs. These mRNAs and their proteins increase significantly in 
nonpolarized Th cells after activation by TCR signaling. We further show that 
differentiation into Thl orTli2 phenotype is accompanied by preferential expression of 
distinct SOCS mRNA transcripts and proteins. SOCS1 expression is 5-fold higher in Thl 
than in Th2 cells, whereas Th2 cells contain 23-fold higher levels of SOCS3. We also 
demonstrate that IL-12-induced STAT4 activation is inhibited iaTh2 cells ihat express 
high levels of SOCS3 whereas IL-4/STAT6 signaling is constitutively activated in Th2 
cells, but not Thl cells, with high SOCS1 expression. These results suggest that mutually 
exclusive use of STAT4 and STAT6 signaling pathways by differentiated Th cells may 
derive in part, from SOCS3- or SOCS I -mediated repression of IL-12/STAT4- or IL- 
4/STAT6 signaling in Th2 and Thl cells, respectively. Given the strong correlation 
between distinct patterns of SOCS expression and differentiation into the Th l or Th2 
phenotype, SOCS1 and SOCS3/proteins are therefore Th lineage markers that can serve 
as therapeutic targets for immune modulation therapy! 
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Intravitreal invading cells contribute to vitreal cytokine milieu in 
proliferative vitreoretinopathy. 

El-Ghrablv IA , Dua HS , Orr GIM Fischer D. Tigbe PJ. 

Larry A Donoso Laboratory for Eye Research, Department of Ophthalmology, University 
of Nottingham, UK. 

AIM: To examine the contribution of infiltrating cells in the local production of cytokines 
within the vitreous of patients with proliferative vitreoretinopathy (PVR); METHODS: 
Thepresence of mRNA coding for IL-6, IL-8, IL-lbeta, IL-1 alpha, TNFalpha, 
IFNgamma, IL-12, and HPRT was investigated in 25 vitreous samples from patients with 
PVR, 1 1 vitreous samples from patients with retinal detachment (RD) not complicated by 
PVR, and 10 vitreous samples from patients with macular hole (MH). A quantitative 
reverse transcriptase polymerase chain reaction (RT-PCR) using an internal competitor 
was used to investigate these samples. From these samples, 15 PVR, 8 RD, and 8 MH 
were analysed for the' protein levels of the same cytokines using eniyme linked ' ' ' . 
immunosorbent assay (ELISA). Spearman correlation was used to test any association 
between mRNA and cytokine protein levels^as an indicator of the contribution these cells 
make to the intravitreal cytokine milieu. RESULTS: A strong correlation was found 
between mRNA and their respective cytokine levels ;(protein products) for IL-6, IL-8, IL- 
lbeta, IL-1 alpha, TNFalpha, IFNgamma (Spearman r = 0:83, 0.73, 0.67, 0.91, 0.73, and 
0.73 respectively), but not for IL-12. The median levels of IL-6, 11^8, IL-lbeta, and 
IFNgamma mRNA and their respective cytokines were significantly higher (p <0.05) in 
patients with PVR than in those with macular' hole. There was no statistically significant 
difference in the median levels of IL- 1 alpha mRNA between PVR and MH but the 
cytokine IL-1 alpha was detected at a significantly higher level in PVR compared with 
MH patients. Between PVR and RD patients, there was no statistically significant 
difference in mRNA levels for all the investigated cytokines (p >0.05) except for IL-6 
where there was a statistical significance (p== 0.038). In contrast, the median levels of IL- 
6, IL-8, and IL-lbeta cytokines were significantly higher (p <0.05) in patients with PVR 
than in those with RD, whereas for IL-1 alpha and IFNgamma no significant statistical 
difference was detected between PVR and RD patients (p >0.05). When results of RD 
and MH patients were compared, a statistical difference was only detected in mRNA 
levels of INFgamma (p = 0.008). However, no difference was detected for INFgamma 
(protein product) or for any of the other cytokines between RD and MH patients. 
CONCLUSION: Levels of both protein and mRNA encoding IL-6, IL-8, IL-lbeta, and 
IFNgamma is significantly increased in vitreous samples from patients with PVR. The 
strong correlation between ELISA detectable cytokines (protein products) and their 
respective mRNA levels suggest that intravitreal, invasive cells are the major source of 
these cytokines, with the exception of 1L-I2. Cells invading the vitreous do not appear to 
locally produce IL-12 mRNA. This would appear to implicate cells peripheral to the 



vitreal mass as the major source of this cytokine. 
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Modulation of the glutamatergic receptors (AMPA and NMD A) and of glutamate 
vesicular transporter 2 in the rat facial nucleus after axotomy. 

ELcflreX, Cassias I. Vidal PP . de Wacle C 

LNRS (CNRS-Paris V), ESA 7060, Centre Univcrsitaire des Saints-Peres, 45 rue des Saints- 
Peres, 75270 Paris Cedex 06, France. 

Facial nerve axotomy is a good model for studying neuronal plasticity and regeneration in the 
peripheral nervous system. We investigated in the rat the effect of axotomy on the different 
subun.ts of excitatory glutamatergic AMPA (GLuRl-4), NMDA (NR1, NR2A-D) receptors post- 
synaptic density 95, vesicular glutamate transporter 2, beta catenin and cadherin mRNA levels 
and/or protein production were analyzed 1, 3, 8, 30 and 60 days after facial nerve axotomy by in 
situ hybridization and immunohistofluorescence. mRNAs coding for the GLuR2-4 NR1 NR2A 
B, D subunits of glutamatergic receptors and for post-synaptic density 95, were less abundant after 
axotomy. The decrease began as early as 1 or 3 days after axotomy; the mRNAs levels were 

lowest 8 days post-lesion, and returned to normal or near normal 60 days after the lesion The 
NR2C subuhit mRNAs were not detected in either lesioned or intact facial nuclei. 
Immunohistochemistry using specific antibodies against GLuR2-3 subunits and against NR1 
confirmed this down-regulation. There was also a large decrease in vesicular glutamate transporter 
2 immunostaining in the axotomized facial nuclei at early stages following facial nerve section In 
contrast, no decrease of NR2 A subunit and of post-synaptic density 95 could be detected at any 
time following the lesion, beta Catenin and cadherin immunoreactivity pattern changed around the 
ceil body of facial motoneuron by day 3 after axotomy, and then, tends to recover af day post- 
lesion 60 days. Therefore, our results suggest a high correlation between restoration of 
nerve/muscle synaptic contact, synaptic structure and function in facial nuclei. To investigate the 
mechanisms involved in the change of expression of these proteins following axotomy the facial 
nerve was perfused with tetrodotoxin for 8 days. The blockade of action potential significantly 
decreased GLuR2-3, NR1 and NR2A mRNAs in the ipsilateral facial nuclei. Thus axotomy- 
induced changes in mRNA abundance seemed to depend partly on disruption of activity: 
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Hepatic estrogen receptors (ER) mediate estrogenic effects on mammalian liver 
metabohsm and are thereby involved in the regulation of important 
phys.ological/pathological processes, such as coagulation, atherosclerosis and 
hypertens.on. The regulation of the formation of the ER in primary cultures of rat 

3^1^ c ™«'<>n was measured using two different methods, a ligand- 

b.ndmg techmque and an ER enzyme immunoassay. The results obtained by he twT 
merhods showed good correlation, and linear regression analysis gave a correlation 
T^vT l 0 - 95 ,,^,— fe» to low steady statekvels withi^h after 
estabhshmg the eel culture and remained low in the absence of hormonal substirudon 

fDE^ Um H Tk? Wfth PltUit3ry GH »* ^ g'-ocorticoid dexamethasone 

(DEJQ ,n comb.nat.on, the ER concentration increased 6-fold from 4.2 +/- 1 .0 to 25 8 +/- 

LrS Cy T™ eiri - ER mRNA W3S measured ^ s <* uti ™ hybridization ' 
Substi^on with GH and DEX in combination increased ER mRNA to 210 -<-/- 14% of 

IctdStt I T ER r^ RNA StabUity ^ — ^ h " *eatment.7t s 
concluded that the regulatory effects of GH and DEX on the hepatic ER in this in vitro 

system are very smular to the effects of these hormones under in vivo conditions The 
nducble expression of the ER has neyer before, to our knowledge, been demoStrat^ in 
any mammalian liver cell culture system. "cmunMraiea m 
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Cyclin Dl-negative mantle cell lymphoma: a clinicopathologic study based on 
gene expression profiling. 
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Cyclin Dl overexpression is believed to be essential in the pathogenesis of mantle cell lymphoma 
(MCL). Hence, the existence of cyclin D 1 -negative MCL has been controversial and difficult to 
substantiate. Our previous gene expression profiling study identified several cases that lacked 
cyclin D 1 expression, but had a gene expression signature typical of MCL. Herein, we report the 
clinical, pathologic, and genetic features of 6 cases of cyclin Dl-negative MCL. All 6 cases 
exhibited the characteristic morphologic features and the unique gene expression signature of 
MCL but lacked the t(l l;14)(ql3; q32) by fluorescence in situ hybridization (FISH) analysis. The 
tumor cells also failed to express cyclin Dl protein, but instead expressed either cyclin D2 (2 
cases) or cyclin D3 (4 cases). There was good correlation between cyclin D protein expression and 
the corresponding mRNA expression levels by gene expression analysis. Using interphase FISH, 
we did not detect chromosomal translocations or amplifications involving CCND2 and CCND3 ' 
loci in these cases. Patients with cyclin Dl-negative MCL were similar clinically to those wiih 
cyclin Dl -positive MCL. In conclusion, cases of cyclin Dl-negative MCL do exist and are part of 
the spectrum of MCL. Up-regulation of cyclin D2 or D3 may substitute for cyclin Dl in the 
pathogenesis of MCL. 
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Oxytocin receptors ia boviae cervix: distribution and geue expression 
during the cstrous cycle. 
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Oxytocin (OT) receptor (OTR) concentrations were determined in the cervix of 
nonpregnant cows on cycle Days 0, 3, 7-8, 1 7, and 19 (n = 3-4 cows each day); [3HJOT 
was used as the labeled ligand. Mucosal and muscle layers of the cervix were also 
analyzed separately for both ligand binding and expression of the OTR gene using a 
newly developed RNase protection assay (RAP). Cellular localization of OTR protein 
was determ.ned by irnmunohistochemistry. Al I regions of cervix from cows at estrus had 
high concentrations of OTR; in the luteal phase, all were sharply down-regulated At 

n™ S IT l3 ^ r h3d 3b0Ut 3 °- f0,d hi & Gt concentrations than the muscle layer. 
OTR mRNA was read.ly detected by RAP in the mucosa from estrous cows, while much 
weaker signals were found in the muscle. On Days 7-17, the OTR mRNA signals in both 
mucosa and muscle were very faint or nondelegable. Thus, there was a good correlation 
between hgand binding and mRNA expression, which suggests that OTR concentrations 
are mamly regulated at the transcriptional level. The epithelial cells at the luminal surface 
of the mucosa were the principal site of immunoreactive OTR; muscle cells showed 
sign.ficantly weaker signals. Previously, OT was found to stimulate prostaglandin (PG) 
b.2 output m vitro m bovine cervical tissues. Since PGE2 is capable of softening the 
cervix, our findmgs suggest that OT may have a novel physiological function to cause 
softening of the bovine cervix mediated by the release of PGE2. 
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A sampling of the yeast protcome. 
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In this study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis 
and gathered quantitative information from about 1,400 spots. We found that there is an 
enormous range of protein abundance and, for identified spots, a good correlation 
between protein abundance, mRNA abundance, and codon bias. For each molecule of 
well-translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and ethanol media. Protein turnover was 
examined and found to be insignificant for abundant proteins. Some phosphoproteins 
were identified. The behavior of proteins in differential centrifugatibn experiments was 
examined. Such experiments with 2D gels can give a global view of the yeast proteome. 
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In this study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis and gathered quan- 
titative information from about 1,400 spots. We found that there is an enormous raDge of protein abundance 
and, for identified spots, a good correlation between protein abundance, mRNA abundance, and codon bias. 
For each molecule of weJMrans) a ted mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and elbanoj media. Protein turnover was examined and found 
to be insignificant for abundant proteins. Some phospboproteins were identified; The behavior of proteins in 
differential centrifugation experiments was examined. Such experiments with 2D gels can give a global view of 
the yeast proteome. 



The sequence of the yeast genome has been determined (9). 
More recently, the number of mRNA molecules for each ex- 
pressed gene has been measured (27, 30). The next logical level 
of analysis is that of the expressed set of proteins. We have 
begun to analyze the yeast proteome bv using two-dimensional 
(2D) gels. 

2D gel electrophoresis separates proteins according to iso- 
electric point in one dimension and molecular weight in the 
other dimension (21), allowing resolution of thousands of pro- 
teins on a single gel Although modern imaging and computing 
techniques can extract quantitative data for each of the spots in 
a 2D gel, there are only a few cases in which quantitative data 
have been gathered from 2D gels. 2D gel electrophoresis is 
almost unique in its ability to examine biological responses 
over thousands of proteins simultaneously and should there- 
fore allow us a relatively comprehensive view of cellular me- 
tabolism. 

We and others have worked toward assembling a yeast pro- 
tein database consisting of a collection of identified spots in 2D 
gels and of data on each of these spots under various condi- 
tions (2, 7, 8, 10, 23, 25). These data could then be used in 
analyzing a protein or a metabolic process. Saccharomyces 
cerevisiae is a good organism for this approach since it has a 
well-understood physiology as well as a large number of mu- 
tants, and its genome has been sequenced. Given the sequence 
3nd the relative lack of introns in S- cerewsiae, it is easy to 
predict the sequence of the primary protein product of most 
genes. This aids tremendously in identifying these proteins on 
2D gels. 

There are. three pillars on which such a database resls: (i) 
visualization of many protein spots simultaneously, (ii) quan- 
tification of the protein in each spot, and (iii) identification of 
the gene product for each spot. Our first efforts at visualization 
and identification for 5. cerevisiae have been described else- 
where (7, 8)- Here we dcsciibe quantitative data for these 
proteins under a variety ol expeiimenlal conditions. 

MATERIALS AND METHODS 
Strains and media. S. cererisuit W'303 [MAT? adc2-} his3-}],}5 teu?-X }}2 
lrpl-1 uro3-l cnnl-JOO) was used (26). - Met YNB (yeast nitrogen base) medium 
wa$ 1.7 g of YNB (Drfco) per liiei. 5 t of ammonium sulfate per liter, and 
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adenine, uracil, and all amino acids except meihionine; -Met.-Cys YNB me- 
dium was the same but without meihionine or cysteine. Medium was supple- 
mented with 2% glucose (for most experiments) or with 2% ethanol (for ethanol 
experiments). Low-phosphate YHPD was described by Warner (28). 

Isotopic labeling of yeast and preparation of cell r\ I rue Is, Yeast strains were 
labeled and proteins were extracted as described by Garrets el al. (7, 8). Briefly, 
cells were grown to 5 X TO' 1 cells per ml. at 30*C; J ml of culture was trans/erred 
to a fresh lube, and 0_3 mCi of | ?5 SJmelhionine (e.g.. Express protein labeling 
mix; New England Nuclear) was added to this I -ml culture. The cells were 
incubated for a further 10 to 15 min and then transferred to a }.5ml microcen- 
trifuge tube, chilled on ice, and harvested by centrifugation. The supernatant was 
removed, and the cell pellet was resnspended in J00 pJ of. lysis buffet (20 mM 
• Tris^HCI JpH 7.6J, 10 mM NaF, 10 mM sodium pyrophosphate, 0.5 mM EDTA, 
0.1% deoxycholole: just before use, phenyimcthylsulfonyl fluoride was added to 
) mM, leupeptin was added to J jig/ml, pepsin I in was added to 1 u.g/ml T losyl- 
sulfonyl phcnylalanyl chlororoethyl ketone was added to 10 Mg/mt, and soybean 
trypsin inhibitor was added to 10 ng/ml). 

The resuspended cells were transferred to a screw- cap J. 5- ml polypropylene 
lube containing 0.23 g of glass beads (Oi-mm diameter; Biospec Products) or 
0.40 g ofzirconia beads (0i-mm diameter; Biospec Products). After the cap was 
secured, the tube was inserted into a MiniBeadbeater 8 (Biospec Products) and 
shaken at medium high speed at 4'C for 1 min. Breakage was typically 75%. 
Tubes were ihen spun in a microcentrifuge for JO s at 5,000 X g at 4"C. 

With a very fine pipelic lip, liquid was withdrawn horn the beads and trans- 
ferred lo a prechilled J .5- ml tube containing 7 ul ol DNase 1 (0.5.mg/ro); Cooper 
product no. 6330)-RNase A (0.25 mg/ml; Cooper product no. 5679)-Mg (50 mM 
MgCU) mix. Typically 70 of liquid was recovered. The mixture was incubated 
on ice for 10 min lo allow the RNase and DNase lo work. 

Next, 75 til of 2x dSDS (2x dSDS is 0.6% sodium dodecyl sulfate ISDSJ, 2% 
mercaptoelhanol, and 0.1 M Tris-1 IC1 |pH 8 J) was added. The lube was plunged 
into boiling water, incubated for 1 min, and then plunged into ice. After cooling, 
the lube was cenlriruged ol 4°C (or 3 min at M.000 X g. The supernatant was 
transferred to a fiesh tube and frozen at -70"C. About 5 til of this supernatant 
was used fox each 2D gc). 

2B polvacrvlamidr gels. 2D gels were made and run as described elsewhere 
(6-8). 

Imagr analysis of Ihr gets. The Quest J] software system was used for quan- 
titative image analysis (20, 22). Two techniques were used to collect quantitative 
daia lor analysis by Quest )) software. First, bclore the advent of phospborim- 
agers, gels were dried and fhiorographed. Each gel was exposed lo him for three 
different limes (typically 1 day, 2 weeks, and 6 weeks) to increase ihc dynamic 
range ol the data. The films woe scanned along with calibration strips to relate 
film optical density to disintegrations per minute in the gels and analyzed by the 
software to obtain a linear relationship betwrcn disintegrations per minute in the 
spots and optical densities of the film images. The quantitative data are ex- 
pressed as parts per million of the loial cellular protein. This value is calculated 
from the disintegrations per minute of ihc sample loaded onto the gel and by 
comparing the film dcn>ity of each data spot with density of the film over the 
calibration strips of known radioactivity exposed to the same film. This yields the ' 
disintegrations per minute per mil lime lei for each spot on the gel and thence its 
pajis-pei-rninuie value. 

After the advent ot phosphor imaging, gels bearing ' 15 S-labeted proteins were 
exposed to phosphor imager scieens and scanned by a Fuji phosphor imager, 
typically for r»o exposures pc< gel. Calibration Miips of known radioactivity were 
exposed simuh anrouNly. Scan data from ihr phosphor imager was assimilated by 
Quest II software, and quantitative data woe recorded for Ihr -.pots On the gels. 
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Measurements of protein turnover. Cells in exponential phase were pulse- 
labeled with | j5 S)methionine, excess cold Met and Cys were added, and samples 
of equal volume were taken from the culture at intervals up to 90 min (in one 
experiment) or up to ) 60 min (in a second experiment). Incorporation of 35 S into 
protein was essentially 100% by the first sample (JO min).. Extracts were made, 
and equal fractions of the samples were loaded on 2D gels (i.e., the different 
samples had different amounts of protein but equal amounts of ^S). Spots were 
quant itated with a phosphor imaging and Quest software. 

The software was queried for spots whose radioactivity decreased through the 
time course. The algorithm examined all data points for all spots, drew a best-fit 
line through the data points, and looked for spots where this line had a statis- 
tically significant negative slope, in one of the experiments* there was one such 
spot. To the eye, this was a minor, unidentified spot seen only in the first two 
samples (JO and 20 min). In the other experiment, the Quest software found no 
spots meeting the criteria. Therefore, wc concluded that none of the identified 
spots (and all but one of the visible spots) represented proteins with long 
half-lives. 

Centrifugal fractionation. Cells were labeled, harvested, and broken with glass 
beads by the standard method described above except that no detergent (Le., no 
deoxycholaie) was present in the lysis buffer. The crude lysate was cleared of 
. unbroken cells and large debris by cenlrifugation al 300 X g for 30 s. The 
supernatant of this centrifugal ion was then spun at 16,000 X £.for 10 min to give 
the pellet used for Fig. 6B. The supernatant of the 16,000 X g, JO-min spin was 
then spun at 100,000 x g for 30 min to give the supernatant used for Fig. 6A. 
• Protein abundance calculations; A haploidyeas! cell contains about 4 > :}0~ 17 
g of protein (J, 15). Assuming a mean protein mass of 50 kDa, there are about 
50 x 10* molecules of protein per cell. There are about 1.8 methionines per 10 
kDa of protein mass, which implies 45 X 10 8 molecules of methionine per cell 
(neglecting the small pool of free Mel). We measured (i) the counts per minute 
in each spot on the 2D gels, (ii) the total number of counts on each gel (by 
integrating counts over the entire gel), ami (iii) the total number of counts 
loaded on the gel (by scintillation counting of the original sample). Thus, we 
know what fraction of the total incorporated^ radioactivity is present m each spot. 
After correcting for the methionine (and cysteine [see below)) content of each 
protein, wc calculated an absolute number of protein molecules based on the 
fraction of radioactivity in each spot and on 50 X 10* total molecules per cell. 

The labeling mixture used contained about one-fifth as much radioactive 
cysteine as radioactive methionine. Therefore, the number of cysteine molecules 
peT protein was also taken into account in calculating the number of molecules 
of protein, but Cys molecules were weighted one-fifth as heavily as Met mole- 
cules. 

mRNA abundance calculations. For estimation of roRNA abundance, we used 
SAGE (serial analysis of gene expression) data (27) and Affymetrix chip hybrid- 
ization data (29a. 30). The mRNA column in Table 1 .shows mRNA abundance 
calculated from SAGE data alone. However, the SAGE data c;imc from cells 
growing in YEPD medium, whereas our protein measurements were from cells 
growing in YNB medium. In addition, SAGE data for low- abundance mRNAs 
suffers fiom statistical variation. Therefore, we also used chip hybridization data 
(29a, 30) for mRNA from cells grown in YNB. These hybridization data also had 
disadvantages. First, the amounts of high-:ibundance mRNAs were systemati- 
cally underestimated, probably because of saturation in the hybridizations, which 
used JO H g of cRNA. For example, the abundance of ADHJ mRNA was 197 
copies per cell by SAGE but only 32 copies per cell by hybridization, and the 
abundance of EN02 mRNA was 248 copies per cell by SAGE but only 4] by 
hybridization. When the amount of cRNA used in the hybridization was reduced 
\o 1 jig, the apparent amounts of mRNA were similar to the amounts determined 
by SAGE (29a, 29b). However, experiments using 1 jig of cRNA have been done 
for only some genes (29a). Because amounts of mRNA were normalized to 
15 ; 000 per cell, and because the amounts of abundant mRWAs were underesti- 
mated, there is a 2.2- fold overestimate of. the abundance of nonabundant 
mRNAs. We calculated this factor, of 22 by adding together the number of 
mRNA molecules from a large number of genes expressed at a low level for both 
SAGE data and hybridization data. The sum for the same genes from hybrid- 
ization data is 2.2-fold greater than that from SAGE data. 

To take into account these difficulties, we compiled a list of "adjusted" mRNA 
abundance as follows. For all high- abundance mRNAs of our identified proteins, 
we used SAGE data. For all of these particular mRNAs, chip hybridization 
suggested that mRNA abundance was the same in YEPD and YNB media. For 
medium-abundance mRNAs, SAGE data were used, but when hybridization 
data showed a significant difference between YEPD and YNB, then the SAGE 
data were adjusted by the appropriate factor. Finally, for low- abundance 
mRNAs, we used data from chip hybridizations from YNB medium but divided 
by 2.2 to normalize to the SAGE results. These calculations were completed 
without reference to protein abundance. 

CAL The codon adaptation index (CAl) was taken from the yeast protcome 
database (YPD) (I3) : foi which calculations were made according to Sharp and 
Li (24). Briefly, ihe index uies a reference set of highly expressed gents to assign 
a value to each codon. and then h score for a 'gene is calculated bum the 
frequency of- use nf the vjjious codons in that gene (24). 

Statistical analysis. The .IMP prrtgi am wns used with the aid of T. Toll v. The 
J MP piogram showed thai neither mRNA nr.i pint tin ubundances »-*rl- nor- 
mally tlismbiiiot; iheieloie. Spearman rank correlation cocfl'n'H'nt.s (/.) weio 



calculated. The mRNA (adjusted and unadjusted) and protein, data were also 
transformed so that Pearson product-moment correlation coefficients {r p ) could 
be calculated. First, this was done by a Box-Cox transformation of log- trans- 
formed data. This transformation produced normal distributions, and an r of 
0.76 was achieved. However, because the Box- Cox transformation is complex, wc 
also did a simpler logariihmic transformation. This produced a normal distribu- 
tion for the protein data. However, the distribution for the mRNA and adjusted 
mRNA data was close to, but not quite, normal. Nevertheless, we calculated the 
r p and found that it was 0.76, identical to the coefficient from the Box-Cox 
transformed data. We therefore believe that this correlation coefficient is not 
misleading, despite the fact that the log(mRNA) distribution is not quite normal. 



RESULTS 

Visualization of 1,400 spots on 'three gel systems. Yeasl 
proteins have isoelectric points ranging from 3.J to 12.8, and 
masses ranging from Jess than 10 kDa to 470 kDa. It is difficult 
to examine all proteins on a single kind of gel, because a gel 
with the needed range in pi and mass would give poor resolu- 
tion of the thousands of spots in I he central region of the gel. 
Therefore, we have used three gel systems: (i) pH "4 to 8 W with 
10% polyacrylamide; (ii) p H "3 to 10" with 10% polyacryj- 
amide; and (iii) nonequiJibrium with 15% polyacrylamide (7, 
8). Each gel system allows good resolution of a subset of yeast 
proteins. 

Figure 1 shows a pH 4-8, 10% polyacrylamide gel. the pH 
at the basic end of ihe isoelectric focusing gel cannot be main- 
tained Ihroughout focusing, and so the proieins resolved on 
such gels have isoelectric points between pH 4 and pH 6.7. For 
ihese pH 4-8 gels, we see 600 to 900 spots on the best gels after 
multiple exposures. 

The pH 3-10 gels (not shown) extend Ihe pi range somewhat 
beyond pH 7.5, allowing detection of several hundred addi- 
tional spots. Finally, we use nonequilibrium gels with 15% 
acrylamide in the second dimension. These allow visuaiizalion 
of about 100 very basic proteins and about 170 small proteins 
(less than 20 kDa). In total, using all three gel systems, about 
1,400 spots can be seen. These represent about 1,200 different 
proteins, which is about one-quarler to one-third of the pro- 
teins expressed under these conditions (27; 30). Here, we focus 
on the proteins seen on the pH 4-8 gels. 

Although nearly all expressed proteins ate present on these 
gels, the number seen is limited by a problem we call coverage. 
Since there are thousands of proteins on each gel, many pro- 
teins comigrate or nearly comigrate. When two proteins are 
resolved, but are close together; and one protein spot is much 
more intense than the other, a problem arises in visualizing the 
weaker spot: at long exposures when the weak signal is strong 
enough for detection, the signal from the strong spot spreads 
and covers the signal from the weaker spot. Thus, weak spots 
can be seen only when I hey arc well separated from strong 
spots. 

For a given gel, the number of delectable spots initially rises 
with exposure time. However, beyond an optimal exposure, the 
number of distinguishable spots begins to decrease, because 
signals from strong spots cover signals from nearby weak spots. 
Al long exposures, the whole auloradiograrn turns black. Thus, 
iherc is on optimum exposure yielding the maximum number " 
of spots, and at this exposure the weakest spots are noi seen. 

Largely because of the problem of coverage, the proieins 
seen are strongly biased toward abundant proteins. All identi- 
fied proteins have a CAl of 0.18 or more, and we have iden- 
tified no transcription factors or protein kinases, which are 
nonabundant proteins. Thus, Ibis technology is useful for ex- 
amining protein synthesis, amino acid metabolism, and glyco- 
lysis but no! for examining transcription, DNA replication, or 
the ceil cycle. 
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Spot identification. The identification of various spots has 
been described elsewhere (7, 8). At present, 169 different spots 
representing 148 proteins have been identified. Many of these 
spots have been independently identified (2, 10, 23, 25). The 
main methods used in spot identification have been analysis of 
amino acid composition, gene dverexpression, peptide se- 
quencing, and mass spectrometry. 

Pulse-cbase experiments and protein turnover. Pulse-chase 
experiments were done to measure protein half-lives (Materi- 
als and Methods). Ceils were labeled with [ 35 S]roetbionine for 
10 min. and then 3n excess of unlabeled methionine was added. 
Samples were taken at 0, 10, 20, 30, 60, and 90 min after the 
beginning of the chase. Equal amounts of 35 S were loaded from 
each sample; 2D gels were run, and spots were quantitaled. 
Surprisingly, almost every spot was nearly constant in amount 
of radioactivity over the entire time course (not shown). A few 
spots shifted from one position to another because of post- 
translational modifications (e.g., phosphorylation of RpaO and 
Efbl). Thus, the proteins being visualized are all or nearly aJl 
very stable proteins, with half-lives of more than 90 min. Gygi 
et al. (10) have come to a similar conclusion by using the N-end 
rule to predict protein half-lives. This result does not imply 
that all yeast proteins are stable. The proteins being visualized 
are abundant proteins; this is partly because they are stable 
proteins. 

Protein quantitation. Because all of the proteins seen had 
effectively the same half-life, the abundance of each protein 
was directly proportional to the amount of radioactivity incor- 
porated during labeling. Thus, after taking into account the 
total number of protein molecules per cell, the average content 
of methionine and cysteine, and the methionine and cysteine 
content of each identified protein, we could calculate the abun- 
dance of each identified protein (Tables 1 and 2;*Materia)s and 
Methods). About 1,000 unidentified proteins were also quan- 
tified, assuming an average content of Mel and.Cys. 

Many proteins give multiple spots (7, 8). The contribution 
from each spot was summed to give the total protein amount. 
However, many proteins probably have minor spots that we are 
not aware of, causing the amount of protein to be underesti- 
mated. 

When the proteins on a pH 4-8 gel were ordered by abun- 
dance, the most abundant protein had S,904 ppm, the 10th 
most abundant had 2,842 ppm, the 100th most abundant had 
314 ppm, the 500th most abundant had 57 ppm, and the 
1,000th most abundant (visualized at greater than optimum 
exposure) had 23 ppm. Thus, there is more than a 300-fold 
range in abundance among the visualized proteins. The most 
abundant 10 proteins account for about 25% of the total pro- 
tein on the pH 4-8 gel. the most abundant 60 proteins account 
for 50%, and the mo.st abundant 500 proteins account for 80%. 
Since it seems likely that the pH 4-8 gels give a representative 
sampling of all proteins;- we estimate that half of the total 
cellular protein is accounted lor by fewer than 100 different 
gene products, principally glycolytic enzymes and proteins in- 
volved in protein synthesis. 

Correlation of protein abundance with inRNA abundance. 
Estimates of mRNA abundance for each gene have been made 
by SAGE (27) and by hybridization of cRNA to oligonucleo- 
tide arrays (30). These two methods give broadly similar re- 
sults, yet each method has strengths and weaknesses (Materials 
and Methods). Table J lists the number of molecules of mRNA 
per cell for each gene studied. One measurement (mRNA) 
uses data from SAGE analysis alone (27); a second incorpo- 
rates data from both SAGE and hybridization (30) (adjusted 
mRNA) (Table J; Materials and Methods). We correlated 
protein abundance with mRNA abundance (Fig. 2). for ad- 



justed mRNA versus protein, the Spearman rank correlation 
coefficient, r^ was 0.74 (P < 0.0001), and the Pearson corre- 
lation coefficient, r pl on log transformed data (Materials and 
Methods) was 0.76 (P < 0.00001). We obtained similar corre- 
lations for mR^A versus protein and also for other data trans- 
formations (Materials and Methods). Thus, several statistical 
methods show a strong and significant correlation between 
mRNA abundance and protein abundance. Of course, the cor- 
relation is far from perfect; for mRNAs of a given abundance, 
there is at least a 10-fold range of protein abundance (Fig. 2). 
Some of this scatter is probably due to post transcriptional 
regulation, and some is due to errors in the mRNA or protein 
data. For example, the protein Yef3 runs poorly on our gels, 
giving multiple smeared spots. Its abundance has probably 
been underestimated, partly explaining the Jow protein/mRNA 
ratio of YeJ3. It is the most extreme outlier in Fig. 2. 

These data on mRNA (27, 30) and protein abundance (Ta- 
ble 1) suggest that for each mRNA molecule, there are on 
average 4,000 molecules of the cognate protein. For instance, 
for Actl (actin) there are about 54 molecules of mRNA per 
cell and about 205,000 molecules of protein. Assuming an 
mRNA half-life of 30 min (12) and a cell doubling time of 120 
min, this suggests that an individual molecule of mRNA might 
be translated roughly 1,000 times. These calculations are lim- 
ited to mRNAs for abundant proteins, which are likely to be 
the mRNAs that are translated best. 

A full complement of cell protein is synthesized in about 120 
min under these conditions. Thus, 4,000 molecules of protein 
per molecule of mRNA implies that translation initiates on an 
mRNA about once every .2 s. This is a remarkably. high rate; it 
implies that if an average mRNA bears 10 ribosorhes engaged 
in translation, then each ribosome completes translation in 
20 s; if an average protein has 450 residues; this in turn implies 
translation of over 20 amino acids per s, a rale considerably 
higher than estimated for mammalians (3 to.8 amino acids per 
s) (18). These estimates depend on the amount of mRNA per 
cell (11, 27): 

The large number of protein molecules that can be made 
from a single mRNA raises the issue of how abundance is 
controlled for less abundant proteins. Many nonabundant pro- 
teins may be unstable, and this would reduce the protein/ 
mRNA ratio. In addition, many nonabundant proteins may be 
translated at suboptima! rates. We have found that mRNAs for 
nonabundant proteins usually have suboptimal contexts for 
translalional initiation. For example, there are over 600 yeast 
genes which probably have short open reading frames in the 
mRNA upstream of the main open reading frame (17a). These 
may be devices for reducing the amount of protein made from 
a molecule of mRNA. 

Correlation of codon bias with protein abundance. The 
mRNAs for highly expressed proteins preferentially use some 
codons rather than others specifying the same amino acid (14). 
This preference is called codon bias. The codons preferred are 
those for which the tRNAs are present in the greatest amounts. 
Use of these codons may make translation faster or more 
efficient and may decrease mis incorporation. These effects are 
most important for the cell for abundant proteins, and so 
codon bias is most extreme for abundant proteins. The effect 
can be dramatic— highly biased mRNAs may use only 25 of the 
61 codons. 

We asked whether the correlation of codon bias wiih abun- 
dance continues for medium- abundance proteins. There are 
various mathematical expressions quantifying codon bias; here, 
we have used the CAJ (24) (Materials and Methods) because 
it gives a result between 0 and J. The r, for CA1 versus protein 
abundance is 0.S0 [P < 0.0001). similar to the mRNA-protein 
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TABLE 1. Quantitative data" 



Function 



Carbohydrate metabolism 



Protein svntbesis 



Heat shock 



Amino acid synthesis 



Miscellaneous 



Name 




mRNA 


Adjusted mRNA 


Protein (Glu) (10 3 ) 


Adhl . 


0.830 


197 




1 9VI 


Adb2 


0^504 


0 




o 


Cil2 


0.185 


1 


2.8 


23 


Eriol 


0.870 


No Ma 




410 


cnoz 




248 


248 


650 


roa j 


U.ouO 


179 


179 


640 




U~>UU 


13 


103 


62 




noci 
0.2D I 


0 




0 


x QUI 


U.J4/ 


5 


■ 

5 


41 


PdcJ 


0909 


zzo 




280 


Pfkl 


0^465 


5 


5 


75 


Pgil 


0.681 


14 


14 


160 


Pycl 


0.260 


1 


0.7 


37 


Tall 


0.579 


5 


5 


110 


Tdh2 


0.904 


63 


63 


430 


Tdh3 


0.924 


460 


460 


1,670 


Tpil 


0.817 


No Nla 




No Met 


Efbl 


0.762 


33 


163 


358 


Eftl,2 


0.801 


26 


26 


• 99 


Prll 


0.303 


4 


0.7 


12 


RpaO 


0.793 


246 


246 


277 


Tif} ,2 


0.752 


' 29 


29 


233 


YeB 


0.777 


36 


36 ' 


14 



Protein (Eth) (1(P) EJG ratio 



Hsc82 

Hsp60 

Hsp82 

Hspl04 

Kar2 

Ssal 

Ssa2 

Sscl 
. Ssel 
Stil 

Adel 

Ade3 

Ade5,7 

Arg4 

Gdbl 

Glnl 

His* 

IM 

hys9 

Mei6 

Pro2 

SerJ 

Trp5 

Actl 
Adkl 
A!d6 
Atp2 
Bmhl 
Bmh2 
Cdc48 
Cdc60 
. Erg20 
GppJ 
Gspl 

jp P i 

Lcbl 

Moll 

P;tbJ 

Psal 

Rnr4 

Saml 

Snm2 

Sod] 

Ubal 

YKL056 

YLR109 

YMR716 



0.581 

0381 

0.517 

0.304 

0.439 

0.709 

0.802, 

0.850 

0J21 

0.521 

0.247 

0.229 
0.276 
0.257 
0.229 
0.585 
0.524 
0.267 
0.801 
0.332 
0.657 
0.248 
0.258 
0.3 J 9 

0.710 

0.531 

0.520 

0.424 

0.322 

0.384 

0.306 

0.299 

0.373 

0.603 

0.621 

0.620 

0.173 

0.123 

0.488 

0.600. 

0.497 

0.494 

0.497 

0.376 

0.212 

0.731 

0.549 

0.777 



2 
9 
2 
7 

5 * 

2 
10 
50 

2 

8 
I 

4 

2 

2 

J 
10 
11 

3 

6 

4 

No Mo 

3 
2 
5 

54 
NoA7o 
3 
1 

46 
1 

2 
2 
5 
16 
3 
4 

0.3 

0 

3 
15 

6 

5 

3 
36 

2 
62 
21 
4] 



2.9 
23 
1.3 
7 

; 10.1 

4.3 
5 
50 
2.6 
8 

1.1 
4 

1.7 

1.4 

8.1 
27 
Jl 

3 

6 

4 
22 

3 

1.2 
5 

54 
3 

4.1 
46 
1.4 
2.4 
0.86 
5 
5 
3 
4 

0.8 

0.45 

3 

15. 

6 

5 
J5 

36 ■ 
2 
62 
21 
41 



112 

35 
52 
70 
43 

303 
.213 

270 
. 68 
96 
25 

14 
12 
14 

. 41 
148 
77 
15 
152 
32 
190 
30 
15 
28 

205 

47 

181 

76 

191 

134 

32 
6 

92 
234 
115 
254 

19 

20 

4) 
148 

44 

59 

63 
631 

14 
253 
930 
184 



972 
963 - 
288' 
974 . 
215 
608 

46 
671 

33 
205 

53 
120 

34 

35 
876 
1,927 
No Met 

362 
54 
6 

100 
106 
ND 

75 

82 
135 
161 
102 
421 
324 

85 

SO 

48 

44 

27 
9 

4 • 

41 

55 
104 

23 
109 

17 

80 

12 
8 

12 

164 
43 

159 

109 

137 

147 
26 
2 

39 

158 

39 

147 

40 

16 

19 

56 

37 

21 

20 
618 

20 
112 

40 



0.79 
>20 
12 
2.4 
0.33 
0.95 

>20 

0.73 
0.71 
0.75 



NR 
NR 



0.55 

036 
0.46 



0.67 

23 

2.6 

2.3 

2.4 

1.4 

1.5 

1.2 

1.7 



1.3 

1.5 

0.7 

032 

0.42 



0.78 



1.4 
0.72 



0.34 
0.58 



0.47 



0.44 
0.20 
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TABLE 2. Functions of proteins listed in TabJc 1 



YPD title lines* 



Alcohol dehydrogenase I; cytoplasmic isoTyme reducing acetaldehyde to etbanoL regenerating NAD* 
Alcohol dehydrogenase H; oxidfees ethanol to acetaldehyde, glucose repressed 

^'SLVrt ^j 50 ™ 1 <npnmjlochondrial); converts acetyl-CoA and oxaloacctate to ciUale plus CbA 
Fnn ^ ? ?^ P W CCI:1 ! C ^yj ralase £ converts 2-phospho-D-glycerate lo phosphoenolpymvate in grycorysis . 

Hexo£inase I; converts hexoses to hexose phosphates in grycorysis; repressed by Rlucose 

Hexobmsc JI; converts hexoses to hexose phosphates in grycorysis and plays a regulatory role in glucose repression 
{SQOlrale ryase peroxisomal; carries out part of the gryoxyjate cycle; requi/ed for gtuconeogenesis re P resSH > n 
Pyruvate dehydrogenase complex, El beta subunil , 
Pyruvate decarboxylase isozyme J 

rh^^K^T - ,pha ' Ub ° ni! ' P3M 0/ > a COm P ,ex wi,b which carries out a key regulatory step m elycolys« 

Gmcose-6-phosphaie isomerase, converts glucose-6-phosphaie lo Iructose-o-phosphate ■ ^ step m grycoiysB 
Pyruva e carboxylase J; converts pyruvate to oxaloacetate for gluconeogenesis 
Transaldolase; component of nonoxidalive pari of pentose phosphate pathway 

fihS^5^riS? P J! a !! ^M r °£ enasc J converts D-grycer aldehyde 3-phosphate lo 13-dephosphogrycerale - 
Grycerdldehyde-3-phosphate dehydrogenase 3; converts D-gryceraldebyde 3-phospbate to 1 Jdepbospholryceralc 
Triphosphate so^^ . 

Translation elongation factor EF-10; GDP/GTP exchange factor for Tef jp/Tef2p 

S "J"**™ J^or EF-2; contains diphthamide which is not essential for activity; identical to Efl2p 
Transition elongation factor EF-2; contains diphthamide which is nol essential for activity- dent ca o EfllS 
Translation initiation factor eJF3 beta subtmit (p90); has an RNA recognition domain * ' P 

Acidic ribosoroal protein AO 

Translation initiation factor 4A (eIF4A) of the DEAD box family 
Translation initiation factor 4A (e)F4A) of the DEAD box family 
Translation elongation factor EF-3A; member of ATP-binding cassette superfamiry 

Chaperonin homologous lo £ colt HtpG and mammalian HSP90 

Mitochondrial chaperonin lhal cooperates with HspJOp; homoJog of K coU GroEL 

Heat-mduoble chaperonin homologous lo£ coli HtpG and mammalian HSP90 

"^[raVpHoncoS iDdUCeJ *** -olubilizing aggregates of denatured proteins; important for IpsPf 

HC and1o°?^ '^T^^^^^f^ *» ^ —location across the endop.asmic reticulum membrane 

Cytoplasmic chapcrone; heal shock prolein of Ihe I1SP70 family 
Cytoplasmic chaperone; member of Ihe HSP70 family 

Heal shock protein of HSP70 family involved in Ihe Iranslalional apparatus 
Heat shock protein of HSP70 family, cytoplasmic 

snoffiein^ Tm44p * «* -««^ — **** of protein chains 

p£5%2&£^^ %$Z£*>~ - — ' h -P * * -° P»*e biosynthesis pa.mvay 

hif.mctio,,, pro.ein ■ 

^Tttht^r^ py.o P hosphohvd.o.ase/hi S ,idinol dehydrogenase; 2nd, 3rd, and ,0th sleps of 

^!^^^S^r^' OV - aLid (a.phu-ke.o-ehyd.o^acy,) reducoisomerase); second step in Val ' 

"iteBelS--^ - 
^Glutamyl phosphate reductase (phosphogmlamale dehydrogenase), proline biosynihetic eiwvme 
f'£h ermC »»»7>n^ invoNed in synthesis of serine from S-phSspbo^crKe * 
Tryptophan synthase, last (5th) step in tryptophan biosynthesis pathway 

Aclin; involved in cell polarization, endocytosis, and other cvloskeletal functions 
Adenylate kinase (CTP;AMP phosphotransferase), cytoplasmic 
Cytosohc acetaldehyde dehydrogenase 

Beta subunil of F) ATP synthase; 3 copies arc found in each Fl olicbmer 
Homo og of mammalian 14-3-3 protein; has strong similarity to Bmh2p 
Homolog of mammalian J4-3- 3 protein; has strong similarity to BmhJp 

may bc ' ytc,imi,ine 5,cp in bi ~ is . 

l^^^^O^^^ ^''^ ' M in -lea, pores 

^S^^r °' a » J '-' C '^ ^ 3'-^ "NA.pnKcssing complex (cleavage factor I); has 4 RNA 

Mannosr-J-phosphaie gui.nyltransfeiasc; GDP-rnannose py.ophosphonlase 

Ribonucleotide reductase small subunil K J 

5-Adenosylmeihionine synihciase 1 

S-Adenosyhnelhionine synthetase 2 

Copper-zinc superoxide dismutase 

Ubitjuiiin-aclivaiing (El) en2vme 



Adhl 

Adh2 

Cil2 

Enol 

Eno2 

Fbal 

Hxkl 

Hxk2 

Icll 

Pdbl 

Pdc) 

Pfkl 

Pgil 

Pycl 

Tall 

Tdh2 

Tdh3 

TpiJ 

Efbl 
Eftl 
Eft2 
Prtl 

RpaO (RPPO) 

Tifl 

Tif2 

Yef3 

Hsc82 
Hsp60 . 
Hsp82 
Hsp]04 

Kar2 

Ssal 
Ssa2 
Ssbl 
Ssb2 
Sscl 

Ssel 
Slil 

Adel 

Ade3 

Ade5,7 

Aik4 

Gdh) 

Glnl 

His*t . 

Ilv5 

Kys9 
. Mct6 

Prt>2 
SerJ 
Trp5 

Act I 

AdkJ 

Ald6 

Alp2 

BmhJ 

Bmh2 

Cdc48 

CticbO 

Gpp) (Rhr2) 
GspJ 
»PPJ 
Lcbl 

Mol) (Thi4) 
Pub) 

PsaJ 
Rm4 
Saml 
Sam2 
Sod! 
UbyJ 
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YMR116 (Ascj) Abundant pio.ein with rllefis On Uanslaiional efficiency and cell si2e , ha S hv 0 WD (WD-40) icpea.s 
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FIG. Z Conclation of protein abundance wilb adjusted roRNA abundance. 
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of molecules peT ceD of the cognate mRNA, with an r p of .0.76. Note the 
logarithmic axes. Data for mRNA were tafteo from refeiences 27 and 30 and 
combined as described in Materials and Methods. 



correlation, confirming a strong correlation between CA1 and 
protein abundance (Fig. 3). The relationship between CA1 and 
protein abundance is Jog linear from about 1,000,000 to about 
10,000 molecules per cell. We have no data for rarer proteins. 

It is not clear whether CAJ reflects. maximum or average 
levels of protein expression. The proteins used for the CA]- 
prolein. correlation included some proteins which were not 
expressed at maximum levels under the condition of the ex- 
periment (Hsc82, Hspl04, Ssal, Adel, Arg< His4, and others). 
When these proteins were removed from consideration and 
the correlation between CAI and the remaining (presumably 
constitutive). proteins was recalculated, the r s was essentially 
unchanged (not shown). 

The equation describing the graph in Fig. 3 is log (protein 
molecules/cell) = (2.3 X CAJ) 4- 3.7. Thus, under certain 
conditions (a CA) of 0.3 or greater; a coostitutivery expressed 
gene), a very rough estimate of protein abundance can be 
made by raising 10 to the power of [(2.3 X CAJ) + 3.7]. 

The distribution of CA) over the genome (Fig. 4) consists of 
a lower, belJ-shaped distribution, possibly indicating a region 
where thcre.is.no selection for codon bias, and an upper, flat 
distribution, starting al a CAJ of about 03, possibly indicating 
a region where I be re is selection for codon bias. Almost aJJ of 
the proteins whose abundance we have measured are in the 
upper, flat portion of the distribution. In the lower, bcJJ-shaped 
region, we do not know whether there is a correlation between 
CAI and protein abundance. 

Changes in protein abundance in glucose and ethanol. A 
comparison of cells grown in glucose (Fig. JA) with cells grown 
in ethario! (Fig. IB) is shown in Table. 1. As is well known, 
some proteins are induced tremendously during growth on 
ethanol. Two striking examples are the peroxisomal enzymes 
Icll (isocitrate lyase) and Cit2 (citrate synthase), which are 
induced in ethanol by more than 100- and 12- fold, respectively 
(Fig. 1; Table 1). These enzymes art key components of the 
glyoxylale shunt, which diverts some acetyl coenzyme A 
(acetyl- CoA) from the tricarboxylic acid cycle to gluconeogen- 
esis. & cercvisiae requires large amounts of carbohydrate for its 
cell wall; in ethanol medium, this caibohydrate comes from 
gluconeogenesis, which depends on the glyoxylale shunt and 
on (he glycolytic pathway running in reverse. The need foj 



gluconeogenesis also explains why glycolytic enzymes are 
abundant even in ethanol medium. Thus, 2D gel analysis shows 
the prominence of the glycolytic and glyoxylate shunt enzymes 
. in cells grown on ethanol, emphasizing that gluconeogenesis, 
presumably largely for production of the celj wall, is a major 
metabolic activity under these conditions. . 

During gluconeogenesis, substrate-product relationships are . 
reversed for the glycolytic enzymes. One might expect that not 
all glycolytic enzymes would be well adapted to the reverse . 
reaction. Indeed, 2D gels show that in ethanol, Adh2 (alcohol 
dehydrogenase 2) is strongly induced (16), while its isozyme 
Adhl is not greatly affected. Adhl and Adh2 each interconvert 
acetaldehyde and ethanol. Adhl has a relatively high K m for 
ethanol (37 mM), while Adh2 has a lower K m (0.8 mM) (5). 
Thus, it is thought that Adhl is specialized for glycolysis (ac- 
etaldehyde to ethanol), while Adh2 is specialized for respira- 
tion (ethanol to acetaldehyde) (5, 29). Similarly, Enol (enolase 
1) is induced in ethanol, while its isozyme Eno2 (enolase 2) 
decreases in abundance (Table 1) (4, 19). Enol is inhibited by . 
2-phosphogrycerate (the glycolytic substrate), while Eno2 is 
inhibited by pbbsphoenolpyruvate (the gluconeogenic sub- 
strate) (4). Perhaps Enol has a lower K m for phosphoenol- 
pynivate than does Eno2, though to our knowledge this has not 
been tested. Thus, the 2D gels distinguish isozymes specialized 
for growth on glucose (Adhl and Eno2) from isozymes spe- 
cialized for ethanol (Adh2 and Enol). 

Many heat shock proteins (e.g., Hsp60, Hsp82, Hspl04, and 
Kar2) were about twofold more abundant in ethanol medium 
than in glucose medium. This is consistent with the increased 
heat resistance of ceJJs grown in ethanol (3). 

Enzymes involved in protein synthesis (Efil, RpaO, and Tifl) 
were about twice as abundant in glucose medium as in ethanol 
medium. This may reflect the higher growth rate of the cells in 
glucose; 

Phosphorylation of proteins. To examine protein phosphor- 
ylation, we labeled cells with 32 P and ran 2D gels to examine 
phosphoproteins. About 300 distinct spots, probably represent- 
ing 150 to 200 proteins, could be seen on pH 4-8 gels (Fig. 5B). 
We then aligned autoradiograms of three gels, each with a . 
different kind of labeled protein ( 32 P only |Fig. 5B], 32 P plus 
35 S [Fig. 5 A], and 35 S only (not shown, but see Fig. 1 for . 
example]). In this way, we made provisional identification of 
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some of the 32 P-labeJed spots as particular 35 S-Jabeled spots. 
All such identifications are somewhat uncertain, since precise 
alignments are difficult, and of course multiple spots may ex- 
actly comigrate. Nevertheless,. we believe that most of the 
provisional identifications are probably correct. Among the 
major 32 P-Iabeled proteins are the hexokinases Hxkl and 
Hxk2 ; the acidic ribosome-associated protein RpaO, ihe trans- 
lation factors Yef3 and EfbJ, and probably Hsp70 heat shock 
proteins of the Ssa and Ssb families. RpaO and Efbl are quan- 
titatively monopbosphorylated. 

Many yeast proteins resolve into multiple spots on these 2D 
gels (7). Yef3 has live or more spots, at least four of which 
comigrate with 32 P. Tpil has a major spot showing no 32 P 
labeling and a minor, more acidic spot which overlaps with 
some 32 P label. Tiff has at least seven spots (7); two of these 
overlap with some 32 P label, but five do not (Fig. 5). Eftl has 
at least three spots (7), and none of these overlap with 32 P, 
although theie are three nearby, unidentified 32 P-labe)ed spots 
(a, c, and d in Fig. 5). Spots that seem lo be extra forms of 
Met6, Pdcl, Eno2, and Fbal can be seen in Fig. 6A, but there 
is little 32 P at these positions in Fig. 5. Thus, phosphorylation 
explains some but not all of the different protein isoforms seen. 

The cell cycle is regulated in part by phosphorylation. We 
compared 32 P- labeled proteins from cells synchronized in Gj 
with o-facior, in cells synchronized in G 1 by depletion of G, 
cyclins, and in cells synchronized in M phase with nocodazole. 
Only very minor differences were seen, and these were difficult 
to reproduce. The cell cycle proteins regulated by phosphory- 
lation may not be abundant enough for this technique to be 
applied easily. 

Centrifugal fractionation. We fractionated 3 *$-)abeled ex- 
tracts by centrifngalion (Materials and Methods). Figure 6A 
shows the proteins in the supernatant of a high- speed 
(10G\000 X g; 30 min) centrifugation, while Fig. 6B shows the 
proteins in the pellet of a low-speed (16,000 X ^ 30 min) 
centrifugaiion. Many proteins are tremendously enriched in 
one Jiaciron oi the other, while others aie present in both. 



Most glycolytic enzymes (e.g.,Tdh2, Tdh3, Eno2, Pdcl, Adh), 
and Fbal) are enriched in the supernatant fraction. The only 
exception is Pflcl (not indicated), which is found in both pellet 
and supernatant fractions. Many proteins involved in protein 
synthesis (Efil^YeA Prtl, Tifl, and RpaO) are in the pellet, 
possibly because of the association of ribosomes with the en- 
doplasmic reticulum. However, Efb] is in the supernatant, as is 
a substantial portion of the Eftl. Perhaps surprisingly, several 
mitochondrial proteins (Atp2 (not shown) and Uv5) are largely 
in the supernatant. Perhaps glass bead breakage of cells re- 
leases mitochondria) proteins. The nuclear protein Gsp] is in 
the pellet fraction. The enrichment produced by centrifugal ion 
makes it possible to see minor spots which are otherwise poorly 
resolved from surrounding proteins. Figure 6B shows that the 
previously identified Tifl spot is surrounded by as many as six 
other spots that cofractionate. We observed six identical or 
very similar additional spots when we overexpressed Tifl from 
a high-copy-number plasmid (not shown). Signal overlaps only 
one or two of these spots in 32 P- labeling experiments (Fig. 5), 
and so the different forms are not mainly due lo different 
phosphorylation states. 

DISCUSSION 

Our experience wjib developing a 2D gel protein database 
for S. cerevuiae is summarized here. With current technology, 
we can see the most abundant J ,200 proteins, which is about 
one- third to one-quarter of the pjoteins expressed. The re- 
maining proteins will be difficult to see and study with the 
methods that we have used, not because of a bck of sensitivity 
but because weak spots are covered by nearby strong spots. 

Of the 1,200 proteins seen, we have identified 148, with a 
bias toward Ihe most abundant proteins. Steady application of 
the methods already used would allow identification of most of 
the remaining proteins. Gene over expression will be particu- 
larly useful, since it is not affected by the lower abundance of 
the remaining visible proteins. 
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2D gels of (be kind that we have used are not suitable for 
visualization of rare proteins. However it will be possible to 
study on a global basis metabolic processes involving relatively 
abundant proteins, such as protein synthesis, glycolysis, glu- 
coneogenesis, amino acid synthesis, cell wall synthesis, nucle- 
otide synthesis, lipid metabolism, and the heat shock response. 

Gygi et ah (10) have recently completed a study similar to 
ours. Despite generating broadly similar data, Gygi et al. 
reached markedly different conclusions. We believe that both 
mRNA abundance and codon bias are useful predictors of 
protein abundance. However, Gygi et al. feel that mRNA 
abundance is a poor predictor of protein abundance and that 
"codon bias is not a predictor of either protein or mRNA 
levels" (10). These different conclusions are partly a matter of 
viewpoint. Gygi et al. focus on the fact that the correlations of 
mRNA and codon bias with protein abundance are far from 
perfect, while we focus on the fact that, considering the wide 
range of mRNA and protein abundance and the undoubted 
presence of other mechanisms affecting protein abundance, 
the correlations are quite good. 

However, the different conclusions are also partly due to 
different methods of statistical analysis and IQ real differences 
in data. With respect to statistics, Gygi et al. used the Pearson 
product-moment correlation coefficient (r p ) to measure the 
covariance of mRNA and protein abundance. Depending on 
the subset of data included, their r p values ranged from 0.1 to 
0.94. Because of the low r p values with some subsets of the 
data, Gygi et al. concluded that the correlation of mRNA to 
protein was poor. However, the r p correlation is a parametric 
statistic and so requires variates following a bivariate normal 
distribution; that is, it would be valid only if both mRNA and 
protein abundances were normally distributed. In fact, both 
distributions are very far from normal (data not shown), and so 
a calculation of r p is inappropriate. There was no statistical 
backing for the assertion that codon bias fails to predict pro- 
tein abundance. 

We have taken two statistical approaches. First, we have 
used the Spearman rank correlation coefficient (r,). Since this 
statistic is nonparametric. there is no requirement for the data 
to be normally distributed. Using the r Jf we find that mRNA 
abundance is well, correlated with protein abundance (r, = 
0.74), and the CA1 is also well correlated with protein abun- 
dance {r s = 0.80) (and also with mRNA abundance [data not 
shown]). For the data of Gygi et aj. (10), we obtained similar 
results, though with their data the correlation is not as good; r, 
= 0.59 for the mRNA-to-prolein correlation, and r s — 0.59 for 
the codon bias-to-pjotein correlation. 

In a second approach, we transformed the mRNA and pro- 
tein data to forms where they were normally distributed, to 
allow calculation of an r p (Materials and Methods). Two trans- 
formations, Box- Cox and logarithmic, were used; both gave 
good correlations with our data [e.g., r p = 0.76 for Jbg( adjusted 
RNA) to log(protein)). Wc were not able to transform the data 
of Gygi et al. to a normal distribution. 

Finally, there are also some differences in data between the 
two studies. These may be partly due to the different measure- 
ment techniques used: Gygi et al. measured protein abundance 
by cutting spots out of gels and measuring the radioactivity in 
each spot by scintillation counting, whereas we used phospho- 
r imaging of intact gels coupled to image analysis. We com- 
pared our data lo ibeiis for the proteins common between the 
studies (but excluding proteins whose mRNAs are known to 
differ between rich and minimal media, and excluding Tif'l. 
which was anomalous in differing by 3 00- fold between the two 
data sets). The r s between the two protein data sets was 
{P < 0.0001). Although this is a strong con elation, the fact thai 



it is less than 1.0 suggests that there may have been errors in 
measuring protein abundance in one or both studies. After- 
normalizing the two data sets to assume. the same amount of 
protein per cell, we found a systematic tendency for the protein 
abundance data of Gygi et al. to be slightly higher than ours for 
the highest- abundance proteins and also for the lowest-abun- 
dance proteins but slightly lower than ours for the middle- 
abundance proteins. These systematic differences suggest some 
systematic errors in protein measurement. Although we do. not 
know what the errors are, we suggest the following as a rear, 
sonable speculation. For the highest-abundance proteins, we 
may have underestimated the amount of protein because of a 
slightly nonlinear response of the phosphor imager screens. For 
the lowest -abundance proteins, Gygi et al. may have overesti- 
mated the amount of protein because of difficulties in accu- 
rately anting very small spots out of the gel and because of 
difficulties in background subtraction for these small, weak 
spots. The difference in the middle abundance proteins may be 
a consequence of normalization, given the two errors above. 

The low-abundance proteins in the data set of Gygi et al. 
have a poor correlation with mRNA abundance. We calculate 
that the r s is 0.74 for the top 54 proteins of Gygi et al. but only 
0.22 for the bottom 53 proteins, a statistically significant dif- 
ference. However, with our data set, the r 3 is 0.62 for the top 33 
proteins and 0.56 (not significantly different) for the bottom 33 
proteins (which are comparable in abundance to the bottom 
53 proteins of Gygi et al.). Thus, our data set maintains a good 
correlation between mRNA and protein abundance even at 
low protein abundance. This is consistent with our speculation 
that protein quantification by pbosphorim aging and image 
analysis may be more accurate for small, weak spots than is 
cutting out spots followed by scintillation counting. Our rela- 
tively good correlations even for nonabundant proteins may 
also reflect the fact that we used both SAGE data and RNA 
hybridization data, which is most helpful for the least abundant 
mRNAs. Jn summary, we feel that the poor correlation of 
protein to mRNA for the nonabundant proteins of Gygi et al. 
may reflect dillkulty in accurately measuring these nonabun- 
dant proteins and mRNAs,. rather than indicating a truly poor 
correlation in vivo. It is not surprising that observed correla- 
tions would be poorer with less-abundant proteins and 
mRNAs, simply because the accuracy of measurement would 
be worse. 

How well can mRNA abundance predict protein abun- 
dance? With r p = 0.76 for logarithmically transformed mRNA 
and protein data, the coefficient of determination, (r p ) 2 y is 0.58. 
This means thai more than half (in log space) of the variation 
in protein abundance is explained by variation in mRNA abun- 
dance. When converted back to arithmetic values, protein 
abundances vary over about 200-fold (Table 1), and (r^) 2 — 
0.58 for the log data means that of this 200-fold variation, 
about 20-fold is explained by variation in the abundance of 
mRNA and about ]0-fold is unexplained (but could be due 
partly to measurement errors). For proteins much less abun- 
dant than those considered here, we imagine the in vivo cor- 
relation between mRNA and protein abundance will be worse, 
and other regulatory mechanisms such as protein turnover will 
be more important. 

Some important conclusions can be drawn from this sam- 
pling of the protcome. First, there is an enormous range of 
protein abundance, from nearly 2,000,000 molecules per cell 
for some glycolytic enzymes to about 100 per cell for some cell 
cycle proteins (26a). Second, nbout half of all cellular protein 
is found in fewer than J 00 different gene products, which are 
mostly involved in carbohydrate metabolism oi protein syndic- 
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sis. Third, the correlation between protein abundance and CAI 
is log linear as far as we can see, which is from about 10,000 
protein molecules per cell to about 1,000,000. This is somewhat 
surprising, because it implies that selective forces for codon 
bias are significant even at moderate expression levels, it also 
means that codon bias is a useful predictor of protein abun- 
dance even for moderately low bias proteins. Fourth, there is a 
good correlation between pTOtein abundance and mRNA 
abundance for the proteins that we have studied. This validates 
the use of mRNA abundance as a rough predictor of protein 
abundance, at least for relatively abundant proteins. Fifth, for 
these abundant proteins, there are about 4,000 molecules of 
protein for each molecule of mRNA. This last conclusion 
raises questions as to how the levels of nonabundarit proteins 
are regulated and suggests that protein instability, regulated 
translation, suboptimal rates of translation, and other mecha- 
nisms in addition to transcriptional control may be very impor- 
tant for these proteins. 
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Pre-transiational regulation of cytochrome P450 genes is responsible for 
drrhoS PedfiC ChangeS ° f iI,divid,,a, P45 ° a ™«g patients with 

George J; LiddleC Murray M ,BYth_K, FarreUGC 

S^Tmcw 0 ! 8 troe " terol °gy He Patology, University of Sydney at Westmead 
Hospital, NSW, Australia. 

We have recently reported that disease-specific differential alterations in the hepatic 
expression of xenobiotic-metabolizing cytochrome P450 (CYP P450) enzymes occur in 
patiente with, advanced liver disease. In order to determine whether the observed changes 
m CYP proteins are modulated at pre- or pbst-translatiorial levels, we have now examLd 
me hepatic levels of mRNA for CYPs 1 A2, 2C9, 2E1 and 3A4 by solution hybridizX 
in the same livers of 20 controls (surgical waste from histologically normal livers) 32 
cases of hepatocellular and 18 of cholestatic severe chronic liver disease. CYP1 A2 
rnRNA and CYP 1 A immunoreactive protein were both reduced in livers with 
hepatocellular and cholestatic types of cirrhosis. In contrast, CYP3A4 mRNA and protein 
were reduced only in livers from patients with hepatocellular diseases. For lA2and3A4 
?«Z7?«\T, ^j™**™* betw «^ mRNA species and the respective protein 
contents (rSl A2 = 0.74, rS3A4 = 0.64, P < 0.0001). CYP2C9 mRNA was reduced in 
patients with both cholestatic and hepatocellular types of liver disease, but 2C protein ■ 

rw^ o p^/ m i' atientS WiA ******* dysfunction. The correlation between 
CYP2C9 mRNA and protem, was also significant (rs = 0.36, P < 0.005) but mRNA levels 
accounted for only 13% of the variability in protein rankings. This is p/obabryT 

irsfpT c r teins apart from 209 bein * de ^ ted *y the 

antibody CYP2E1 mRNA and protein were reduced in patients with cholestatic liver - 
disease, but in hepatocellular disease the expression of only CYP2E1 mRNA was 
decreased CYP2E1 mRNA was significantly correlated with CYP2E1 protein but 
a^untedforonlyl8%ofthevariabilitymproteinrankings(rs = 0.4^ P<00005) 
Taken collectively these data indicate that the disease-specific alterations of xenobiotic- 
metabohzing CYP enzymes among patients with cirrhosis is due, at least in part, to pre- 
translahonal mechanisms. The lack of a strong correlation between CYP2E1 mRNA and 
protem suggests that this gene, like its rat orthologue, may be subject to pre-transIatiOnal 
as well as translational and/or post-translational regulation; 

PMID: 7741759 [PubMed - indexed for MEDLINE] 
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Cyclooxygenase-2 expression in macrophages: modulation by protein 
kuiase C-alpha. 

Giroux M Descoteaux A 

Cyclooxygenase^ (COX-2) is an inducible enzyme responsible for high levels of PG 
production during inflammation and immune responses. Previous studies with 
pharmacological inhibitors suggested a role for protein kinase C (PfCC) in PG production 
possibly by regulating COX-2 expression! In this study, we addressed the role of PKC- 
alpha in the modulation of COX-2 expression and PGE2 synthesis by the overexposing 

rr^r investigated the effect of various stimuli on COX-2 expression, namely 

' A !^™T a ' intracellular parasite Leishmania donovani. Whereas LPS- 

induced COX-2 raRNA and protein expression were down-regulated in DN PKC-abha- 
overexpressing clones, IFN-gamma-uiduced COX-2 expression was up-regulated in DN 
r KC-aipha-overexpressing clones with respect to normal RAW 264 7 cells 
Measurements of PGE2 levels revealed a strong correlation between PGE2 secretion and 
ffN-garnma-induced COX-2 mRNA and protein levels in DN PKC-alpha-overexpressing 
clones. Taken together, these results suggest a role for PKC-alpha in the modulation of 
LPS-«nd IFN-gamma-induced COX-2 expression, as well as in IFN-gamma-induced . 
rubZ secretion. - 

PMfD: 11034408 [PubMed- indexed for MEDLINE] ' 
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Transcript profiling of liuraao platelets using microarray and serial 
analysis of gene expression. 

Gnatcnko DV , DunnJJ , McCorkleSR, Weissmann D Perrotta PL . Bal.ou WF . 

Department of Medicine, Program in Genetics, State University of New York Stony 
Brook 11794-8151, USA. ' 

Human platelets are anucleate blood cells that retain cytoplasmic mRNA and maintain 
functionally intact protein translation^ capabilities. We have adapted complementary 
techniques of m.croarray and serial analysis of gene expression (SAGE) for genetic 
profiling of tnghiy purified human blood platelets. Microarray analysis using the 
Affymetnx HG-U95Av2 approximately 12 600-probe set maximally identified the 
expression of 2147 (range, 13%-17%) platelet-expressed transcripts, with approximately 
22/o collectively involved in metabolism and receptor/signaling and an 
overrepresentation of genes with unassigned function (32%). [n contrast, a modified 
SAGE protocol using the Ty,,e IIS restriction enzyme Mmel (generating 2 1 -base pair 
|bp] or 22-bp tags) demonstrated that 89% of tags represented mitochondrial (mf) 
transcripts (enriched in 16S and 12S ribosomal RNAs), presumably related to persistent 
mt-traiiscnption in the absence of nuclear-derived transcripts. The frequency of non-mt 
i»AGE tags paralleled average difference values (relative expression) for the most 
abundant" transcripts as determined by microarray analysis, establishing the 
concordance of both techniques for platelet profiling. Quantitative reverse transcription- 
polymerasecham reaction (PCR) confirmed the highest frequency of mt-derived 
transcripts, along with the mRNAs for neurogranin (NGN, a protein kinase C substrate) 
and the complement lysis inhibitor clusterin among the top 5 most abundant transcripts 
tor confirmatory characterization, immunoblots and flow cytometric analyses were 
performed, establishing abundant cell-surface expression of clusterin and intracellular 
expression of NGN. These observations demonstrate a strong correlation between high 
transcript abundance and protein expression, and they establish the validity of transcript 
analysis as a tool for identifying novel platelet proteins that may regulate normal and 
pathologic platelet (and/or megakaryocyte) functions. 
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Transcript profiling of human platelets using microarray and serial analysis 
of gene expression 

Dmitri V. Gnatenko, John J. Dunn, Sean R. McCorkle, David Weissmann, Peter L. Perrotta, and Wadie F. Bahou 



Human platelets are anucleate blood cells • 
that retain cytoplasmic mRNA and main- 
tain functionally intact protein transla- 
tional capabilities. We have adapted 
complementary techniques of microarray 
and serial analysis of gene expression 
(SAGE) for genetic profiling of highly 
purified human blood platelets. Microar- 
ray analysis using the Asymetrix HG- 
U95Av2 approximately 12 600-probe set 
maximally identified the expression, of 
2147 (range, 13%-17%) platelet-expressed 
transcripts, with approximately 22% col- 
lectively involved in metabolism and re- 
ceptor/signaling, and an overrepresenta- 
tton of genes with unassigoed function 
(32%). In contrast, a modified SAGE proto- 
col using the Type IIS restriction enzyme 



Mmel (generating 21-base pair |bpj or 
22-bp tags) demonstrated that 89% of 
tags represented mitochondrial (mt) tran- 
scripts (enriched in 16S and 12S ribo- 
somal RNAs), presumably related to per- 
sistent mt-transcription in the absence of 
nuclear-derived transcripts. The fre- 
quency of non-mt SAGE tags paralleled 
average difference values (relative expres- 
sion) for the most "abundant** transcripts 
as determined by microarray analysis, 
establishing the concordance of both 
techniques for platelet profiling. Quantita- 
tive reverse transcription-potymerase 
chain reaction (PCR) confirmed the high- 
est frequency of mt-derived transcripts, 
along with the mRNAs for neurogranrn 
(NGN, a protein kinase C substrate) and 



the complement lysis inhibitor clusterin 
among the top 5 most abundant tran- 
scripts. For confirmatory characteriza- . 
tion, immunobfots and flow cytometric 
analyses were performed, establishing 
abundant cell-surface expression of clus- 
terin and intracellular expression of NGN. 
These observations demonstrate a strong 
correlation between high transcript abun- 
dance and protein expression, and they 
establish the validity of transcript analy- 
sis as a tool for identifying novel platelet 
proteins that may regulate normal and 
pathologic platelet (and/or megakaryo- 
cyte) functions. (Blood. 2003;101:2285-2293) 



e 2003 by The American Society of Hematology 



Introduction 



Human Wood platelets play critical roles in normal hemostatic 
processes and pathologic conditions such as thrombosis, vascular 
remodeling, inflammation, and wound repair. Generated as cytoplas- 
mic buds from precursor bone marrow megakaryocytes, platelets 
are anucleate and lack nuclear 0NA f although they retain 
mcgakaryocyte-denved mRNAs.' - 7 Platelets contain rough endo- 
plasmic reticulum and polyribosomes, and they retain the ability 
for protein biosynlhcsis from cytoplasmic mRNA. 3 Quiescent 
platelets generally display minimal trnnslational activity, although 
newly fonned platelets such as those found in patients with 
immune thrombocytopenic purpura (JTP) synthesize various a-gran- 
ule and membrane glycoproteins '(CPs), including GPJb and 
GPIlb/Ula (a }Jb (5j). Furthermore, stimulation of quiescent platelets 
by agonists such as a -thrombin increases protein synthesis of 
various platelet proteins, including Bcl-3. 4 Like nucleated cells, the 
rapid translation of preexisting mRNAs may be regulated by 
integrin ligation to extracellular matrices.- Jn the case of platelels, 
the primary integrin involved in ibis process appears to be a Ub $ 3 
with cooperative signals mediated by ihe collagen receptor « r p* t A 1 



Jntegrin-mediated platelet protein synthesis appears to be regulated 
at the level of iranslation initiation involving the eukaryolic 
initiation factor 4E (cIF4E). instead of directly influencing elF4E 
activity via posttranslalional modifications (ie, phosphorylation), 
platelet e)F4E activity best correlates with its spatial redistribution 
to the mRNA-enriched cyloskelelon. 8 Furthermore, because pro- 
tein translation is partially inhibited by the immunosuppressant 
rapamycin, it suggests that adhesion- and/or aggregation- induced 
outsidc-in- signaling function to regulate protein synthesis through 
the mTOR (mammalian target of rapamycin) pathway. M - 9 

Despite the biologic importance of platelets and their intact 
protein synthetic capabilities, remarkably little is known about 
platelet mRNAs. Younger platelets contain larger amounts of 
mRNA with a greater capacity for protein synthesis, as deiennincd 
by using fluorescent nucleic acid dyes such as thiazole orange.' 0 
This assay has been used as a quantitative determinant of younger 
or "reticulated" platelets (RPs). indeed increased reticulated plate- 
lets are typically found in patients with conditions associated with 
rapid platelet turnover such as )TP; typically RP percentages in 
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such patients approach 10% to 20% of all platelets, considerably 
higher than in healthy control subjects. n Interestingly^ high RPs 
have been associated with enhanced thrombotic risk when identi- 
fied in patients with thrombocytosis,' 0 suggesting thai quantita- 
tively increased mRNA levels may be associated with the prothrom- 
botic phcnotype. Whether this is related to globally altered gene 
expression profiles or to select changes more evident during 
situations of rapid platelet turnover remains unknown. Certainly, 
technical limitations of this assay limit its utility in defining 
pTOthrombotic genotypes, 10 *' 2 and it cannot identify differentially 
expressed genes that may be causally implicated in disordered 
platelet phenotypes. 

Toward the goal of defining the molecular anatomy of the 
platelet genome, we have adapted complementary techniques of 
microarray and serial analysis of gene expression (SAGE) for 
genetic profiling of highly purified human blood platelets. Microar- 
ray technology represents a "closed" profiling strategy limited by 
the target genes imprinted onto gene chips. In contrast, SAGE is an 
"open" architectural system that can be used to identify novel genes 
and to quantify differentially expressed mRNAs. ms The sequence 
of each tag along with its positional location uniquely identifies the 
gene from which it is derived, and differentially expressed genes 
can be identified in a quantitative manner because the tag frequency 
reflects the mRNA level at the time of cellular harvest and analysis. 
By using both technologies, we have identified a number of 
previously uncharacterized genes that appear to be expressed in 
human platelets, while simultaneously establishing the dominant 
frequency of mitochondrial- expressed genomes comprising the 
platelet mRNA pool. These observations provide a panoramic 
overview of the platelet transcriptome, while additionally provid- 
ing insights into the molecular pathways regulating platelet (and/or 
megakaryocyte) function in normal and pathologic conditions. 



Materials and methods 

Reagents and supplies 

Thermus aquatints fTnn) polymerase was purchased from (Roche, Indianap- 
olis. IN). T4 DNA ligase was purchased from; Invirrogen (Carlsbad. CA), 
and restriction enzymes were from New England Biolabs (Beverly. MA), 
except for Mme\. which was obtained from the Center for Technology 
Transfer (Gdansk, Poland). All oligonucleotides were synthesized on an 
Applied Biosy stems (Foster Cily, CA) 3 -channel synthesizer and are lis led 
in Table I. Monoclonal antibodies ' used for flow cytometric analysis 
included the FITC (fluorescein isothi6cyan:ite)-conjugared anti-CD4| 
(ottuPo) immunoglobulin Gt llgGl; Immunotech, Miami. FL); phyco- 
erythrin (PE)- conjugated antiglycophorin (lgG2; Becton Dickinson Pharm- 
ingen. San Diego. CA): and pcridinin chlorophyll protein (PERCP)- 
conjugated anti-CDJ> (IgG): Breton Dickinson Pharmingcn). 

Platelet isolation, purification, and immunodetection 

All human subjects provided informed consent for an IRB (Institutional 
Review Board) -approved protocol completed in conjunction with the 
General Clinical Research Center at Stony Brook University Hospital. 
Peripheral Mood (20 ml.) from hraltby volunteers drawn into 2 mL of 4% 
sodium c in ale 1 0/1% vol 'vol final concentration) was used to isolate 
erythrocytes by differentia! centrifugal ion fl500jp) or to isolate pure 
leukocytes by density- "Kidicni ccnirifugairon as previously described."' 
Plaielcts collected horn healthy volunteers by nphercsis were used within 
24 hours ol collection. After addition of 2 mM F.OTA (e thy lencdia mine tcl- 
raacetic acid), apheivsis-dr lived platelets from a single donor were 
cenirihiged at \<iUy' for 15 minutes ai ?5°C. To minimize leukocyte 
contamination, curly ihe upper 9/ 1 0 ol the platelet- rich plasma |PRP) was 



used for gel filtration oyer a BioGel A50M column (1000 mL total volume) 
equilibrated with HBMT (II EPES- buffered modified Tyrodes buffer: 10- 
mM HEPES (W-2-hyu^oxyeth*ylpipera2ine-/V'-:2-eth3ncsulfor>ic acid) pH 
7.4/150 mM NaCI, 2.5 mM KC1. 0.3 mM NaH 2 P0 4 , 12 mM NaBCOj, 
0.2% bovine serum albumen [BSA], 0.1% glucose, 2 mM EDTA). 
Gel-filtered platelets (GFPs) were subsequently filtered through a 5-pjn 
nonwetting nylon filament filter (BioDesign, CarmeL NY) at 25°C and 
harvesled by centrifugal ion at J 500^ for 1 0 minutes at 25°C. Platelets were 
gentry and thoroughly resuspended in 10 mL HBMT buffer and incubated 
with 120 >L murine monoclonal anti-CD45 antibody conjugated to 
magnetic microbeads (Miltenyi Biotec, Bergisch Gladbach, Germany) on a 
rotating platform for 45 minutes at 25°C. Magnetic separation columns 
were used to capture CD45* cells (leukocyte fraction) by positive selection 
(MACS II; Miltenyi Biotec). Purified platelets were concentrated by 
cenln fugat ion at 1500g and immediately used for total RNA isolation. 

The efficiency of platelet purification was documented at each step by 
flow cytometry. 17 Briefly, aliquots containing 2X I0 6 platelets were 
incubated with saturating concentrations of FITC- conjugated ariri-CD4I, 
PE-conjugaled antiglycophorin, and PERCP-conjugatcd anti-CD45 for 15 
minutes in the dork at 25 0 C, washed with phosphate-buffered saline (PBS), 
and fixed in PBS/1% formalin. Samples were analyzed using a FACScan 
(fluorescence-activated cell sorter scan) flow cylometer (Becton Dickinson) 
using CELLQuest software designed to quantify the number of CE>45 + and 
glycophorin- positive events in the sample (expressed as Ibc number of 
events per 100 000 CD41 + events). For some experiments, fixed platelets 
were permeabilized with 0.1% Triton-X/PBS for 30 minutes at 25°C prior 
to the addition of primary antibodies, all as previously described. 17 

Platelet protein detection was completed by sodium dodecyl sulfate 
(SDS)-polyacrybmide gel electrophoresis (PAGE) and immunoblot analy- 
sis as previously described, using the species-specific horseradish pcroxidase- 
conjugated secondary antibody and enhanced chemiluminescence. 16 Anti- 
bodies included the anticlusterin monoclonal antibody (Quide), Santa Clara, 
CA; 1:1000 primary and 1:10 000 secondary) and the anlineurogranin 
rabbit polyclonal antibody (Chemicon International, Temecula, CA; 1:1000 
primary and 1:10 000 secondary). 

Molecular analyses and microarray profiling 

Purified, individual cell fractions were resuspended in 10 mLTrizol reagent 
{Invirrogen), transferred into diethylpyrocarbonale (DEPC)-lreated Corex 
(Springfield, MA) tubes, and serially purified and precipitated by using 
isopropanol essentially- as previously described. 16 . Total cellular RNA was 
harvested by centrifugalion at 12 500# for 20 minutes at 4°C, washed 2 
times with 75% ethano) (JO mL/tnbc). and resuspended in 100 p.L 
DEPC- treated water. Plate let. mRNA quantitation was performed by using 
fluorescence-based real-time PCR (polymerase chain reaction) technology 
(TaqMari Real-Time PCR; Applied Biosy stems, Foster City. CA). Oligonu- 
cleotide primer pairs were generated by using Primer 3 software (www- 
genome. wi.mit.edu), designed to generate approximately 200~base pair 
Ibp) PCR products at the same annealing temperature, and are outlined in 
Table I. Purified platelet mRNA (4 ng) was used for first-strand cDNA 
synthesis using oligo(dT) and Superscript )) reverse transcriptase (Invjlro- 
gen): For real-time reverse transcription (TO*)- PCR analysis, the RT 
reaction was equally divided among primer pairs and used in a '10-cycle 
PCR reaction for each target gene by using the following cycle: 94°C for 30 
seconds, 55°C for 30 seconds, 77°C for 1 minute, and 7I°C for 10 seconds 
{40 cycles total). mRNA levels were quantified by monitoring real-lime 
fluoiomeiric intensity of SYBR cteen I. Relative mRNA abundance was 
determined from triplicate assays performed in parallel for each primci pair 
and was calculated by using the comparative threshold cycle number (A-Ci 
method) as previously described." 1 

Gene expression profiles were completed by using the approximately ■ 
12 600-prnbc set MG-U95Av2 gene chip (AfTymeti i.x. Santa Clara, CA). 
Total cellular RNA (5 pg) was used for cDNA synthesis by using 
Super Script Choice system (Life Technologies. Kockville. MD) and an 
oligo(dT) primei containing the 77 polymerase recognition sequence 
tPiiincr SI; Table I), followed by cDNA purification using GFX spin 
columns. In vitro tianscription was completed in the presence of biotinyl- 
aied ribonucleotides by using a BioAnay High Yield RNA Transcript 
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Table 1. Oligonucleotide primers 



Primer 



Gene and primer 
direction 



SI 


Qligo (dT) 


Cassette A 


SAGE 


Cassette B 


. SAGE 


S2 


SAGE • 


S3 


SAGE 


GP4 


Glycoprotein fIB |F) 


GP5 


Glycoprotein IIB |R) 


GP6 


Glycoprotein IDA |FJ 


GPV 


Glycoprotein ItIA |R) 


PAR18 


PARI [F| 


PAR19 


PARI [Rj 


SR8 


16SrRNA |F) 


SR9 


16SrRNA|R| 


NADH10 


NADH2IH 


NADH11 


NADH2[R| 


THYM12 


Thymosm [J4 |FJ 


THYM13 


Thymosin p4 |RJ 


CLUS14 


ClusterinJF] 


CLUS15 


Ctusterin |RJ 


NRG 16 


Neurogranin (FJ 


NRG17 


Neurogranin fR) 




T-ceB receptor 


7CR1B 


p-cham(F) 




T-ceH receptor 


TCR19 


p-cbain JR) 


CD4520 


CD45|F) 


CD4521 


CD45 f R| 



Sequence (5' - 3') 



Nucleotide 
Position 



S'-Bn-GGCCAGTGftATTGTAATACGACTCACTATAGGGAGGCGG- {dT) y 

5 ' - TTTGGATTTGCTGGTCGAGTACAACTAGGCTTAATCCGACATG - 3 ' 
3 ' - * CCTAAACGACCAGCTCATGTTGATCCGAATAAGGCTp- 5 ' 
5* - pT TC ATG GCGGAGA CGTC CGC CA CT A GTGTCGCAA CTGA CT A * - 3 ' 
3 *- KNAAGTACCGCCTCTGCAGGCGGTGATCACAGCGTTGACTGAT- 5' 



Bn - GGATTTGCTGGTCGAGTACA- 3 ' 
Bn-TAGTCAGGTGCGACACTAGTGGC- 3 * 
AGGGCTTTGAGAGACTCATCTGTA - 3 ' 

- ACAATCTTGCTGTTTGGATTCTG - 3' 
TATAAAGAGGCCACGTCTACCrrC - 3 ' 

- CACTTCCACATACTGACATTCTCC- 3 ' 

- AATGTCAGTTCTGATATGGAAGCA - 3' 
CCCAAATtJTTCAAACTTCTTTAGC' 3 ' 

- TGCAAAGGTAGC ATAATCACTTGT - 3 ' 

- GTTTAGGACCTGTGG G TTTGTTAG - 3 ' 

- CTAGCCCCCATCTtAAATCATATAC- 3 ' 

- AATGGTTATGTTAGGGTTGTACGG - 3 ' 

- AAGACAGAGACGCAAGAGAAAAAT- 3 ' 

- GCAGCACAGTCATTTAAACTTGAT- 3 ' 
- CCAACAGAATTCATACGAGAAGG- 3 ' 

- CGTTATATTTCCTGCTCAACCTCT- 3 ' 

- GCCCTTTTAGTTAGTTCTGCAGTC -3' 

- TTTTCTTTAAGTGAGTGTGCTTGG - 3 ' " 



5 - CCACAA CTATGTTTTGGTATCGT- 3 * 

5 ' - CTAGCA CTGCAGATGTAGAA GCT - 3 ' 
5 ' - GCTCAG AATGGACAAGTA- 3 ' 
5 ' - CA CA C C C ATA CACA CATACA - 3 ' 



2094-2117 
.2301-2279 
2335-2356 
. 2532-2509 

2585- 2608 
2776-2753 

2586- 2609 
2785-2762 ' 
4875-4898 
5075-5052 

135-158 
336-313 • 
1006-1028 
1222-1 199 
135M374 
1567-1544 



332-310 
3771-3788 
4280-4261 



IF J indicates forward (sense) sit and; (RJ, reverse {antisense) strand. Bn, biotin; p. a phosphorylaled 5' end (cassettes A and B); under lining, Ntolfl sites in cassettes A and B; 
arrows, conespondbg sequence lor S2 and S3 within cassettes A and B. respectively; bold, the Mmel site: and N, A. C. T. or G, nucleotide position based on the following 
accession numbers: glycoprotein IIB (J02764), glycoprotein IDA (M35999). PARI (M62424). 16S rRNA and NADH2 (NC_ 001807). thymosin &4 (M17733). ctusteiin (M25915). 
neuroorarwn {X99076}, TCR B-chain (AF 043182), CD45 (Y00638). 

"Indicates an amino-modified 3' end in both cassettes: — . not applicable. 



Labeling Kit (Enzo Diagnostics, Farmingdale, NY), and, after metal- 
induced fragmentation, 15 p-g biorinylaied cRNA was hybridized to the 
HG- U95Av2 oligonucleotide probe array for 16 hours at 45°C After 
washing, the cRNA was detected wilb slreplavidin-phycoerythrin (Molecu- 
Jar Probes, Eugene, OR) and analysis was completed by using a Hewlett- 
Packard Gene Array Scanner (AtTymcirix). The average difference value 
(AD) for each probe set was quantified using MAS 4.01 software 
tAjTymctri*), calculated, as an average of fluorescence differences for 
perfectly matched versus single-nucleotide mismatched 25-mer oligonucle- 
otides (lb to 20 oligonucleotide pairs per probe set). The software is 
designed to exclude "positive colls" in the presence of high average 
differences with associated high mismatch intensities. 

SAGE profiles 

I Mate let SAGF. libraries .were genet at ed essentially as previously de- 
scribed. 1 ' modi lied as outlined in Figure I for the "use of Mme\ as (he 
lagging enzyme.'* This type IIS restriction enzyme cleaves 20 of 18 bp past, 
its nonpalindiomic (TCCRAC) recognition sequence, thereby genei3ting 
longer tags I? I - or 2?-mcr) than those obtained using Bsmf\ as 'he standard 
tagging enzyme (13-14 bp tags). These ■ longci .Vmel-generaicd tags 
potentially provide for more definitive "tag- lo- gene 1 ' identification and are 
particularly useful in characterizing expression patterns in the absence of 
complete genomic sequence data (comprehensive met hods detailed in Dunn 
cl al l? ). Briefly. poly(A) mRNA was isolated from 10 pg toial platelet RNA 
using the olrgo-tdT) SI primer conjugated to magnetic beads (Dynal 
Ototcch. Lake Success. NY). followed by cDNA synthesis using Super- 
script I) icvcrse transcriptase I Invitiogen). The cDNA was then digested 



with the restriction enzyme NIa)]) (anchoring enzyme), ligated to cassette A 
using T4 DNA ligase, and, after the beads were extensively washed, the 
cDNA was digested with Mmel to release the tags from the beads. After 
purification, lags were ligaicd to degenerate cassette B linkers (specifically 




F inure 1. Schema ou I lining the modified SAGE protocol used in platelet 
analyses. The final tags Ore flanked by the M/alll (anctronng en/yrne) CAlG 
sequence, thereby ptovidino lap- to- gene identification wNen exported lo a relational 
database ( refer to "Biyitttonnatic analyses" ant) labfp 1 lor details). 
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designed to anneal to the nonuniform Mmel overhangs), and PCR-ampIified 
using biolinylated primers S2 and S3 for 30 cycles (95°C for 30 seconds; 
. 58°C for 30 seconds; 12°C for 30 seconds) using Platinum Taq DNA 
polymerase (Gibco BRL). A fraction (20%) of the pooled PCR products 
were then subjected to one round of linear amplification using primer pair 
S2/S3, followed by a second round of 25 amplifications using primer S2 
alone (95°C for 30 seconds. 58°C for 30 seponds, 72°C Tor 30 seconds); 
Primer S3 was subsequently added for one cycle (95°C for 2.5 minutes, 
58 0 C for 30 seconds, 72°C for 5 minutes); the latter steps were collectively 
adapted to exclude heteroduplex formation. 18 Unincorporated primers were 
removed by incubation with 200 U Escherichia colt exonucleasc I for 60 
minutes at 37°C PCR products were then pooled and digested with AVoJIJ 
to release tags, and bio tiny la ted linker arms were cleared using streptavidin- 
coated immunoaffinily magnetic beads (Dyna! Biotech). Tags were con- 
caramelized using 5 U/uX T4 DNA ligase, and products more than )00 bp 
were isolated by size- fractionation in low-melting agarose gels. The DNA 
was purified by GFX spin columns., and the concatamers were cloned into 
the Sph\ site of pZero (Invitrogen). Afler transformation into Ecoli TOP 10 
cells, recombinant clones were isolated and sequenced in 96-well microliter 
plates using, an ABI 377 sequencer and ABI Prism BigDye terminator 
chemistry (Perkin-Elmer Applied Biosyslems. Brancbburg, NJ). . 

Btoinformatic analyses 

Functional grouping of genes determined lo be present by AfTymetrix MAS 
4.01 software was performed using a dChip program linked to the National 
Center for Biotechnology LocusLink, which is an annotated reference 
database for genes and their postulated functions. 20 Of the approximately 
12 600-probe sets represented on the Asymetrix HG-U95Av2 Gene chip, . 
functional annotations exist for approximately 8100 with the remainder 
categorized as unknown. Microarray data were visualized and analyzed 
using BRB- Array Tools software (Version 2.1), kindly developed and 
provided by Dr Richard Simon and Amy Peng (Iinus.nci.nib.gov/BRB- 
ArrayTools.html). A logarithmic (base 2) transformation was applied to the 
average difference values for individual data sets for determination of 
microarray concordances. Discordancy was defined as a 2- log difference tn 
the maximum log intensities between individual experiments. 

SAGE lags were extracted by usinj: in-house SAGE software uniquely 
modified to identify Mme\ tags. The software ensures that only unambigu- 
ous 21- to 22-bp tag sequences are extracted for transcript profiling. Tags 
with ambiguities (Ns), Icnglhs other lhao 21 or 22 bp, or with ambiguous 
orientations were extracted to separate files for manual editing or further 
examination. Finalized data were exported to a relational database for tag 
quantification and genetic identification/ 1 * 



Results 

Platelet purification 

To ensure that the RNA profiles accurately represented those of 
circulating blood platelets, a number of complementary methods 
were implemented to remove contaminating nucleated leukocytes. 
Purification methods incorporating gel filtration, □ 5-pm leukocyte 
reduction filter, and magnetic CD45 immunodepletion allowed for 
the cumulative enrichment of highly purified platelets. The efficacy 
of this "purification method was initially established by using, 
peripheral blood platelet-rich plasma as the sorting material. The 
final product contained no more than 3 lo 5 leukocytes per I X I0 5 
platelets as defer mined by parallel flow cytometric analysis, 
representing an approximate 450- fold reduction of nucleated 
leukocytes. These results correlated well with molecular evidence 
for leukocyte depletion as drier mined by RT- PCR using both CD45 
and T-cell receptor [i- chain (TCRp) primers (see Figure 2). 
Because I he total RNA yield front peripheral blood platelets was 
insufficient for microanny studies, we adapted the protocol lo 
platelet aphetesis donors with nearly identical final purily (Figure 



A B 




PRP Rtotetots 

Figure 2. Determination of platelet purity. (A) Total cellular RNA (1.8 jig) from 
pbtelel-ricb plasma (PRP) or purified platelets from a single aphcresis donor were 
analyzed by RT-PCR (35 cydes) using oHgonucleotide primers specific for glycopro- 
tein lib (GPIIb), T-celJ receptor p-chain (T CRJJ), or CD45; 10 j±L of the 50 uL reactions 
were analyzed by etbidiunvstained agarose gel electrophoresis. Minimal to no TCR0 
gene product was visually evident only in PRP. Size markers corresponding to 
/ toe til- restricted +X174 DNA are shown. (B) Real-time RT-PCR was completed by 
using 1.8 jig total RNA and TCR3-specific oligonucleotide primers optimized lor 
quantitative analysis by real-rime PCR.* 8 On the basis of parallel ^terminations 
using RNA isolated from known amounts of purified leukocyte standards, the 
leukocyte-depletion protocol represents an approximate 2.5- log purification from the 
starting PRP. Results are representative of on© complete set of experiments repeated 
on 2 separate occasions, and data points represent the mean from triplicate wells, 
with standard errors of the mean (SEM) less than 1 % (not shown). 

2). The platelet recovery was nearly 65% of the starting material, 
yielding approximately 2.3 X 10" platelets from an initial aphere- 
sispack containing approximately 3.6 X 10" platelets. The bulk of 
the losses occurred during the initial centrifugal ion and filtration 
steps. The purification protocol was less effective at removing 
erythrocytes, although there were less than 50glycophorin-posiiive 
cells per 1 X JO 5 platelets after the final purification step. Nonethe- 
less, these cells represent unlikely sources for contaminating 
cellular RNA (see "Cellular microarray analysis** below). 

Cellular microarray analysis 

The purified platelet RNA was sufficient for microarray studies and 
was used for cRNA generation and hybridization to the Aflymelrix 
HG-U95Av2 GeneChip. The anatomic profile of platelet RN As 
from 3 healthy male donors was determined by using AfTymetrix 
software. Of the 12 599 probe sets imprinted onto the chip, a 
maximum of 2147 (17%) transcripts were computationally identi- 
fied as "present" by the AfTymetrix software, 152 (1.2%) were 
equivocal, and nearly 82% were absent. As a fraction of the total 
genes present on the chip, the percentage of platelet- expressed 
genes (I5%-17%) was generally lower than that obtained from 
other human cell types in which 30% to 50% of genes are present as, 
determined by AfTymetrix software (J. Schwedes, personal commu- 
nication. May 2002). The "limiled number" of platelet-expressed 
transcripts presumably reflects the lack of ongoing gene transcrip- 
tion in the anuclcote platelet. Because less than 1% of circulating 
red blood cells contain residual RNA, it is unlikely thai any of these 
transcripts are ery throcyte derived, although this was formally 
addressed by isolating total cellular RNA from 20 mL of whole 
blood (corresponding loan -3-log fold excess of erythrocytes than 
thai identified in our final sample). The total cellular yield of RNA 
from this starting material was approximately 250 ng f suugcstinc 
lhat less than 1 nc erythrocyte-derived RNA was present in the 
purified platelet preparations. Despite this., however, both a- and 
(3-globin transcripts — alom: with both the ferritin heavy and light 
chain? — were identified as abundant transcripts (Table 2). Al- 
though the most parsimonious explanation would be residual contami- 
nating reticulocytes, (his is not supported by our ery throcyte contami- 
nation estimates, and their significance remains unresolved. 

As a means of better dissecting the molecular anatomy of the 
platelet, expressed genes were grouped on the basis of assigned 



BLOOD, 15 MARCH 2003 * VOLUME 101; NUMBER 6 



PLATELET TRANSCRIPT PROFILING 2289 



Table 2. Top 50 human platelet-expressed genes . 


Access) on no. 


Gene symbol 


AD values, range* 


Gene transcript^ 


Leukocyte expressford; 


" M17733 


TMSB4X 


140142-307 852 


Thymosin 84 mRMA, complete cds 




X99076 


NRGN 


101 510-148 279 


Neurogranin gene 


+ 


M25079 


HBB 


40 839-229566 


P-gtobtn mRNA. comptete cds 


+ 


M25915 


CLU 


84 720-140 246 


Complement cytorysis rnhtoitor (clusierin) comptele cds 


_ 


J04755 


FTHP1 


82 980-148 621 


Ferritin H processed pseudogene. comptete cds 


_ 


D78361 


OAZ1 


73 098-118 140 


mRNA for ornithine decarboxylase aniizyme 


_ 


X04409 


GNAS 


77 761-94 781 


mRNA Tor coupling protein G(s) o-subunit (alpha-Si) 




M25897 


PF4 


62 61 1-126 908 


Platelet factor 4 mRNA. complete cds 


_ 


AB021288 


B2M 


61 689-108 021 


p2- microglobulin 




X0035I 


ACTB 


25 143-73 775 


mRNA for p-aclin 


_ 


D21261 


TAGLN2 


76687-101931 


mRNA for KIAA0120 gene . 




AL031670 


FTLL1 


69 865-99 966 


Ferritin, light polypeptide 1 


* ■ + 


U59632 


" GPIIB 


41404-110328 


Platelet gtycoprotein top chain mRNA 




M21121 


CCL5 


47 308-106 399 


T-cett-spetific protein (RAN7ES) mRNA, complete cds 




X13710 


GPX1 


41 316-96 878 


Urvspficed mRNA lor glutathione peroxidase 




. J00153 


HBA1 


21 326-144 201 


Alpha gtobtn gene cluster on chromosome 16 




M22919 


MY16 


46337-106 833 


Nonrnusde/smooth muscle aDtali myosin fight chain gene 




L20941 


FTH1 


52 787-74 763 


Ferritin heavy chain mRNA, complete cds 




J03040 


SPARC 


51 156-74 261 


SPARC/osleonedtn mRNA, complete cds 




X 56009 


GNAS 


45 543-72 096 


riH^rvi a ^Ui/Ulifl Ul 1 * U'/jOo)Q protein 




X58536 


HLA 


31 183-82 613 


mRNA for rriA»(">f Mt A rfn^i t Irm r* koum r-hitn 
vi '* list Mkcsfm ni_n i^i«jc>o i pt>cAJc» l# ntjovy cnam 




M54995 


PPBP 


46 571-67 169 


F!nnnpf-|kv^ >ic k» trtfl/atirm ru>rhttrla til mDMA 




U34995 


GAPD 


35 095-70 250 


Nflfmnl k f>f ^TlTVWV/lo ihchvlinn ITRr*m» rnDMA r-lAAn VVWft 

twitnoi ntncriiinA.yit; ;>uu;>u t H.lJUn (IDTDry ijlrtrMA, ClOne HZ^a 


+ 


L40399 


MLM3 


32 107-73 364 


Vr'uf ie x up i i z (inu<L prtnem nornoioy -J ) mKMA 




X77548 


NCOA4 


• 31 452-61 036 


cDNA for RFG (RETprolo-oncogene RE7/PT C3) 




U90551 


H2AFL 


35 086-51 892 


Histone 2A-tiKe protein (H2A/I) mRNA 




Ml 1353 


H3F3A 


31 614-55 813 






212962 


RP141 




mRNA tor bomologue to yeast ribosomal protein L4 1 




X06956 


TUBA1 




nALrnA 44 gene tor a-tuouftn 




AB028950 


TLN1 


?4 Kit 


mRNA for KtAA 1027 protein 




Y12711 


PGRMC1 


JO I / H 


mRNA for putative progestetone binding protein 


- 


M16279 


MIC? 


30 894-48 166 


Integra led membrane piotein (MIC2) mRNA 




D78577 


YWHAH 


-24 785-50 437 


Oiain 14-3-3 protein 0-chain 




AF 07 0585 


70P3B 


20 027-67 945 


Clone 24675. unknown cDN A 




AA524802 


Unknown 


23 846-39 481 


CDNA. IMAGE clone 954 2 1 3 




AB009010 


UBC 


28 745-38 389 


mRNA lor potyvbiquitin UbC 


4 


X57985 


H2AFO 


21678-52 108 


Genes lor hrslones H2B. 1 and H2A 




X54304 


MICB 


25 733-34 109 


mRNA lor myosin legutotory fight chain 




M14539 


F13A1 


23 691-48 474 


Factor XIII subunil n-polypeptide mRNA. 3' end 




AI540958 


Unknown 


24 872-41 118 


cDNA. PEC 1.2_15_HOI,r 5' end/cton 




AL050396 


FLNA 


13 634-55 235 


cDNA DKFZp 586K1720 




X56841 


HLA- E 


12 890-49 327 


Nonclossical MHC class 1 antigen gene 




M26252 


PKM2 


15 450-47 786 


7CB (cylosolic thyroid hormone- binding piotein) 




M14630 


P7MA 


19 314-45 088- 


Prolhymosin alpha mRNA 




AF045229 


RGS10 


19 156-34 243 


Regulator ot G piotein signating 10 mRNA 




AA477898 


Unknown 


16 863-44 756 


cDNA. Z«34l08."rl 5' end 




X95404 




15 216-37 456 


mRNA loi nonmusde type cofitin 




M34480 


HGA2B 


8 627-45 405 


Ptaletet gtycoprotein lib (GPttb) mRNA 




283738 


M2BFF 


T8 001-31 306 


HH2B/e gene 




1 19779 


H2AFO 


17 319-38 951 


Histone H2A.2 mRNA. complete Cds 





Gene op.es*.nn ouanKf^cons were calculated as ^average d,lference (AD) votue (matched versus mismatched oligonucleotides) lor each p,obe set using Arfymetm 
GeneChrp -.o)tw3.e. version 4.01. Ihe range of values from 3 distinct platelet micioanays is shov.n; the normalization value lor aft microanay analyses was 250 

17,anscrppts are .»nk-«dered (highest to lowest) using BRB-Arr.^lools soflwar e by tog in.ens.lies ol AD values obtained from 3 different healthy donors 33 ot the top 40 
transcripts were listed among ihe lop 50 in all 3 micioa/roy sets. * 

» Le uho ^ te "session was delcnnincd by mkroorray analysis usinp p, w ified peripheral blood leokocytes. followed by construction ot rank-iotensiry pt 0 . s lor comparison to 
platelet top 50 (mnr.cp.s » lop leukocyte- derived tronscripts kfcrriihed within the r 3 nked lop 50 platelet transcripts are depicted by a H ) picsent or (- ) abserr. 

cds indicates coding sequence. 



<?cnc annotation*, and this analysis was used to provide a pan- 
oramic (tcfmrtii)n of fl.e plaick-t Uanscriptonie.-Of the gents lhal 
could be caialoLH-d within assigned "clusters/' those involved in 
metabolism (11%) ™d rect-pior/signaling (IJ%) represented the 
larpcst groups. Also evident in these analyses is ihe relalivelv lnii>e 
pciccmape ol genes involved in functions umelaied to these key 
groups lie. niisccllaneons. 25%). and ihe ovenepreseniatkm of 
cene^ with unknown function (32%) as annotated by Aflymctrix 



and RefScq databases^' These results identify □ vasi anay (neaily 
one halQ ol' platelet genes {and gene products) thai presumably 
have important, but poorly characterized functions, in plalelel 
nnd-'or megakaryocyte biology. 

Although microanny analysis is not truly quantitative, rank- 
oidering using ihe mean Jog- intensities from 3 independent micioar- 
ray analyses allowed for the categorization of the top platelef 
transcripts {Table 2). Computational analyses demonstrated lhal 
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only ] 0 of the top 100 genes were discordant among the 3 platelet 
microarrays, although 71 of 100 genes were discordant between 
platelet and leukocyte arrays. An inventory of the top 50 platelet 
genes is listed in Table 2, which also delineates those found to be 
highly expressed in peripheral blood leukocytes by parallel microar- 
ray experiments with this purified cellular fraction (data not 
shown). Further analysis of these cell subsets demonstrated that 
approximately 25% (n - 547) of the total platelet transcripts were 
platelet restricted. Furthermore^ only 10 of the 50 most highly 
expressed genes were found to overlap,, confirming the distinct 
cellular profiles of each transcript ome. Of the 12 overlap genes, 3 
corresponded to globin or ferritin chains (again suggesting the 
presence of contaminating reticulocytes in both purified fractions), 
and another 4 were involved in actin cytoskeletal reorganization 
and human leukocyte antigen (HLA) expression, gene products that 
regulate critical functions in both cell types. Given the importance 
of cytoskeletal reorganization in downstream platelet activation 
events, il is not unexpected that components of the actin machinery 
system would demonstrate prominent transcript expression. Previ- 
ous estimates suggest that 20% to 30% of the total platelet 
proteome is comprised of aclin with other components such as 
actin- binding protein, mysosin, and talin accounting for an addi- 
tional 2% to 5% of the total protein.'- 27 The mRNAs encoding the 
actin-related machinery are overrepresented in , our microarTay 
analysis, with 8 such transcripts found among the 50 highest 
platelet-expressed genes. Interestingly thymosin 04 demonstrated 
the highest expression pattern. In unstimulated platelets, 30% to 
40% of actin is polymerized as F- actin, 22 whereas the balance of 
actin monomers (G-aciin)'are polymerization inhibited by seques- 
tering proteins such as proftJin (100 jiM) and thymosin 04 (600 
pM)." The high thymosin (34 transcript expression not only 
correlates with its known abundance in platelets but also supports 
the importance of actin inhibitory proteins in maintaining the 
nonslimulated state of circulating platelets. 

Platelet SAGE analyses 

Although these initial studies identified the distribution and relative 
expression patterns of the genes within the Asymetrix data set, 
they do not allow for analyses of genes ihal arc unrepresented by 
these oligonucleotide chips. Unlike closed microarray profiling 
strategies, SAGE is an open architectural system that is ideally 
suited for novel gene and pathway identification. Accordingly, the 
platelet RNA used for microarray studies was used for platelet 
SAGE. A total of 2033 tags were initially cataloged, of which J 800 
(89%) corresponded to mitochondrial-derived genes. These results 
weie quite differ em from those obtained by microarray analyses, 
but ihe discrepancy can be resolved by the nonrepresentation of the 
mitochondrial genome on the gene chip. The mitochondrial ge- 
nome is a compact approximately 16.6-kilobasc (kb) sequence 
encoding 13 genes and 2 ribosomal submits. 74 Primary mitochon- 
drial transcripts are polycislronie and rypically contain premature 
termination or unpredictable splice sites, resulting in multiple 
polya.denylated transcripts from individual genes.- 4 : - s Indeed, the 
overall distribution of platelet- derived mitochondrial SAGE lags is 
quite similar to that found in muscle. 7 - 4 All 13 genes containing 
Midi I sites were detected, whereas neither of the non-A'MID- 
comarning ^encs were identified -(nicotinamide adenine dinuclco- 
tidc (NADU J dehydrogenase subunit 41. and adenosine triphos- 
phatase (ATPaseJ 8). Most of the tags were from the 16S and 12S 
ribosomal RN As— which collectively accounted for 68% of the 
total mitochondrial tags— with the fewest tags represented by 
NADU dehydrogenase subunits 3.5. (>, and cytochrome c oxidase 1 



l I KADHI Ft WADH2 re&J COI ft COO B 

~J~i — n ji — i i ii rnr — rr 

K f . ,l. c . otH|1 ■. "M"** M . t * Mmi 'i^m?'*gr 

t K KHL i ♦ i * i i 1 n u ' ii"n — 



ATT»*>c* 



NADH3 
J 



Figure 3. Schema d( the mitochondrial genome with SAGE tog distributions 
(only tags with identical matches are displayed). The abundance of the SAGE 
tags (n = 1800) a* individual /Vtetll sites (arrows) within the mitochondria) fWavy 
strand is shown on the bottom, whereas those tags corresponding to the mitochon- 
drial Hght strand are definealed above the arrows (the presence of an unaccompanied 
arrow implies no SAGE tags at that Malll site). The gene products of mt-DNA 
(RelSeq accession no. NC_001BO7) are delineated by the open rectangles, whereas 
stippted boxes represent tRNA genes and control regions (the single tag represented 
by. the n refers to rrolochondriat transler RNA-serrne). Note that'NADH6 is encoded 
by the fight strand and that them are no MatH sites within the ATPaseB gene segment. 
CO|n), cytochrome c oxidase subunit: Cyt b, cytochrome b. 

(Figure 3). The NADH dehydrogenase subunit 6 RNA is the only 
mRNA encoded by the light (L) strand of mitochondrial DNA and 
was the least abundantly detected transcript. • 

The unusually high preponderance of mitochondrial-derived 
genes is not inconsistent with the known enrichment of these 
genomes in human platelets/- 24 and presumably reflects persistent 
transcription from the mitochondrial (mi) genome in the absence of 
nuclear- derived transcripts. This overrepresentalion of mtDNA in 
platelets is considerably greater than that of its closest cell type 
(skeletal muscle), in which ml genomes represent approximately 
20% to 25% of all SAGE tags. 25 Interestingly, the energy metabo- 
lism of platelets is not dissimilar from that of skeletal muscle, both 
cell types actively using glycolysis and large amounts of glycogen 
for ATP generation. 7 * Like muscle, platelets are metabolically 
adapted to rapidly expend large amounts of energy required for 
aggregation, granule release, and clot retraction. Similar to the 
situation in all eukaryotic cells, platelet mitochondria represent the 
primary source of ATP, which is generated from oxidative phosphor- 
ylation reactions occurring within these organelles. Mitochondria 
are also responsible for. most of the toxic reactive oxygen species 
generated as by-products of oxidative phosphorylation and are 
central regulators of the apoptotic process in other cellular types. 
The mtDNA encodes polypeptides found within 4 of the 5 
multifunctional complexes that regulate oxidative phosphorylation 
within the platelet mitochondria. 27 Whether the continued genera- 
tion of these polypeptides has a role in platelet energy metabolism 
and/or the apoptotic mechanisms regulating platelet survival re- 
mains speculative, although not inconsistent with our observations. 

Comparative analysis of SAGE and microarray 
transcript abundance 



Complete SAGE libraries require the sequencing of up to 30 000 
tags for nn exhaustive cataloging of individual mRNAs, especially 
those with limited copy number s. ,X2 * Given the preponderance of 
ml- derived transcripts, comparable sampling would have required 
sequence analysis of nearly 300 000 SAGE lags, an inordinate 
number for comprehensive analysis of the platelet transcriptome. 
hot platelets, alternative methodologies incorporating subtract ive 
SAGE will be required for more comprehensive transcript profil- 
ing/ 0 Our initial sampling of nonmitochomlrial genes temains 
informative, however. and entirely consistent with, the results of 
platelet microarray studies. As shown in Table 3, SAGE tags lot the 
genes encoding thymosin 3-1, |*2- microglobulin, newogranin. and 
the platelet glycoprotein Ibt3 polypeptide were among the most 
frequently identified platelet genes, similar to the rank -otde red 
results determined by microarray analysis. To formally confirm the 
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Table X SAGE-idcntified nonmitochondrial tags 


Frequency 


CATG + SAGE lags' 


Accession no.t 


Gene 


Mfcroarrayt 


26 


GTT G7 GGTTAA7 CT GGT 


NM_ 004048.1 


^2-miCT09lobufin{B2M), mRNA 


PPP 


21 


TTGGTGAAGGAAGAAGT 


NM_021 109.1 


Thymosin B4 ; X chromosome (TMSB4X), mRNA 


P 


8 


AGCT CCGCAGCCAGGTC 


NM_002620.1 


Platelet factor 4 variant 1 (PF4VI). mRNA * 


P 


8 


AGCTCCGCAGCCGGGTT 


NM_002619.1 


Platelet factor 4 (PF4), mRNA 


P 


7 


T GTATAAAGACAACCT C 


NM_0O2704.1 


Proplatelet basic protein (pvthrovnbogtobuBn) 


FT 


5 


GGGCACAATGCGGTCCA 


NM_000407.1 


Glycoprotein Ibp* porypeptide. mRNA 


P 


3 


AGGTAATAAAAGGTAAT 


NM_00351Z1 


H2A hislone tarnfly, member L (H2AFL). mRNA 


P 


3 


AG7GGCAAG7AAATGGC 


NM_ 021914.2 


CofiJin 2 (muscle) (CFL2), mRNA 


N/A . 


.3 


T GACT GTGCTGGGTTGG 


NM_006176.1 


Neurogranin {protein kinase C substrate. RC3) mRNA 


P 


3 


TTGGGGTTTCCTTTACC 


NM_ 002 032.1 


. Ferritin, heavy polypeptide 1 (FTH1). mRNA 


P 


2 


CCCTTGTGACTACCTA7 


NM^ 025 158.1 


Hypolhetica? protein FU22251 (FU22251). mRNA 


N/A 


2 


CCT G7AACCCCAGCTAC 


NM_032779.T 


Hypothetical protein FU14397 (F1J14397). mRNA 


N/A 


' 2 


CTTGTAGTCCCAGCTAC 


NM.017962.1 


Hypothetical protein FLJ20825 (FLJ20825), mRNA 


. N/A 



"Unique tags identified moro than once. 

1 Refers to the RefSeq accession no. 2 ' Note that this number does not necessarily correspond to the accession no. provided by Asymetrix software annotations (Table 1). 

JPresence (P) or absence (A) is based oh results from 3 distinct platelet mtcroarray experiments. Capitalized *P" designates a gene that is in the top 50 on all 3 micraarray 
experiments, whereas small "p" designates those transcripts nol jn the top 50. Two of the genes {^-microglobulin and J^lhjombogJobufin) are represented by 3 and 2 probe 
sets, respectively, on the HG-U95Av2 gene chip; for B2-M, all 3 probe sets were in the top 50 genes, whereas for thymosin p4 1 of 2 wasjn the top 50 for aO experiments (the 
other probe set was in the top 75 lor all experiments). N/A indicates oligonucleotide not present on Affymetrix HG-U95Av2 gene chip. 

sented here. Overall, a total of J 26 unique lags were identified, the 
majority of which (94) were represented only once. Of the tot aJ 
unique tags, nearly one half represented novel genes hot present on 
the Aflymetrix U95Av2 GeneChip. Of the genes with unique tags 
identified more than once, there was excellent concordance with 
microarray expression analysis, with nearly all of the SAGE tags in 
Table 3 corresponding to platelet top 75 microarray transcripts. The 
platelet factor (PF) 4 variant represents a single aberration because 
this was rank-ordered approximately 350 by microarray, although, 
its SAGE tag frequency was identical to that of the predominant 
PF4 transcript. The lack of extensive nonmitochondrial SAGE 
sampling precludes any further extrapolations from this apparent 
aberration. Of note, a subset of these tags had Jong poly(A) tracts; 
although they all corresponded to genes identified in the RefSeq 
database. 71 We cannot exclude the possibility of a SAGE artifact 
for this small subset of. tags (~2%, representing 46 of 2033 tags), 
although the authenticity of the vast majority of tags (—98%) 
clearly validates the methodology. These tags are most likely 
explained by the unique biology of the platelet (ie, mRNA decay in 
the absence of de novo transcription) or to mRNA degradation 
occurring during the extensive purification methods. In summary, 
even with a remarkably limited sampling, the power of this 
approach in gene identification of relatively abundant and less 
abundant transcripts is evident. It is clear, however, given the 
unique molecular anatomy of the platelet (ie, . abundance of 
mitochondrial transcripts), that SAGE adaptations will be required 
for more comprehensive genetic profiling. 79 

Protein immurioanalysis of platelet clusterin and neurogranin 

Although most of the "most abundant" transcripts would conform 
ic> a priori predictions for platelet-expressed inRNAs, a number of 
transcripts were identified that had been poorly characterized in 
hitman platelets. To further establish the authenticity of highly 
expressed transcripts such as clusterin and neurogranin, confirma- 
tory protein analyses were completed. As shown in Figure 5, both 
proteins were clearly delected in purified platelet lysates; further- 
more, theii cellular' platelet distributions conformed io l hose 
predicted based on previously proposed functions. Note for c.x- 
ample thai clusterin — functionally characterized as a complement 
lysis inhibitor able to block the terminal complement reaction — is 
primarily expressed on (he extracellular platelet membrane. 5 " 



results independently obtained by SAGE and microarray analysis, 
quantitative RT-PCR was completed by using oligonucleotide 
primers specific for 2 abundant mitochondrial transcripts, 16S 
rRNA and NADH2 thymosin (34 (high- abundance by microatTay 
and SAGE), 2 incompletely characterized high-abundance tran- 
scripts (neurogranin and clusterin; see "Protein immunoanalysis of 
platelet clusterin and neurogranin"), a low- abundant transcript 
(T-cell receptor p,- polypeptide), and the genes encoding proteins 
with well-established quantitative determinations (ie, glycoprotein 
a,inp 3 [—50 000 rcccplors'plaielei]; protease-activated receptor- 1 
. (PARI) [—1800 receptors/platelet]).* As shown in Figure 4, these 
analyses reveal excellent concordance between SAGE and microar- 
ray studies, demonstrating the predominant frequency of the 
mitochondrial-derived I6S rRNA/NADH2 transcripts, with incre- 
mentally lower expression of other transcripts as initially demon- 
strated by microarray (1 6S > NADM2 > thymosin £4 > neurogra- 
nin > clusterin > a nop., > PAR I > TCR0). 

Given the small number of nonmitochondrial SAGE lags 
available for analysis (n = 233), limited conclusions can be drawn 
using traditional (nonsubtraction) platelet SAGE libraries as pre- 




» rt» JO JO 



riguie 4. Quantitative tcnl-lime R7-PCR analysis ot plalelel transcripts. Real- 
time RT-PCR was completed by using purified platelet RNA and oligonucleotide 
primer pairs specifically designed using PnmeO software to generate simitar ly-.s»2ed 
(--?Q0 bp) PCR products, optimised to the same annealing temperature. In graph. 
(O) represents Hb. \W) tepie:-ents tlta, f.U) represents PARI, r» represents l&S 
iRNA, (V) represents MADH7. | ▼) teptesents thymosin, (O) represents clusterin. (♦) 
represent neui ogr onin. and (•) represents I CRfi Curves are representative ot one 
complete set ol e>pe»irnents (repented twice), and line plots relied overage 
determinations tiom 3 v.X'Hs p^rtornx-d in parallel with SEM less th;>n, 1% lor oil 
d^ta points. 
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Non-perroeabnized 





Figure 5. Jmmtwxwnarysis of platelet neurogranin and 
chreterin. (A-O) GeJ-fWered ptotetoto were eilhef feted if» 
3,7% forniaktehydo (nonpcrmeabfeod) or ftxod with perrr»- 
abifization in the presence of 0.1% Trftori-X" totaved &cw 
cytometric analysis using arrbdustorin, anS-tlh/llb, cr anfinev^ 
rogranin antBxxJies and the FTTC^njurpled species- 
specific secondary antibody f« C> Iho FIT&corpjgated 
anfrabbil and aniimouse controls are esscntiaJry siperim- 
posed). (E-f) Ten mkjograms of sofubifood ttepG2 ccte 
(hepatocyte cell Bne). human brain, or purified pbtelet rysates 
were analyzed by SOS-PAGE, 17 ond irnmunobW analysis 
were ccmpteted by using titOODdBuBoris of either antineuro- 
granin {18% SDS-PAGE) cr arifidusterin (8% SOS-PAGE) 
antibodies. The arrtWustorin an&ody recognized 2 ptatetet 
irnmuncfeactive species under, shorter exposure. Although 
the relative neurogranin and dus twin protein abundances 
are subopfimafly quantified by Ihese analysos^ pbtefei dus- 
terirt appears to demonstrate cbreWerobte expression when 
compared wSh that proviousfy idorrtiftsd in hop3tocytes* 



Given the importance of complement activation in platelet destruc- 
tion, the prominent expression of cell-surface chisterin might 
suggest a role for this protein in normal and pathologic events 
regulating platelet survival. Interestingly, a clusterin-deficient 
knockout mouse has been generated that demonstrates enhanced 
cardiac dysfunction in a model of autoimmune myocarditis. 31 
Although these mice apparently have normal baseline hemograms 
(B. Aronow, personal communication, October 2002), it remains 
unestablished if they would be predisposed to immune- type 
thrombocytopenia in systemic models of autoimmunity. 

Similarly, the gene encoding an intracellular effector protein 
that may have key roles in downstream platelet activation events 
has now been demonstrated to have abundant transcript expression 
in human. platelets. Neurogranin is a highly expressed platelet 
transcript with its gene product demonstrating a primarily intracel- 
lular pattern of distribution. Neurogranin is generally described as a 
brain-specific. Ca ?4 -sensitive calmodulin-binding phosphoprotein 
that is preferentially expressed in neuronal cell bodies and den- 
dritcs."- 3 It is a specific protein kinase C (PKC) substrate that can 
also be modified by nitric oxide and other oxidants to form 
intramolecular disulfide bonds. Both its phosphorylation and 
oxidation state attenuate its binding affinity for calmodulin. 33 In 
stimulated platelets, PKC generation is linked to various activation 
pathways such as calcium-regulated kinases, mitogen- activated 
protein (MAP) kinases, and receptor tyrosine kinases.' Thus, these 
observations suggest that platelet neurogranin may function as a 
previously unidentified component of a PKC- dependent activation 
pathway coupled to one (or more) of these effector proteins. 



Discussion 

These data provide documentation for a unique platelet mRNA 
profile that may provide a tool for analyzing platelet molecular 
networks. Nonetheless, the molecular analysis of the platelet 
transcriptome mav be confounded by the constant decay of 
mRNAs in the absence of new gene transcription, a si mat ion that 
may. for example. limit the identification of low- abundance 
trans-cripls. Similarly, because the circulating platelet pool contains 



a mixed population of variably aged platelets, a "static" mRNA 
profile represents an average of this heterogeneous blood pool. 
Despite these potential limitations, the combination of genomic 
and proteomic technologies are likely to provide powerful tools for 
the global analysis of platelet function. Current strategies for 
cataloging "whole cellular proteomes" are generally accomplished 
by using 2 developing methodologies:,(l) high resolution 2-dimen- 
sional polyacrylamide gel electrophoresis (2-DE) with mass spec- 
irometric sequence identification, 34 and (2) microcapillary liquid 
Chromatography with tandem mass spectrometry (p.LC-MC/ 
MC). 3 J Further modifications of both procedures have been devised 
for direct comparative studies between 2 cellular proteomes. The 
introduction of 2-DE differential gel electrophoresis has now made 
it possible to detect and quantify differences between experimental 
sample pairs resolved on the same 2-dimensional gel. 36 Likewise, 
the application of isotope- coded affinity tags to uJLC-MC/MC 
represent a novel means of quantitative analyses between cellular 
. proteomes. 37 The success of both approaches relies on the availabil- 
ity of comprehensive genomic databases and mathematical algo- 
rithms for optimal protein identification. Indeed, mathematical 
modeling studies have demonstrated the need to delineate both 
protein and mRNA expression levels for optimal definition of 
intracellular networks 3 * Our data present an initial framework for 
delineating platelet function by defining the molecular anatomy of 
human platelets, information that is likely to provide important 
clues into the dynamic protein interactions regulating normal and 
pathologic platelet functions. Furthermore, because the platelet 
transcriptome mirrors the mRNAs derived from precursor 
megakaryocytes, these analyses may provide insights into the 
biochemical and molecular events regulating megak a ryocytopoi- 
esis anil/or proplatelel formation. 



Acknowledgments 

We thank Di Mam ecu Krnnse : Jean Waincr. and Lesley Scudder for 
assistance with some of the experiments;' John Schwedes (Univer- 
sity DNA microarray facility) with the micioarray analysis; and Ms 
Shirley Murray for manuscript preparation. 



References 



Steinberg P. Hilt R. Platelet and megakaryo- 
cytes. In; le R. el al. eds.- Wmlrobei Clinical He- 
matology. Baltimore. MD: Williams A WilVins; 

Newman P. Goiski J. While G. Gktwta S. Oetney 
C. Astt-r R Enzymatic amplification ol plaletel- 



".pt-citic messenger RNA ubing the polymcrabe 
chair* i ear. lion J Clin Invest 1«»88:0?:739 743. 

KieHei N. Guithard J. f aicei J. Vainthenkci W. 
Bn?ion-Goiius J Biosyntheiis ol ma jot plalelel 
piot fins in human blood platelets fcui J Biochem 
1267:164. t89 -tOb 



Weyiich A. Dixon O. Pablo R. ct ol. Signa'-depen- 
dent translation ol a regulatory protein, (id- 2. in 
activated human platelets. Pr oc Mai Acad Sci 
USA. V3<}8:9b.bS5CbS(>t. 
Benecke BJ, Ben 7e'ev A. Penman S. the control 

ol mRNA protection, translation and turnover in 



BLOOD, 15 MARCH 2003 - VOLUME 101. NUMBER $ 



PLATELET TRANSCWPT PROFdNG- 2293 



suspended and reattached anchorage- depen denl 
fibroblasts. CeW. 1978;14:931-939. 

6. Pabla R, Weyrich AS, Dixon DA, el al. Integrin- 
depertdenl control of translation: engagement of 
irrtegrin 3tphallbbeta3 regulates synthesis of pro- 
teins in activated human platelets. J Celt BioL 
1999;144: 175-184. 

7. Crwcurel ME. Singer RH, Meyer CJ.IngberDE. 
integrin binding and mechanical tension induce 
movement of mRNA and ribosomes to local ad- 
hesions. Nature. 1998:392:730-733. 

8. Lindemann S. Toltey N, Eyie J. Kraiss L, Ma- 
honey T, Weyrich A. Integiins regulate the inlra^ 
cellular distribution of eukaryolic initiation factor 
4E "m platelets. J Biol Chem. 2001;276:33947- 
33951. 

9. Brown EJ, Schreiber Sl_ A signaling pathway to 
transblional control. Cell. 1996;86:517-520. 

10. Rinder H. Schuster J, Rinder C, Wang C. Schw- 

. seidter H, Smith B. Con elation of thrombosis with 
increased platelet turnover in thrombocytosis. 
Blood. 1998;91:1288-1294. 

11. Richards E, Baglin T. Quantitation of reticulated 
platelets: methodology and clinical application. 
Br J Haematol. 1995;91:445-451. 

12. Robinson M. Mackie I, Khair K. el al. Flow cytcn 
metric analysis of reticulated platelets: evidence 
for a targe proportion of non-specific labelling of 
dense granules by fluorescent dyes. Br J Hema- 

, tot 1998:100:351-357. 

13. Vetculescu V, Zhang L. Vogelsleio B. Kinzlet K. 
' Serial analysis ol gene expression. Science. 

1995;270:484-487. 

14. Zhang L, Zhou W, Velculescu V, et al. Gene ex- 

• pression profiles in normal and cancer cells. Sci- 
ence. 1997;276:1268-1272. 

15. Morin PJ.SpaiksAB, Korinek V, et al. Activation 
ol beta-calenin-Tcf signatrng in colon cancer by 
mutations in bela-catenin or APC. Science. 1997; 
275:1787-1790. 

16. Bahou W. Campbell A. Wicha M. cDNA cloning 
and molecular characterization ol MSE55: a 



" novel human serum constituent protein that dis- 
plays bone marrow stromal endothelial eefl-spe- 
cific expression. J Biol Chem. 1992;267;13986^ 
13992. 

17. Bahou W. Cotter B, Potter C, Norton K. Kutok J. 
Goltgorsky M. The thrombin receptor extracellular, 
domain contains sites crucial for peptide ftgand- • 
induced activation. J Clin Invest. 1993;91:1405- 

, 14t3 - 

18. Heid C. Stevens J, Livak K. Williams P. ReaMime 
quantitative PCR. Genome Res. 1936;6:986-994. 

19. Dunn J, McCorkle S, Praissman L; et al. Genome 
signature lags (GSTs): a system lor profiling 
genomic DMA. Nucleic Acid Res. 2001^9:137- 
140. 

20. Krotl T, Wolfl S. Ranking: a closer took on global- 
ization methods for normalization of gene expres- 
sion arrays. Nucleic Acids Res. 2002;30:e50. 

21. Pruilt KD, Maglott DR. RelSeq and LocusLink: . 
' NCBI "gene-centered resources. Nucleic Acids 

Res. 2001;29:137-140. 

22. Fox JE, Boyles JK. Reynolds CC. Philips DR 
Aclin filament content and organization in un- 
stimulated platelets. J Cell Biol 1984:98:1985- 
1991. 

23. Safer D. Ebinga M. Nachnrias VT. Thymosin beta . 
4 and Fx, an aclnvsequester'mg peptide, are in- 
distinguishable. J Biol Chem. 1991;266:4029- 
4032. 

24. Wallace DC. Mouse models for mitochondrial dis- 
ease. Am J Med GeneL 2001;106:71-93. 

25. Wetle S. Bhatt K. Thornton C. Inventory of high- 
abundance mRNAs m skeletal muscle of normal 
men: Genome Res. 1999;9:506-513. 

26. Karpatkin S, Choimatz A. Langer RM. Glycogen- 
esis and gryconeogenesis in human platelets. 
Incorporation of glucose, pyruvate, and citrate 
into platelet glycogen; glycogen synthetase and 
lnjclose-1,6-dipbosphalase activity. J Clin Invest. 
1970;49:140-149. 

27. Roha S, Robinson BH. Mitochondria, oxygen tree 
radicals, and apoplosis. Am J Med Genet. 2001; 
106:62-70. 



. 28. YuJ.Zhar^UHwangP.jF^arQr^ 

gelstein B. Identification and classification of p53- 
togtilated gooes. Proc Nal Acad Sci U S A 1999; 
96:14517-14522. . 

29. WangE.Mi«erL.OhrunachtG;LroE,Marincola 
F. High-fidelity mRNA amplification for gene profil- 
ing. Nat Bio'technol. 2000;1B: 157-459. 

30. KirszbeumU Sharps JA.Mur^ 

lecular cloning and charactorrzation of tbo novel 
human corrrplomorH-os social ed protein, 
SP-40,40: a link belwoen the complement and 
reproductive systems. EMBOJ. 1989;8:711-718. 

31 . McLaughlin L, Zhu G. Mistry M, et al. Apofipopro- 
lein J/duslerin limits tno seventy of murine autcK 
immune myocarditis: J CBn Invest; 2000.6:1105- 
1113. 

32. MartinezDA.Pert2UL: Ben^ J.Cc*)rnaA. 
Structure, organization, and chromosomal map- 
ping of the human nourogranin gene.(NRGN). 
Genomics. 1997;4 1:243:249. 

33. WuJ, U J. Huang K,Huar^F. Attenuation of PKC 
and PKA signal transduction tn'the no urografin 
knockout mouse. J Biol Chem. 2002^77:19498- 
19505. 

34. Gygi S, Rochon V, Franza B, Aeborsold R: Cone- 
lafion between protein and mRNA abundance in 
yeast. Mol CeDBioL 1999;19:1720-1730. 

35. Link A. Eng J, Schiottz DM, el af. Direct analysis 
of protein complexes using mass spectrometry. 
Nat Biotechnol. 1999; 17: 676-682. 

36. Unlu M, Morgan M, Mindeh J. Difference gel elec- 
trophoresis: a single gel method for delecting 
changes in protein extracts. Electrophoresis. 
1997;18:2071-2077. 

37. Gygt S. Rist B. Gerber SA. Turecek F. Gelb MH. 
Aebersold R. Quantitative analysis of complex 
protein mixtures using isotope-coded affinity tags. 
Nat Biotechnol. 1999;17:994-999. 

38. Hatizmanikalis V. Lee K. Dynamical analysis ol 
gene networks requires both mRNA and piotoin 
expression information. MetabolEng. t999;1: 
275-281. 



30 1lwrtH? SSi nS^ Ug 14 - 27 3^33V21 lfiKS Related Articles. Links 




Overexpression of a DEAD box protein (DDX1) in neuroblastoma and 
retinoblastoma cell lines. 

Godbout R , Packer M. Bie W. 

Department of Oncology, Cross Cancer Institute and University of Alberta 1 1 560 
University Ave., Edmonton, Alberta T6G I Z2, Canada. 

The DEAD box gene, DDX1, is a putative RNA helicase that is co-amplified with 
MYCN , n a subset of retinoblastoma (RB) and neuroblastoma (NB) tumors and cell lines 
Although gene amplification usually involves hundreds to thousands of kilobase pairs of 
DNA, a number of studies suggest that co-amplified genes are only overexpressed if they 
provide a selective advantage to the cells in which they are amplified. Here, we further 
characterize DDX1 by identifying its putative transcription and translation initiation sites 
We analyze DDX1 protein levels in MYCN/DDXl-amplified NB and RB cell lines using 
polyclonal antibodies specific to DDX1 and show that there is a good correlation with 
DDX1 gene copy number, DDX1 transcript levels, and DDX 1 protein levels in all cell 
hnesstudied. DDXl protein is found in both the nucleus and cytoplasm ofDDXl- 
amplified lines but is localized primarily to the nucleus of nonamplified cells Our results 
indicate that DDX1 may be involved in either the formation or progression of a subset of 
NB and RB tumors and suggest that DDX1 normally plays a role in the metabolism of - 
RNAs located in the nucleus of the cell. 



PM1D: 9694872 [PubMed - indexed for MEDLINE] 
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Expression level of Ubc9 protein in riat tissues. 
GolcbiowskiF, Sztifc A . Sakowicz M. Szutowicz A. Patvelczyk T , 

Department of Molecular Medicine, Medical University of Gdansk, 80-2 1 1 Gdansk 
Poland. ' ' 

Ubc9 is a homologueoftheE2ubiquitin conjugating enzyme and participates in the . 
covalent linking of SUMO-1 molecule to the target protein. In this report we describe a 
simple and efficient method for obtaining pure human recombinant Ubc9 protein. The 
purified Ubc9 retained its native structure and was fully active in an in vitro sumoylation 
assay with the promyelocytic leukaemia (PML) peptide as a substrate. In order to better 
understand the physiology of Ubc9 protein we examined its levels in several rat tissues. 
Immunoblot analyses perfoimed on tissue extracts revealed quantitative and qualitative 
differences in the expression pattern of Ubc9. The Ubc9 protein was present at a high 
level in spleen and lung. Moderate level of Ubc9 was detected in kidney and liver. Low 
amount of Ubc9 was observed in brain, whereas the 1 8 kDa band of Ubc9 was barely 
Visible or absent in heart and skeletal muscle. In heart and muscle extracts the Ubc9 
antibodies recognized a 38 kDa protein band. This band was not visible in extracts of 
other rat tissues. A comparison of the relative levels of Ubc9 mRNA and protein 
indicated that the overall expression level of Ubc9 was the highest in spleen and lung. In 
spleen, lung, kidney, brain, liver and heart there was a good correlation between the 18 
kDa protein and Ubc9 mRNA levels. In skeletal muscle the Ubc9 mRNA level was 
unproportionally high comparing to the level of the 18 kDa protein. The presented data 
indicate that in the rat the expression of the Ubc9 protein appears to have some degree of 
tissue specificity. . 
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ABSTRACT 

Motivation: Protein abundance is related to mRNA 
expression through many different cellular processes. 
Up to now, there have been conflicting results on how 
correlated the levels of these two quantities are. Given that 
expression and abundance data are significantly more 
complex and noisy than Ihe underlying genomic sequence 
information, it is reasonable to simplify and average them 
in terms of broad proteorhic categories and features (e.g. 
functions or secondary structures), for understanding their 
relationship. Furthermore, it will be essential to integrate, 
within a common framework, the. results of many varied 
experiments by different investigators. This will allow one 
to survey the characteristics of highly expressed genes 
and proteins. 

Results: To this end, we oulline a formalism lor merging 
and scaling many different gene expression and protein 
abundance .data sets into a comprehensive reference 
set, and we develop an approach for analyzing this in 
terms ol broad categories, such as composition, function, 
structure and localization. As the various experiments are 
not always done using the same set of genes, sampling 
bias becomes a central issue, and our formalism is 
designed to explicitly show this and correct tor it. We apply 
our formalism to the currently available gene expression 
and protein abundance data for yeast. Overall, we found 
substantial agreement between gene expression and 
protein abundance, in terms of the enrichment .ol structural 
and functional categories. This agreement, which was 
considerably greater than the simple correlation between 
these quantities tor individual genes, reflects the way 
broad categories collect many individual measurements 
into simple, robust averages. In particular, we found 

*Tt> Itom coiic.vpi>r>dcnrc .should br aiicjfc.wrtl. 
: The:.e nurfioo, rtmiribim-i) equally to ihij woik. 



that in comparison to the population of genes in the 
yeast genome, the cellular populations of transcripts and 
proteins (weighted by their respective abundances, the 
transcriptome and what we dub the translatome) were both 
enriched in: (i) the smalt amino acids Vat, Gly, and Ala; 
(ii) low molecular weight proteins; (iii) helices and sheets 
. relative to coifs; (iv) cytoplasmic proteins relative to nuclear 
ones; and (v) proteins involved in 'protein synthesis,' 'cell 
structure,' and 'energy production/ 
Supplementary information: http://genecensus.org/ 
expression/translatome 
Contact: mark.gerstein@yale.edu 

INTRODUCTION 

High throughput experimentation, measuring mRNA 
(Schena et ui, J 995; Eisen and Brown, 1999; Ferea 
and Brown, J 999; Lipshutz et oi, 1999) arid protein 
expression (Anderson and Seifhamer, J 997; Futcher et ni t 
.1999; Gygi et ai } 1999a; Ross Macdonald et oi, J999; 
Lopez, 2000; MacBeath and Schreiber, 2000; Nelson 
et ai, 2000; Zbu et a!., 2000) are currently the single 
richest source of genomic information: However, how to 
best interpret this data is still an open question (Basset! 
et ai, 1996; Wines and Friedman, J 999; Zhang, J 999; 
Gerstein and Jansen, 2000: Searls, 2000; Sherlock.. 2000; 
Ciaverie, J 999; Einarson and Colemis, 2000; Epstein and 
Buiow, 2000; Shapiro and Harris, 2000). Understanding 
how protein abundance is related to mRNA transcript 
levels is essential for interpreting gene expression, protein 
interactions, structures and functions in a cellular sys- 
tem (Hnizimanikaiis et ai, 1999). Moreover, as protein 
concentration is the more relevant variable with respect 
to enzyme activity, it connects genomics My the physical 
chemistry of the eel) (Kidd et oi, 2001). Protein abun- 
dance may also be invaluable /or diagnostics and for 
determining drug targets (Cordials et oi. 200(1). 
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Previously, we surveyed the population of protein 
features — such as folds, amino acid composition, and 
functions — in yeast, and other recently sequenced 
genomes (Gerstein, 1997, J998a,b; Gerstein and Hegyi, 
J 998; Hegyi and Gerstein, 1999; Das and Gerstein, 
2000; Lin and Gerstein, 2000), and we extended this 
concept to compare the -population of features in the 
yeast transcriptome to that in the genome (Drawid et 
aL, 2000; Jansen and Gerstein, 2000). Others have- also 
done related work (Frishman and Mewes, 1997; Tatusov 
et aL, 1997; Jones, J998; Wallin and von Heijne, J998; 
Frishman and Mewes, 1999; Wolf et oL, 1999). Here, we 
present a new methodology to compare the features of the 
mRNA expression population with the protein abundance 
population. 

Precise terminology is essential for this comparison. 
Unfortunately, 'proteome 1 is used inconsistently. Pro- 
teome can logically be used to describe all the distinct 
proteins in the genome (Qi et aL, 1996; Cavalcoli et aL, 
1997; Fey et aL, 1997; Garrels et aL, 1997; Gaasterland, 
1999; Jones, 1999; SaJi, 1999; Tekaia et aL, 1999; 
Bairoch, 2000; CambiUau and Claverie, 2000; Doolittle, 
2000; Pandey and Mann, 2000; Rubin et aL, 2000) and, 
in this context, it is equivalent to what others may refer 
lo as the coding part of the genome. However, in papers 
on two-dimensional (2D) electrophoresis, it is often used 
to describe the sum total of proteins in a cell, taking 
into account the different levels of protein abundance 
(Shevchenko et oL y 1996; Gygi et aL, 2000a; Lopez, 
2000; Washburn and Yates, 2000). In an effort to be dear, 
we propose l he term 'translatome' for this second usage 
of prole ome. 

With this definition, we ore able lo refer compactly to 
three different cellular populations. These are illustrated 
in Figure 1 . 

(i) We use" the term genome when we refer to the 
population of open reading frames, where each ORF 
counts once. 

(ii) We use the term transcriptome when we refer to 
the population of mRNA transcripts. This term was 
originally coined by Vclculescu et aL (1997). Note 
that each ORF may give rise to different numbers 
of transcripts. Consequently, the transcript ome is 
essentially the same as the genome but with each 
ORF weighted by its expression level. 

(iii) The next level is the cellular population of proteins. 
As each protein represents a translated transcript, 
we make an analogy with the term transcriptome 
and use the term translatome as described above 
to describe this third population. Thus, the trans- 
Intome is n subset of the genome where each 
ORF is weighted by i'S associated level of protein 
abundance. 



Note that one could aJso. less compactly call the trans- 
1 at ome a 'weighted proteome.* However, doing so assumes 
one of the two aforementioned definitions of proteome. To 
avoid ambiguity, we studiously avoid the use of proteome 
altogether in the paper. 

Differences between the translatome and the transcrip- 
tome exist given thai transcripts from different genes 
can give rise to different numbers of proteins, due lo 
different rates of translation and protein degradation. 
Post- transcriptional modifications further affect the 
translatome. 

In our analysis of the transcriptome and translatome, we 
focus on global protein features rather than the compari- 
son of individual genes. Previous analyses have shown that 
differences between mRNA expression and protein abun- 
dance levels can be quite dramatic for individual genes. 
. This may either be due to the noise in the data or to funda- 
mental biological processes. However, our analyses show 
that the variation between transcriptome and translatome 
is much smaller for global properties that are computed by 
averaging over the properties of many individual genes. 

METHODS 

Dal a sources used 

For our analysis we culled many divergent data sets, 
representing protein abundance and mRNA expression 
experiments and also other sources of genome annotation. 
These are all summarized in Table 1. 

Biases in the data 1 

The databases that annotate the specific genes may 
not always be accurate (lsbii et aL, 2000). Gene Chip 
experiments suffer with regard to cross hybridization 
and the saturation of probes. SAGE data degrades for 
lowly expressed mRNAs. 2D gels are unable to resolve 
membrane proteins (approximately 30% of the genome) 
and basic proteins (Gerstein, 1998c; Krogh et aL, 2001). 
In addition, the procedures for identification and quan- 
tification of the protein spots are subject to. uncertainties 
(Haynes and Yates, 2000). Human biases include the 
lack of low abundance proteins (Fey and Larsen, 2001; 
Gygi et aL, 2000b; Harry et aL, 2000) and the differences 
between laboratories in sample preparation. Our reference 
expression data set attempts to resolve these problems. 

Data set scaling 

A reference set for mRNA expression. With many differ- 
ent mRNA. expression data sets available, it is worthwhile 
to integrate them into a single unified reference set, with 
the intention of reducing the noise and errors contained in 
the individual data sets and to obtain a unified estimate of 
the normal expression state in a cell. 
We adopt an iterative scaling and merging formalism. 
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Fig. J. Schematic overview of the analysis. On the left- side we outline the terms we use lo describe the process of gene expression. The 
coding section of the genome is transcribed into a population of mRNA transcripts called the 'trahscriptome/ The transcripts in lum are 
translated to a population of proteins; we use the term 'translatomc* for this protein population rather than the alternative 'proteome* because 
the latter term may be confounded with the protetn complement of the genome {which is not necessarily associated with a quantitative 
abundance level). 

The matrix in the middle schematically shows an analysis of the three stages of expression. In genera), we define a protein 'population' as 
a set of genes associated with a corresponding number of expression or abundance levels ('weights*): fn the matrix each row represents a 
weight and each column a gene set. In particular, we differentiate between the mRNA reference expression set (GmRNA = ^Ccd). which 
essentially covers the complete genome, and the reference protein abundance set (Gp^i) which contains the proteins in data sets 2-DE HI and 
2-DE H2 (see Table 1) because the protein abundance set is a significantly smaller subset of the genome. By definition, this subset contains 
only proteins that can be identified by 2-D gel electrophoresis and is therefore biased in this sense. The enrichment figures throughout this 
paper, through a comparison of the right- and left-sides of this figure, show the results of the experimental biases of 2D gels on the data set. 
Each pie chart represents a composition of a particular protein feature F (foi instance, an amino acid composition) in a population (represented 
by the symbol ji)'. We can further look at the 'enrichment' of this feature in one population jelative to another (represented by the symbol A, 
see Section 'Methods* for on explanation of the formalism). 



which we summarize below. We present a more detailed 
review of the met hot Is on our web site. 

We start with the values of one gene chip data set Oi 
where / is used throughout as a subscript to denote gene 
number. We then transform the values of (he next Gene 
Chip data set X t to Y t with the following non-linear regres- 
sion: min^T, (Y; - Uj) ? with Y t - AX? where A and B 
are the parameters of the regression. Note th3i two Gene 
Chip sets may not be defined for the same set 'of genes, 
so we have to perform the fit only over the genes com- 
mon to both sets. The motivation for scaling is that the 
dynamic range of observed expression levels varies some- 
what between different data sets, although celJ types and 
growth conditions are very similar. Reasons for dispar- 
ity may include different calibration procedures for relat- 
ing fluorescence intensity to a ceJJntor concentration (mea- 
sured >n copies of transcripts per cell) or different pro- 
tocols for harvesting and reverse- transcribing the cellular 
mRNA. 

We then merge and average the data to create a new 



reference set V as follows: 

If If; and Yi are both defined for gene / and — — < o 

Yi -r Uj 

Then V } = \{Yi + U,) 

Else if only Y) exists, V; = }) 

Else V) = Ui. 

As presented above, where only one data set has a value 
for the corresponding ORT\ we incorporated that vahte 
and did not exclude il. When both data sets have values 
for an ORF, we averaged the values if they were within 
\59i of each other; otherwise, we just stayed with the 
original chip data set £.',-. We used or — )5% in order to 
prevent outliers from skewing the result. This 15% value is 
a reasonable threshold for excluding outliers though other 
values (e.g. 10 oi 20%) would give similar results (data 
not shown). Othei data sets are subsequently included in 
the same procedure, continuing the iteration from the new 
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expression values V t . The initial iteration starts with the 
Young Expression Set, as U t , since we have the highest 
confidence in its accuracy. 

The SAGE data (Velculescu et al y 1997) was not 
included in the above procedure since it is of a funda- 
mentally different nature; An advantage of the SAGE 
technology over Gene Chips is that there is no possible 
signal saturation for high expression levels, as is possible 
for chips (Futcher et o/., 1999). Conversely, SAGE values 
are less reliable for lowly expressed genes since there 
is a chance that one might not sequence a SAGE tag 
corresponding to such a gene altogether. Therefore, if 
after the last iteration, the average Gene Chip expression 
level Vi was both above a certain threshold fi and below 
the SAGE expression level Si for the same gene, it was re- 
placed with the SAGE value; otherwise the average Gene 
Chip value was kept. This gave us our final expression 
set w m R N A- Our treatment of the SAGE data is modeled 
after that in Futcher et at (J 999), and like them, we used 
P= 16. 

This incorporation of the SAGE data into the reference 
data set ensures that the highly expressed outliers are as 
accurate as possible. 

Rather than plain arithmetic averaging, this overall scal- 
ing procedure with the or cutoff avoids 'artificial averages' 
that combine very different values for a particular gene. 
Some expression values might be statistical outliers. In 
addition, it may be possible thai the expression levels of 
a variety of genes can only be within mutually exclusive 
ranges or modes, such as when two alternative pathways 
are switched on or off. Simply averaging these would give 
values that are less representative of the particular mode 
values. This situation is analogous to that in averaging 
together an ensemble of protein structures (i.e from NMR 
structure determination). Each structure could be stereo- 
chcmicaJly. correct, with all side-chain atoms in predefined 
rotamer confirmations. However, an average of all struc- 
tures could yield one that is stereochemical^ incorrect if 
this involved averaging over particular side- chains in dif- 
ferent rotameric states. 

With regard to our regression analysis, we have investi- 
gated both non- linear and linear fits but found a non- linear 
procedure to be more advantageous. The non- linear rela- 
tionship between different expression data sets perhaps 
reflects saturation in one or more of the Gene Chips — not 
an uncommon phenomenon. This non-Jinenriiv is imme- 
(liaifly evident on scalier plots ol two data sets ayainst 
one another (see website). According ly. the non- linear 
fit produces a smaller residual than the linear fit: 98 297 
(non- linear) versus 122 J 82 (linear) for the scaling of the 
Church data set and 59828 (non-linear) versus 67*162 
(linear) for the Samson data set.. 



A reference set for protein abundance. We followed a 
similar procedure to calculate a reference protein abun- 
dance set from the two gel electrophoresis data sets. We 
first scaled the two data sets against the mRNA expres- 
. sion reference data set, getting regression parameters C j 
and Df 

i 

where the subscript j indicates the data set 2-DE #1 or 
2 DE #2 respectively; Pj j is the protein abundance value 
in data set j y and u; mR N A> ; the corresponding reference 
expression value, and Cj and Dj are the parameters of 
the non-linear regression. 

Using these parameters, we transformed the values of set 
2-DE #2 onto 2-DE #1. Then we combined both sets into 
the reference protein set wj^^ by averaging them, if both 
values existed. Otherwise, by using the existing value, viz: 

u/piou -= {Pu + j2;.2)/2 if both P^ i and Q i 2 exist. 
Else if only P i } exists, ivp, oL ,- = Pi \ 
Else if Qi 2 exists, wptm./ = £?/.2- 

Enrichment of features 

Formalism. In the next part of our analysis, we want 
to group a number of proteins together into various 
categories based on common features, and characterize 
those features that are enriched in one population relative 
to another, i.e. the translatome population of proteins 
as measured by 2D gels relative to the transcriptome 
population of transcripts or the genome population of 
genes. To this end, we set up a formalism that could 
be applied universally to all the attributes thai we were 
interested in. Due to the limitations of the experiments, 
the translatome, transcriptome, and genome populations 
are defined on different sets of genes, and sometimes we 
want to remove (his 'selection bias' by forcing them to be 
compared on exactly the same set of genes. This is a key 
aspect of our formalism as presented in Figure I. 

We call an entity like |w, C) a 'population/ where G 
is a set describing a particular selection of genes from the 
genome and w is vector of weights associated with each 
element of this population. In particular, we focus on three 
main populations here: 

l») IJ. ^Gmi] is 'he population ol genes in the genome, 
all 6280 genes weighted once (w = )); 

(ii) h v mRNA 7 Ctt>rna] is the observed population of the ■ 
transcripts in the transcriptome. i.e. the 62*19 genes 
in the reference expression set weighted by their 
reference expression value: 
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(Hi) (Wjproi, Gp,oi) is the observed cellular population of 
the proteins in, the transJatome, i.e. the 181 genes 
in the reference abundance set weighted by their, 
reference abundance value. 

(The set of genes in the genome <7 G en is approximately 
equal to the genes in set G mR M A , such that we can use 
both symbols interchangeably.) We can also use this nota- 
. tion to describe specific experiments— e.g. [w )acZ , C?iacz) 
describes the gene set and weights relating to the transpo- 
son abundance set. 

Furthermore, we define Fj as the value of a feature F 
in ORF j. For example, F could be the composition 
of leucine (a real number) or a binary value (0 or J) 
indicating whether an ORF contains a trans-membrane 
segment. Given these definitions, the weighted average of 
feature F in population [w, G) is: 



The. weighted averages of two populations |w, GJ and 
[v, S) can be compared by simply looking at their relative 
difference A: 



M(^,[v, S])-MF,l^ GJ) 
■^,KG)) 



where v and w are weights for the sets of ORFs S and G 
respectively. We call A the 'enrichment' of feature F 
because it indicates whether F .is enriched (if A is 
positive) or depleted (if A is negative) in population [v, S) 
relative to (w, GJ. 

Usually, the gene set G is defined by the particular 
experiment, for which the weight w was measured. 
However, it is also possible to combine the gene set 
associated with one experiment with expression levels 
from another set. One may want to do this to compute 
the enrichment only on the genes common to both 
populations, for which there are defined values for both w 
and v, viz: A(/\ \v, S O G], \w, S D GJ). In practice, 
this is most relevant for comparing G Pjp , and G mRNA . 
Since G Prol is completely a subset of G mRNAt we need 
not explicitly deal with intersections if we calculate all 
statistics directly over Gp I0( . 

One can adjust the weight vectors to take into account 
different types of averaging. For instance, when com- 
puting the amino acid composition (F ~ an) from the 
amino acid compositions of individual ORFs F f - ~ aa 
(V; e G). we weight by ORF length. In the case of . 
expression weights, we have: 

wj = Njtv mmA j Wj e G 

where Nj is a measure of the length of ORF / (such as the 
number of amino acitfs). 



On the other hand, when computing the average molec- 
ular weight per amino acid, we need to normalize by the 
number of amino acids per ORF, which is equivalent to 
choosing the following weights: 



vy e G. 



Application of methodology to quantitative 
abundance sets 

Having defined our formalism, we applied it to a diverse 
set of protein features in yeast. 

. Amino acid enrichment As shown in Figure 2a, we used 
pur methodology to measure the enrichment of individual, 
amino acids in both the transJatome and the transcriptome 
relative to the genome. We found that three amino acids — ■ 
valine, glycine and alanine — were consistently enriched in 
both transcriptome and translaipme populations. 

In Figure 2a we compare different gene sets. Jn Fig- 
ure 2b we focus mainly on the variation in enrichments 
when all the comparisons are restricted to the set of 181 
genes (Gp ro! O G m RNA = Gproi) common to all data sets. 
Thus, the differences between the populations now only 
reflect the effects of differential transcription of certain 
genes and differential translation of certain transcripts. 
We find here an enrichment specifically of cysteine in the 
translatome in relation to the transcriptome. 

To measure the statistical significance of the results on 
amino acid enrichment, we have performed a control anal- 
ysis on a randomized data set (Figure 2d). We randomly 
permutated the expression values of the ORFs 1000 times 
and then recomputed the enrichments. This allowed us to 
compute distributions for the amino acid enrichments and, 
from integrating these, one-sided p-values indicating the 
significance of the observed enrichments. 

Amino [acid enrichment in Transposon data set. We 
also tried to extend our methodology, ineffectively, to 
cope with the semi-quantitative Transposon set. We used 
only those 450 ORFs that consistently yielded either no 
expression or high expression, as binary data, on or off. We 
show the enrichments of amino acids computed from this 
filtered Transposon abundance set in Figure 2a. Overall, 
the enrichments from this set seemed to be attenuated in 
comparison to other data. 

Biomass enrichment. A corollary to amino acid enrich- 
ments is the determination of the average biomass of the 
franscriptome and translatome populations (shown in Fig- 
ure 2c). We found that the average molecular weight of 
a protein in both populations was, on average, lower than 
in the genome population. These preliminary observations 
suggest a cell preference to use less energetically expen- 
sive proteins for those thai :ire highly transcribed or trans- 



589 



D.Grcenbovm et at 



(a) 




t c s t r o i o 



(b) 

















\ 








B — - 





QA(aa, Iw^bh*. (1, O^D 

o Ataa, (w^ Qw), |1 t 

D A(aa, (w^nH*. f^J, [1. 

♦ Afaa, |wr^>, M Iw^a, Q^D 



o o 



IWMOMAf'frRK I H D 1 ir 5 ( V C 

Amino Acid 



Fig. 2. Amino acid and biomass enrichment, {a) Shows the amino acid enrichments between different populations as indicated by the legend 
to the right of the plot (the legend is ordered in the same way as the schematic illustration in Figure I). The bars indicate the enrichment of the 
transcripiomc relative to the genome, whereas the circles indicate, the enrichment of the tiansbtomc relative to the genome. Jn addition, we 
also show the enrichment for protein abundance from the Transposon abundance set, represented by the circles with the line through them, 
(b) Shows a different view of amino acid enrichment from thai contained in (a), now focusing on changes, and thus restricting the comparison 
to the genes common to all the data sets. The graph is ordered according to the enrichment from transcriptome to transjatome (black squares). 
We focus Here only on the changes for the abundance gene set (Gp roI ) to exclude the effects that arise from looking a I different subsets. In this 

. view the enrichments from genome to transcriptome (white squares) and from genome to itansbtomc (while diamonds) look more similar 
than do the analogous sets in (a). To make comparison with (a) easier we again show the enrichment from genome to the transcriptome for 
the complete gene set (Gcem shown in bars), (c) Shows biomass enrichment. The left panel depicts the average molecular weight per ORF 
(in units of kDa) and the right panel, the overage molecular weight per amino acid (in units of Daltons) in each of the three stages of gene 
expression. The numbers inside the circles indicate the average molecular weights. The values next to the arrows indicate the enrichments 
in biomass between differenl populations. Both the ciicle diamelers and the anow widths are functions of the corresponding values (the 
hollow arrow indicates a posiiive value). It is very clear that the average molecular weight per ORF is much lower in the translatome (by 20 
or 15%) and transcriptome (by 29%) than in the genome, This relative depletion of biomass mainly takes place as a result of transcription; the 
effect of translation is less clear, depending on the populations compared. On the other hand, the depletion in the average molecular weight 
per amino acid (-3.3% from genome to translatomc) is an oider of magnitude smaller than in the average weight per ORF This shows 
that the yeast cell favors the expression of shorter ORFs over longer ones, and agrees with our earlier observation that (here is a negative 

■ correlation between maximum ORF length and mRNA expiession (Jansen and Gcrstein, 2000); it seems that this effect mainly takes place 
during transcription r.tthcr than translation. (d)/This plot shows thai the amino acid enrichments are statistically significant. We have assessed 
significance by randomly permuting the expression levels among the genes and then recomputing the amino acid enrichments. This procedure 
can be repealed and used to generate distributions of random enrichments that can then be compared against the observed enrichments. Jn 
the plot the gray bars represent the observed enrichments already shown in Figure 3a. On lop of the gray bars we show- standard box plots 
of enrichment distributions based on 1000 random permutations. (The middle line represents the distribution median. The upper and lower 
.sides of the box coincide with the upper and lower quartiles. Outliers are shown as dots and defined as data points that are outside the range 
of the whiskcis. the length of which is 1.5 the interquartile distance.) Based on the random distributions, we can compute one-sided ^-values 
for the observed enrichments. Amino acids for which the values arc less than I0~" are shown in bold font. 



toted. However, we. also found that the average molecu- 
lar weight per ammo acid differed much less between the 
ttnnscripiome nod the iianslatome on the one hand, and the 
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genome on the other hand (though it was still slighily less). 
This finding indicates thai lower molecular weights in the 
iranslaiome and transcriptome relative to the cenome ate 



D.Gmenbaum et ah 



Secondary structure 




Localization 




Function 



□ Metabolism P) 
0 Energy {2) 

n CeH growth, cell division 

and DMA synthesis (3) 
D Transcription (4) 
B Protein synthesis (5) 

□ Cettutar ofganizolion (30) 
Unclassified (98. 99) 

DOthei 



Fig. 3. Breakdown of the transcriptome and translatome in terms of broad categories relating lo structure, localization,, and function. AJ1 
of the subfigures are analogous to the schematic illustration in Figure J. (a) Represents the composition of secondary structure in the 
different populations, (b) Represents the distribution of subcellular localizations associated with proteins in the various populations. We used 
standardized localizations developed earlier (Drawid and Ger stein, 2000), which, in turn, were derived from the MIPS, YPD, and SwissProl 
databases (Bairoch and Apweilei, 2000; Costanzo et ol, 2000; Mewes et ol. 2000). The subcellular localization has been experimentally 
determined for less than half of the yeasi proteins, so our analysis applies only to this subset, (c) Shows the division of ORFs into different 
functional categories (according lo the MIPS classification) in the various populations. Only the largest functional categories of the top level 
of the MIPS classification are shown. The group 'other' contains the smaller top-level categories lumped together. This 'other' group is 
different from the group 'unclassified,* which contains genes without any functional description. 



data points, we could find logical trends in the underlying 
data. For example, individual transcription factors might 
have higher or lower pioiein abundance, than one expects 
from their mRNA expression, but the category 'transcrip- 
tion factors" as a whole has a similar representation in the 
transcriptome ami transbtome. 

We found, as previously described (Fuicher et a/.. 1999; 
Gygi et 'ol. 1999b: Groenbanm et al r 2001), a weak 
correlation between individual measurements of mRNA 



and protein abundance. The outliers of this correlation 
tend to be associated with cellular organization. One 
might conceive of using these outliers (i.e. those with 
significant ly different transcriptional and translation^! 
behavior) to find consensus regulatory sequences. One 
possible method would involve using predicted mRNA 
stiuctuies f Jaeger et til, 1990; Zuker. 2000) to find and 
investigate consensus structural elements in these outlieis 
to wbirh the yeast ir.tnslniion.il machinery -is known to be 
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TaWt I. Data sets 



Description 



Size [ORFs) 



Reference • 



mRNA expression 
Young 

Church 

Samson 

SAGE 

Reference expression 

Protein abundance 
2-DE#l 

2-DES2 
Transposon ■ 

Reference abundance 

Annotation 

Annotated localization 

TM segments. 

MIPS functions 

GOR secondary structure 



Gene chip profiles yeast cells with - 
mutations that affect tjanscription 
Gene chip profiles of yeast cells 
under four different conditions 
Comparing gene chip profiles for 
yeasi ceils subjected to alkylating agent 
Yeast cells during vegetative growth 

Scaling and integrating ibe mRNA 
expression set into one data source 



Measurement of yeast protein 

abundance by 2D gel 

electrophoresis and mass spectrometry 

Similar to 2- DE set #1 

Large-scale fusions of yeast genes 
with locZ by transposon insertion 
Scaling and integrating the 2-DE 
data sets into one data source 



Subcellular localizations of yeast 
proteins 

Predicted TM and 
soluble proteins in yeast 
Functional categories for yeast 
ORFs 

Piedicted secondajy stroctuje 
yeast ORFs 



5455 

6263 

6090 

3778 
6249 

156 

71 
1410 



2133 
(6280) 

2710 
(6280) 

3519 
(6194) 

6280 



Hoktege el al (1998) 
Roth el of. (1998) 
Jclinsky and Samson (1999) 
Velculescbrtal. (1997) 

Gygiflo/.(I999a,b) 

Futcber et at. (1999) 
Ross-MacdonaJd ei oi (1999) 



Draw-id and Geratein (2000) 
Geislein(l99Sa.b,c) 
Mewesrrn/. (2000) ■ 
Ger stein (I998a,b,c) 



This table provides an overview of the data sets used in our analysis. The table is divided into three sections. The top section lists different mRNA expression 
sets. The middle section shows the protein abundance data sets used. The bottom section contains different annotations of piotein features. The column 'Data 
set* lists a shorthand reference to each data set used throughout this paper The next columns contain a brief description of the data sets, the number of ORFs 
contained in each of them, and the liierature reference. In contrast to the other data we investigated, the reference expression and abundance data sets have 
been calculated for the purpose of our analysis (see text). An expanded version of the table is available on our web site. 
Some funher information on the genome annotations: 

Localization. Protein localization information from YPD, MIPS and SwissProt were merged, filtered and standardized (Bairoch and Apweiler. 2000; 
Costanzo et at., 2000; Mewcs ei al. 2000) into five simplified r ompan me nis— < cytoplasm, nucleus, membrane, extracellular (including proteins in ER and 
gnlgi), and mitochondrial— according to the protocol in Drawid * / nl (7000). Tliis yielded a standardized annotation of protein subcellular localization lor 
2133 out of 6280 ORFs.' 

TM iesmenti. In 2710 out of 6280 yeast ORFs TM segments arc predic ted lo occur, ranging from low to high confidence (732 ORFs). The - fM prediction was 
performed as follows: the values from the scale for amino acids in □ window of size 20 (the typical size of a TM helix) were averaged and then compared 
against a cutoff of - 1 (teal mol - 1 . A value under this cutoff was taken to indicate the existence of aTM helix. Initial hydrophobic so etches cones ponding to 
signal sequences for membrane insertion were excluded. (These have the pattern of a charged residue within the first seven, followed by a soetch of 14 with 
an average hydrophobic iry under the cutoff.) These parameters have been used, tested, and re fined on surveys of membrane protein in genomes. 'Sure' 
membrane proteins had at least rvvoTM-segmcnis wiih an ave ra^e hydioprmbirity (ess thnn - 2 kcal mot" 1 .(Rosl ft nl. 1995: Ccrstein ei nl. 2000. Saritonr 
ri nl, 200O. Sencs ft nl. 2000). 

Funt lions. MIPS functional categories have been ^signed io3M9 otii of 6194 ORFs. (The remainder are assigned tocalegory '9S* or '99.' which 
corresponds to unclassified function.) 



sensitive (McCarthy, 1998).. 

In telattoti to functional categories, wc found throe 
trends that were panicnlaily notable: (i) the "cellular 



Organization/ 'protein synthesis/ and energy production 1 
categories were increasingly enriched 35 we moved from 
genome to ttanscripiome to transJoiome. In the itanscrip- 
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tome and translatome population relative to the genome; 

(ii) proteins with 'unclassified function' are significantly 
depleted, perhaps reflecting a bias against studying them; 

(iii) proteins in the 'transcript ion* and 'cell growth, cell 
division, and DNA synthesis* categories were consistently 
depleted. This reflects the fact that many of these proteins, 

. such as transcription factors, act as 'switches' such that 
only small quantities of the protein are necessary to 
activate or deactivate a process. These results concur with 
previous calculations (Jansen and Gerstein, 2000) wherein 
we found the transcriptome is enriched specifically with 
proteins involved in protein synthesis and energy. 

Limitations given the small size of the protein 
abundance data 

Even with the extended coverage made possible by 
merging many data sets together into reference sets, the 
analysis is still limited by the minimal data. This was 
most applicable to the protein abundance measurements, 
potenlially biasing our statistical results towards certain 
protein families. Moreover, the J8J proteins in Gp ro t do 
not represent a random sample. They are skewed towards 
highly expressed, weJI-studied proteins. Our methodology 
attempts to control for this gene-selection bias through 
our enrichment formalism, which allows one to rather 
precisely gauge various aspects of the bias. Conversely, 
many protein features in both the transtetome and the 
transcriptome are dominated by highly expressed proteins. 
Under these circumstances, it is often sufficient to look at 
this smaller number of dominating proteins to characterize 
the whole population. This is similar to the development of 
the codon adaptation index lor yeast (Sharp and Li, 1987). 
While based on only 24 highly expressed proteins, it has 
proven to be robust in predicting expression levels lor the 
entire genome. 

"We believe that the essential formalism and approach 
that we develop will remain quite relevant for future data 
sets' (Smith, 2000). 
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Galanin in pituitary adenomas. 

. GrenbackE, Bjeliexup P, Wallerman E, Lundblad L . Anggard A Ericson K. Aman 

Depaitoent of Molecular Medicine, Endocrine and Diabe.es Unit, Karolinska Hospital, 
b-17176 Stockholm, Sweden. Eva.Grenbaek@ks.se 

J^SS'i? n C TT! was measured in extracts from human P ituita <* ade « ™ ng a 

specie RIA method for monitoring human galanin. Twenty-two out of twenty-four 
tumors contained galanin with notably high levels in corticotroph adenomas varying 
levels m chmcajly inactive tumors, and low levels in GH secreting adenomas. Tumor 
ga ton and ACTH contents were closely correlated in all tumors. In four young patients 
mth m.croadenomas and highly active Mb Cushing tumor galanin was inversely related 
to tumor volume. The molecular form of tumor galanin, studied with reverse-phase 
HPLC, was homogeneous with the majority of rumor galanin coeluting with standard 
human galarun. In the rumors analysed with in situ hybridization there was a good 
correlator, between galarun peptide levels and galanin mRNA expression. In some 
tenors galanm mRNA and POMC levels coexisted, in others they were essentially in 
Afferent cell populations. Levels of plasma galanin-LI were not related to tumor galanin 
concentrate, and galanin levels were in the same range in sinus petrosus close to the 
pituitary venous dra.nage as in peripheral blood. Corticotropin releasing hormone 
injections m two patients caused ACTH, but no detectable galanin release into sinus 
petrosus. Our results demonstrate that corticotroph, but not GH adenomas, express high 
levels of galarun, in addition to ACTH, and that in some tumors both polypeptides are 
synthesised 1 .n the same cell population. However, galanin levels in plasma were not 
influenced by the tumor galanin content. 
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extracts. If these minor cell proteins differ among cells to the same extent as the 
more abundant proteins, as is commonly assumed, only a small number of pro- 
tein differences (perhaps several hundred) suffice to create very large differences 
in cell morphology and behavior. 

A Cell Can Change the Expression of Its Genes 
in Response to Externa] Signals 3 

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
. is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
■ production of glucose from amino acids and other small molecules; (he set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. Jn fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond 10 glucocorticoids at all. These examples illustrate 
a general feature of cell specialization—different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc 1 
tive character. These features reflect the persistent expression of different sets of 
genes. 

i 

j Gene Expression Can Be Regulated at Many of the Steps 
J in the Pathway from DNA to RNA to Protein 1 

\ If differences between the various cell types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle he regulated. Thus a cell can control the proteins it 
makes by I J) controlling when and how often a given gene is transcribed (tran- 
scriptional control), (2) controlling how the primary - RNA transcript is spliced or 
othervvise processed (RNA processing control), i'S) selecting which completed 
rnRNAs in the cell nucleus are exported to the cytoplasm (UNA transport con- 
trol). M) selecting which mRNAs in the cytoplasm arc translated by rihosomes 
(translation:*! control), (f>) selectively destabilizing certain mHNA molecules in 
i he cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Tignie 9-Z). 

For most series it arise tip lional coniiols are paramount. This makes sense 
because, of all the possible control points illustrated in figure only Itanscrip- 
tionaJ control ensures that no superfluous intermediates are synthesized. In the 
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Summary 

77, t> many types of celts in animals and plants are created largely through mecha- 
nisms that cause different genes to he transcribed in different cells. Since many spe- 
cialized tmimal cells can maintain their unitpte character when grown in culture, the 
gene regulatory mechanisms involved in creating them mast he stable once estab- 
lished and heritidde when the cell divide, endowing the cell with a memory of its 
developmental history. )>rocuryotes and yeasts provide unusually accessible model 
systems in which to study gene regulatory mechanisms, some of which may he rel- 
evant to the creation of specialized cell types in higher encaryotes. One such mecha- 
nism involves a competitive interaction between two tor more) gene regulatory pro- 
teins, each of which inhibits the synthesis of the other; this can create a flip-flop 
switch that switches a cell between two id tentative patterns of gene expression. Di- 
rect or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their men synthesis, provide a general mechanism for cell memory. 

In encaryotes gene transcription is generally controlled by combinations oj gene 
' regulatory proteins. It is thought that each type of cell in a higher et tea ty otic organism 
con tf tins n specific emu hi nation of gene regulatory proteins that ensures the expres- 
sion of only those genes appropriate to that type of cell. A given gene r egulatory pro- 
tein mny /;»' e.\ pressed in a variety of circumstances and iypictdly is involved in the 
rei^uUition of ninny genes. 

in iiilditioir to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are itlso utilized by enc myotic cells to regulate gene expression. In ver- 
tebrates DNA methylotion also plays a part, mainly as u device to reinforce decisions 
about gene expression thai are mode initially by other mechanisms. 



Although controls on the initiation of gene l ransei iplion arc the predominant 
torm of regulation Ini'most genes, oilier controls can acl later in the pathway 
from UNA to po>iein to modulate the amount of gene product thai is made. Al- 
though ihrsr post 1 1 anso iplional controls, which opeiaie after KNA polyene) ase 
has hound to the gene's promote* b'-'gnn UNA synthesis, ate less common 
than irunscriptiomtt control, lot many grnes they ate crucial. It seems that evety 
step in gene evpiession that eon hi he eon I tolled in piiociple is likely 10 he i emu- 
lated under some ciicumstances for some genes! 

We considei the vatieties ot posmansoipiional regulation in temporal oi- 
dn. at cooling to flu- sequence <>l events th;tl might he e;>:pe) ieneed by an RNA 
molecule a! t ei its ■ i attsct ipi ion has begun l l : igme !)~7;>i. 
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An increased high-mobility group A2 expression level is associated with 
malignant pheuotype iu pancreatic exocrine tissue, 

AbeJV, VVataitabeT, Suzuki V , Matsumoto N . Masaki T . Mori T . Sugiyama M . 
Cfuappetta G . Fusco A , A to mi Y . 

First Department of Surgery, Kyorin University School of Medicine, 6-20-2, Shinkawa, 
Mtlaka, Tokyo 181-861 1, Japan, abenbtg@kyorin-u.ac.jp 

The altered form of the high-mobility group A2 (HMGA2) gene is somehow related to 
the generation of human benign and malignant tumours of mesenchymal origin. 
However, only a few data on the expression of HMGA2 in malignant tumour originating 
from epithelial tissue are available. In this study, we examined the HMGA2 expression 
level m pancreatic carcinoma, and investigated whether alterations in the HMGA2 
expression level are associated with a malignant pheuotype in pancreatic tissue. High- 
mobility group A2 mRNA'and protein expression was determined in eight surgically 
resected specimens of non-neoplastic tissue (six specimens of normal pancreatic tissue 
and two of chronic pancreatitis tissue) and 27 pancreatic carcinomas by highly sensitive 
reverse transcriptase-polymerase chain reaction (RT-PCR) techniques and 
immunohistochemical staining, respectively. Reverse transcriptase-polymerase chain 
reaction analysis revealed the expression of the HMGA2 gene in non-neoplastic 
pancreatic tissue, although its expression level was significantly lower than that in 
carcinoma, [mmunolustochemicat analysis indicated that the presence of the HMGA2 
gene in non-neoplastic pancreatic tissue observed in RT-PCR reflects its abundant 
expression in islet cells, together with its focal expression in duct epithelial cells. Intense 
and multifocal or diffuse HMGA2 immunoreactivity was noted in all the pancreatic 
carcinoma examined, A strong correlation between HMGA2 overexpression and the 
diagnosis of carcinoma was statistically verified. Based on these findings, we propose 
that an increased expression level of the HMGA2 protein is closely associated - with the 
malignant pheuotype in the pancreatic exocrine system, and accordingly, HMGA2 could 
serve as a potential diagnostic molecular marker for distinguishing pancreatic malignant 
cells from non-neoplastic pancreatic exocrine cells. 
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populations that lack the Ph 1 marker, supports a secondary 
or complementary role for Ph 1 in the progression of the 
disease (24, 25). This chromosome marker is found in 
chronic, accelerated, and blast-crisis phases of the disease. It 
is likely that Ph 1 confers some growth advantage, since cells 
with the marker chromosome eventually predominate the 
marrow and peripheral blood even in chronic phase. During 
the phase of blast crisis, many patients develop additional 
chromosome abnormalities, including duplication of Ph 1 , a 
variety of trisomies, and complex translocations (26). This 
is suggestive evidence for Ph 1 being a necessary but not 
sufficient genetic change for the full evolution of the 
disease. 

The realization that one molecular result of Ph 1 is the 
generation of a chimeric bcr-abl protein with functional 
characteristics and structure analogous to the gag-abl trans- 
forming protein of the Abelson murine leukemia virus 
strengthens the argument for an important role of Ph 1 in the 
pathogenesis of CML. Although the Abelson virus is gener- 
ally considered a rapidly transforming retrovirus, its effects 
can range from overcoming growth factor requirements, to 
cellular lethality, to induction of highly oncogenic tumors in 
a number of hematopoietic cell lineages (27, 28). Even in the 
transformation of murine cell targets, there are several lines 
of evidence that suggest that the growth-promoting activity of 
the v-abl gene product is complemented by further cellular 
changes in the production of the malignant-cell phenotype 
(29-31). 

The regulation of bcr-abl gene expression is complex 
because the 5' end of the gene is derived from the non-abl 
sequences, bcr t normally found on chromosome 22 (6). The 
level of stable message for the normal bcr gene and the 
normal abl gene are both much lower than the level of the 
bcr-abl message and protein from cell lines and clinical 
specimens derived from myeloid blast-crisis patients (5, 6, 
11). Therefore, the high level of bcr-abl expression cannot 
simply be attributed to the regulatory sequences associated 
with bcr. Possibly, creation of the chimeric gene disrupts the 
normal regulatory sequences and results in a higher level of 
expression. Variation in bcr-abl expression may result from 
secondary changes in the structure of the chimeric gene or 
function of transacting factors that occur during evolution of 
the disease. Our analysis of P210 c ' abl and the 8-kb mRNA in 
Epstein-Barr virus-transformed Ph^positive B-cell lines 
demonstrates that stable message and protein levels from the 
bcr-abl gene can vary over a wide range. This variation does 
not result from a change in the number of bcr-abl templates 
secondary to gene amplification but more likely from changes 
in either transcription rate or mRNA stability. We suspect 
this range of bcr-abl expression is not limited to lymphoid 
cells. Analysis of peripheral blood leukocytes derived from 
an unusual CML patient who has been in chronic phase with 
myeloid predominance for 16 years showed a level of 
P210 c abl one-fifth that of P145 c abf , as detected by metabolic 
labeling with [ 32 P]orthophosphate and immunoprecipitation 
(S.C., O.N.W., and P. Greenberg, unpublished observa- 
tions). Lower levels of expression of the chimeric mRNA 
have been demonstrated in clinical samples from chronic- 
phase CML patients compared to acute-phase CML patients 
(9). Others have reported chronic-phase patients with vari- 
able but, in some cases, relatively high levels of the bcr-abl 
mRNA (32). The sampling variation and the heterogenous 
mixture of cell types in clinical samples complicate such 
analyses. Further work is needed to evaluate whether there 
is a defined change in P210 c abl expression during the pro- 
gression of CML. It is interesting to note that among the 
limited sample of Ph l -positive B-cell lines we have examined 
(Table 1), we have seen higher levels of P210 c ab! in those 
derived from patic v. at more advanced stages of the disease. 



It will be important to search for ceil-type-specific mecha- 
nisms that might regulate expression of bcr-abl from Ph 1 . 
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Ruedi Aebersold In this review we exami ne the current state of proteome analysis. There are 

three main issues discussed: why it is necessary to study proteomes; how pro- 
Department of Molecular teomes can be analyzed with current technology; and how proteome analysis 
Biotechnology University of Can be US6d t0 enhance biological research. We conclude that proteome anal- 
Washineton Seattle WA USA yS,S IS an essential too[ in the understanding of regulated biological systems. 

Current technology, while still mostly limited to the more abundant proteins, 
enables the use of proteome analysis both to establish databases of proteins 
present, and to perform biological assays involving measurement of multiple 
variables. We believe that the utility of proteome analysis in future biological 
research will continue to be enhanced by further improvements in analytical 
technology. 
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protein analysis technology. Given the long-standing 
paradigm in biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
tions which arise are why technically complex proteome 
projects should be undertaken and what specific types of 
information could be expected from proteome projects 
which cannot be obtained from genomic and transcript 
profiling projects. We see three main reasons for pro- 
leome analysis to become an essential component in the 
comprehensive analysis of biological systems, (i) Protein 
expression levels are not predictable from the mRNA 
expression levels, (ti) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence, and (iii) proteomes 
are dynamic and reflect the state of a biological system. 



2.1 Correlation between mRNA and protein expression 
levels 

Interpretations of quantitative mRNA expression profiles 
frequently implicitly or explicitly assume that for specific 
genes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharomyces cerevisiae 
growing at mid-log phase (S. P. Gygi et al. % submitted for 
publication). mRNA expression levels were calculated 
from published SAGE frequency tables [22]. Protein 
expression levels were quantified by metabolic radiola- 
beling of the yeast proteins, liquid scintillation counting 
of the protein spots separated by high resolution 2-DE 
and mass spectrometric identification of the protein(s) 
migrating to each spot. The selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
ducts. Thus far, we have found a general trend but no 
strong correlation between protein and transcript levels 
(Fig. 1). For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-fold. Similarly, equivalent steady- 
state protein expression levels were maintained by trans- 
cript levels varying by as much as 40-fold (S. P. Gygi 
et ai. y submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
from the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

In the mature, biologically active form many proteins are 
post-translationally modified by glycosylation, phosphor- 
ylation, prenylation, acylation, ubiquitination or one or 
more of many other modifications [23] and many pro- 
teins are only functional if specifically associated or com- 
plexed with other molecules, including DNA, RNA, pro- 
teins and organic and inorganic cofactors. Frequently, 
modifications are dynamic and reversible and may alter 
the precise three-dimensional structure and the state of 
activity of a protein. Collectively, the state of modifica- 
tion of the proteins which constitute a biological system 
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Figure I. Correlation between mRNA and protein levels in yeast cells. 
For a selected population of 80 genes, protein levels were measured 
by 3i -S-radiolabeling and mRNA levels were calculated from publi- 
shed SAGE tables. Inset: expanded view of the low abundance region. 
For more experimental details, also see Figs. 5 and 6, (S. P. Gygi et a/., 
submitted). 



are important indicators for the state of the system, The 
type of protein modification and the sites modified at a 
specific cellular state can usually not be determined 
from the gene sequence alone. 



2.3 Proteomes are dynamic and reflect the state of a 
biological system 

A single genome can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle and states of differentiation, responses to 
growth and nutrient conditions, temperature and stress, 
and pathological conditions represent cellular states 
which are characterized by significantly different pro- 
teomes. The proteome, in principle, also reflects events 
that are under translational and post-translational con- 
trol. It is therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- 
cription of the state of a cell or tissue, provided that the 
external conditions defining the state are carefully deter- 
mined. In answer to the question of whether the study 
of proteomes is necessary for the analysis of biomolec- 
ular systems, it is evident that the analysis of mature pro- 
tein products in cells is essential as there are numerous 
levels of control of protein synthesis, degradation, 
processing and modification, which are only apparent by 
direct protein analysis. 



3 Description and assessment of current proteome 
analysis technology 

3.1 Technical requirements of proteome technology 

In biological systems the level of expression as well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state of the system. Comprehen- 
sive analysis of the identity, quantity and state of modifi- 
cation of proteins therefore requires the detection and 
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quantitation of the proteins which constitute the system 
and analysis of differentially processed forms. There are 
a number of inherent difficulties in protein analysis 
which complicate these tasks. First, proteins cannot' be 
amplified. It is possible to produce targe amounts of a 
particular protein by over-expression in specific cell sys- 
tems. However, since many proteins are dynamically 
post-translationally modified, they cannot be easily am- 
plified in the form in which they finally function in the 
biological system. It is frequently difficult to purify from 
the native source sufficient amounts of a protein for 
analysis. From a technological point of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins are modified and pro- 
cessed post-translationally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isoforms also needs to be determined. The dis- 
tribution of a constant amount of protein over several 
differentially modified isoforms further reduces the 
amount of each species available for analysis. The com- 
plexity and dynamics , of post-translational protein edit- 
ing thus significantly complicates proteome studies 
Third, proteins vary dramatically with respect to their 
solubility in commonly used solvents. There are few, if 
any, solvent conditions in which all proteins are soluble 
and which are also compatible with protein analysis. This 
makes the development of protein purification methods 
particularly difficult since both protein purification and 
solubility have to be achieved under the same condi- 
tions. Detergents, in particular sodium dodecyl sulfate 
(SDS), are frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS polyacrylamide gel electro- 
phoresis (SDS-PAGE) over other protein separation 
techniques. Thus, SDS-PAGE and two-dimensional gel 
electrophoresis, which also uses SDS and other deter- 
gents, are the most general and preferred methods for 
the purification of small amounts of proteins, provided- 
that activity does not necessarily need to be maintained. 
Lastly, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use methods which are as 
rapid as possible to allow completion of the project 
within a reasonable time frame. Therefore, a successful 
general proteomics technology requires high sensitivity 
high throughput, the ability to differentiate differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample. 

3.2 2-D electrophoresis - mass spectrometry: a common 
implementation of proteome analysis 

The most common currently used implementation of 
proteome analysis technology is based on the separation 
of proteins by two-dimensional (IEF/SDS-PAGE) gel 
electrophoresis and their subsequent identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS). In 2-DE, proteins are first separ- 
ated by isoelectric focusing (IEF) and then by SDS- 
PAGE, in the second, perpendicular dimension. Separ- 
ated proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins. 2-DE gels are, at present, the most commonly 
used means of global display of proteins in complex 
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samples. The separation of thousands of proteins has 
been achieved in a single gel (24, 25j and differentially 
modified proteins are frequently separated. Due to the 
compatibility of 2-DE with high concentrations of deter- 
gents, protein denaturants and other additives promoting 
protein solubility, the technique is widely used. 

The second step of this type of proteome analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from polyacrylamide gels have traditionally 
been identified using A/terminal sequencing (26 27] 
internal peptide sequencing (28, 291, immunoblotting or 
comigration with known proteins (30|. The recent dra- 
mat.c growth of large-scale genomic and expressed 
sequence tag (EST) sequence databases has resulted in a 
fundamental change in the way proteins are identified by 
their ammo add sequence. Rather than by the traditional 
methods described above, protein sequences are now fre- 
quently determined by correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the information contained in sequence data- 
bases (31-33J. 

There are a number of alternative approaches to pro- 
teome analysis currently under development. There is 
considerable interest in developing a proteome analysis 
stragegy which bypasses 2-DE altogether, because it is 
considered a relatively slow and tedious process and 
because of perceived difficulties in extracting proteins 
from the gel matrix for analysis. However, 2-DE as a 
starting point for proteome analysis has many advan- 
tages compared to other techniques available today The 
most significant strengths of the 2-DE-MS approach 
include the relatively uniform behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simultaneous display of hundreds to thousands of 
proteins within a reasonable time frame. 

A schematic diagram of a typical procedure of the identi- 
fication of gel-separated proteins is shown in Fig 2 Pro- 
tein spots detected in the gel are enzymatically or chemi- 
cally fragmented and the peptide fragments are isolated 
tor analysis, as already indicated, most frequently by MS 
or MS/MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins 
Itiey can be grouped into two categories, digestion in 
the gel slice (28, 34) or digestion after electrotransfer out 
ot the gel onto a suitable membrane ((29, 35-371 and 
reviewed in (381). In most instances either technique is 

S?S i'2e 3nd yidds g00d results - ^ analvsis of MS or 
Mb/MS data is an important step in the whole process 
because MS instruments can generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number of groups have developed 
software systems dedicated, to the use of peptide MS 
and MS/MS spectra for the identification of proteins 
Proteins are identified by correlating the information 

mc'ATc in thC MS Spectra of P rotein di e e sts or 
Ma /MS spectra of individual peptides with data con- 
tained in DNA or protein sequence databases. 

The systems we are currently using in our laboratory are 
based on the separation of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid chromatog- 
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figure 2. Schematic diagram of a procedure for identification of gel- 
scparaied proteins. Peptides can either be separated by a technique 
such as LC or CE, or infused as a mixture and sorted in the MS. Data- 
base searching can either be performed on peptide masses from an 
MS spectrum, peptide fragment masses from CID spectra of peptides, 
or a combination of both. 



raphy [39, 40) or capillary electrophoresis [41), the anal- 
ysis of the separated peptides by electrospray ioniza- 
tion (ESI) MS/MS, and the correlation of the generated 
peptide spectra with sequence databases using the 
SEQUEST program developed at the University of Wash- 
ington [32, 33]. The system automatically performs the 
following operations: a particular peptide ion character- 
ized by its mass-to-charge ratio is selected in the MS out 
of all the peptide ions present in the system at a parti- 
cular time; the selected peptide ion is collided in a colli- 
sion cell with argon (collision-induced dissociation, 
CID) and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CID spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
same mass as the peptide selected for CID; this correla- 
tion matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from 
which the peptide was derived. There are a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQUEST system 
because it is currently the most highly automated pro- 
gram and has proven to be successful, versatile and 
robust. 

3.3 Protein identification by LC-MS/MS, capillary 
LC-MS/MS and CE-MS/MS 

It has been demonstrated repeatedly that MS has a very 
high intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge is the handling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal of peptide mixtures generated by the digestion 
of low nanogram amounts of protein, from gels into the 
MS/MS system without significant loss of sample or 
introduction of unwanted contaminants. We employ 
three different systems for introducing gel-purified sam- 
ples into an MS, depending on the level of sensitivity 



required. As an approximate guideline, for samples con- 
taining tens of picomoles of peptides, LC-MS/MS is 
most appropriate; for samples containing low picomole 
amounts to high femtomole amounts we use capillary 
LC-MS/MS; and for samples containing femtomoles or 
less, CE-MS/MS is the method of choice. 

3.3.1 LC-MS/MS 

The coupling of an MS to an HPLC system using a 
0.5 mm diameter or bigger reverse phase (RP) column 
has been described in detail [42]. This system has several 
advantages if a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated mode using an autosampler, thus maxi- 
mizing sample throughput and minimizing the need for 
operator interference. The relatively large column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and micro-sprayer [40], analyses can be per- 
formed on a small fraction of the sample (less than 5%) 
while the remainder of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used to analyze 
peptide fractions, such as scintillation of an introduced 
radiolabel, and this data can be correlated with peptides 
identified by CID spectra. 

3.3.2 Capillary LC-MS 

An increase of sensitivity of approximately tenfold can be 
achieved by using a capillary LC system with a 100 um ID 
column rather than a 0.5 mm ID column as referred to 
above. Since very low flow rates are required for such 
columns, most reports have used a precolumn flow split- 
ting system for producing solvent gradients. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables the formation 
of reproducible gradients at very low flow rates (low 
nL/min) without the need for flow splitting (A. Ducret 
et at., submitted for publication). Using this capillary 
LC-MS/MS system we were able to identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
were loaded onto the gel [40]. This system is as yet not 
automated and, like all capillary LC systems, is prone to 
blockage of the columns by microparticulates when ana- 
lyzing gel-separated proteins. 

3.3.3 CE-MS/MS 

The highest level of sensitivity for analyzing gel-sep- 
arated proteins can be achieved by using capillary elec- 
trophoresis - mass spectrometry (CE-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPE-CE) system which was used with triple 
quadrupole and ion trap ESI-MS/MS systems for the 
identification of proteins at the low femtomole to sub- 
femtomole sensitivity level [43, 44]. While this system is 
highly sensitive, its operation is labor-intensive and its 
operation has hot been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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Figure 3. Schematic illustration of a 
microfabricated analytical system for CE, 
consisting of a micromachined device, 
coated capillary electroosmolic pump, 
and microelectrospray interface. The 
dimensions of ihe channels and reservoir 
are as indicated in the text. The channels 
on the device were graphically enhanced 
to make them more visible. Reproduced 
from (45|, with permission. 



microfabricated devices for the introduction of samples 
into ESI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass into which channels 
of 10-30 urn in depth and 50-70 urn in diameter are 
etched by using photolithography/etching techniques 
similar to the ones used in the semiconductor industry. 
(A simple device is shown in Fig. 3). The channels are 
connected to an external high voltage power supply [45]. 
Samples are manipulated on the device and off the 
device to the MS by applying different potentials to the 
reservoirs. This creates a solvent flow by electroosmotic 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valves or gates and without any external pumping, the 
flow can be redirected by simply switching the position 
of the electrodes on the device. The direction and rate of 
the flow can be modulated by the size and the polarity 
of the electric field applied and also by the charge state 
of the surface. . 

The type of data generated by the system is illustrated in 
Fig. 4, which shows the mass spectrum of a peptide sample 
representing the tryptic digest of carbonic anhydrase at 
290 fmol/uL. Each numbered peak indicates a peptide suc- 
cessfully identified as being derived from carbonic an- 



hydrase. Some of the unassigned signals may be chemical 
or peptide contaminants. The MS is programmed to auto- 
matically select each peak and subject the peptide to CID. 
The resulting CID spectra are then used to identify the 
protein by correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
the samples, to automatically generate CID spectra of 
selected peptide ions and to search sequence databases 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins can 
be identified at very low femtomole level sensitivity at a 
rate of approximately one protein per 15 min. 

3.4 Assessment of 2-DE-MS proteome technology 

Using a combination of the analytical techniques de- 
scribed above we have identified the 80 protein spots 
indicated in Fig. 5. The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total cell lysate of the yeast strain YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteome analysis 
can penetrate towards the identification of low abun- 
dance proteins, we have calculated the codon bias of the 
genes encoding the respective proteins. Codon bias is a 
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Figure 4. MS spectrum of a tryptic digest 
of carbonic anhydrase using the microfa- 
bricated system shown in Fig. 3. 290 
fmol/uL of carbonic anhydrase tryptic 
digest was infused into a Finnigan LCQ 
ion trap MS. Each peak was selected for 
CID, and those which were identified as 
containing peptides derived from car- 
bonic anhydrase are numbered. Repro- 
duced from (45|, with permission. 
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'liT 'a 2 " DE separation of a lysate of yeast cells, with identified proteins highlighted. The first dimension of separation was an IPG from 
I'll 3-10. and the second dimension was a 10%T SDS-PAGE gel. Proteins were visualized by silver staining. Further details of experimental 
procedures are included in S. P. Gygi et ai (submitted). 



calculated measure of the degree of redundancy of trip- 
let DNA codons used to produce each amino acid in a 
particular gene sequence. It has been shown to be a 
useful indicator of the level of the protein product of a 
particular gene sequence present in a cell [46], The gen- 
eral rule which applies is that the higher the value of the 
codon bias calculated for a gene, the more abundant the 
protein product of that gene becomes. The calculated 
codon bias values corresponding to the proteins identi- 
fied in Fig. 5 are shown in Fig. 6b. Nearly all of the pro- 
teins identified (> 95%) have codon bias values of > 0.2, 
indicating they are highly abundant in cells. In contrast! 
codon bias values calculated for the entire yeast genome 
.(Fig. 6a) show that the majority of proteins present in 
Jhe proteome have a codon bias of < 0.2 and are thus of 
'ow abundance. 

This finding is of considerable importance in our assess- 
ment of the current status of proteome analysis technol- 
ogy- It is clear that even using highly sensitive analytical 
techniques, we are only able to visualize and identify the 



more abundant proteins. Since many important regula- 
tory proteins are present only at low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing many more proteins than 
the approximately 6000 gene products present in yeast 
ceils [16]. In the analysis of, for example, the proteome 
of any human cells, there are potentially 50000-100000 
gene products [47]. Inherent limitations on the amount 
of protein that can be loaded on 2-DE, and the number 
of components that can be resolved, indicate that only 
the most highly abundant fraction of the many gene 
products could be successfully analyzed. One approach 
that has been employed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
strips for the first-dimension separation of 2-DE [48]. 
Since only those proteins which focus within the narrow 
range will enter the second dimension of separation, a 
much higher sample loading within the desired range is 
possible. This, in turn, can lead to the visualization and 
identification of less abundant proteins. 
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Figure 6. Calculated codon bias values for yeast proteins. (A) Distribu- 
tion of calculated values for the entire yeast proteome, (B) Distribu- 
tion of calculated values for the subset of 80 identified proteins also 
shown in Figs. 1 and 5. Further details of experimental procedures are 
included in S. P. Gygi et at. (submitted). 



4 Utility of proteome analysis for biological 
research 

For the success of proteomics as a mainstream approach 
to the analysis of biological systems it is essential to 
define how proteome analysis and biological research 
projects intersect. Without a clear plan for the implemen- 
tation of proteome-type approaches into biological re- 
search projects the full impact of the technology can not 
be realized. The literature indicates that proteome anal- 
ysis is used both as a database/data archive, and as a bio- 
logical assay or biological research tool. 

4.1 The proteome as a database 

The use of proteomics as a database or data archive 
essentially entails an attempt to identify all the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is relevant for each 
protein. The level of annotation can, of course, be exten- 
sive. The most common implementation of this idea is 
the separation of proteins by high resolution 2-DE, the 
identification of each detected protein spot and the 
annotation of the protein spots in a 2-DE gel database 
format. This approach is complicated by the fact that it is 
difficult to precisely define a proteome and to decide 
which proteome should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the proteome is highly dynamic. Processes 
such as differentiation, cell activation and disease can all 
significantly change the proteome of a species. This is 
illustrated in Fig. 7. The figure shows two high-resolu- 



tion 2-DE maps of proteins isolated from rat serum. ; 
Fig. 7A is from the serum of normal rats, while Fig. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment with an inflammation-causing agent [491. 
It is obvious that the. protein patterns are significantly 
different in several areas, raising the question of exactly 
which proteome is being described. 

Therefore, a comprehensive proteome database of a spe- 
cies or cell type needs to contain all of the parameters 
which describe the state and the type of the cells from 
which the proteins were extracted as well as the software 
tools to search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitatively 
describing the fate of each protein if specific systems 
and pathways are activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature of molecules specifically interacting 
with a protein as well as the rate of change of these 
variables should be described. Using these admittedly 
stringent criteria, there is currently no comlete proteome 
database. A number of such databases are, however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD [50] (accessible at http://www.ypd.com) and 
the human 2D-PAGE databases of the Danish Centre 
for Human Genome Research [12] (accessible at http:// 
biobase.dk/cgi-bin/celis). While neither can be con- 
sidered complete as not all of the potential gene pro- 
ducts are identified, both contain extensive annotation 
of supplemental information for many of the spots 
which are positively identified in reference samples. 

4.2 The proteome as a biological assay 

The use of proteome analysis as a biological assay or 
research tool represents an alternative approach to inte- 
grating biology with proteomics. To investigate the state 
of a system, samples are subjected to a specific proceess 
that allows the quantitative or qualitative measurement 
of some of the variables which describe the system. In 
typical biochemical assays one variable (e.g., enzyme 
activity) of a single component (e.g., a particular en- 
zyme) is measured. Using proteomics as an assay; mul- 
tiple variables (e.g., expression level, rate of synthesis, 
phosphorylation state, etc.) are measured concurrently 
on many (ideally all) of the proteins in a sample. The 
use of proteomics as an assay is a less far-reaching prop- 
osition than the construction of a comprehensive pro- 
teome database. It does, however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems and pathways, as long as the interpretation of 
the results takes into account that with current technol- 
ogy not all of the variables which describe the system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis as a 
biological assay is when a 2-DE protein pattern gener- 
ated from the analysis of an experimental sample is 
compared to an array of reference patterns representing 
different states of the system under investigation. The 
state of the experimental system at the time the sample 
was generated is therefore determined by the quantita- 
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n. live comparative analysis of hundreds to a few thousand alized, although the results become more informative as 

B proteins. Comparative analysis of the 2-DE patterns fur- more proteins are compared. It is obvious, however, that 

sr ihcrmore highlights quantitative and qualitative differ- the possibility to identify any protein deemed character- 

)]. ences in the protein profiles which correlate with the istic for a particular state dramatically enhances this 

ly state of the system. For this type of analysis it is not approach by opening up new avenues for experimenta- 

ly essential that all the proteins are identified or even visu- tion. 





Figure 7. High resolution 2-DE map of proteins isolated from rat serum with or without prior exposure to an inflam- 
mation-causing agent. (A) normal rat serum, (B) acute-phase serum from rats which had previously been exposed to 
an inflammation-causing agent. The first dimension of separation is an IPG from pH 4-10, and the second dimen- 
sion is a 7.5-17.S%T gradient SDS-PAGE gel. Proteins were visualized by staining with amido black. Further details 



ta- of experimental procedures are included in [14, 49 J. 
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Proteome analysis as a biological assay has been success- 
fully used in the field of toxicology, to characterize 
disease states or to study differential activation of cells. 
The approach is limited, of course, by the fact that only 
the visible protein spots are included in the assay, and it 
is well known that a substantial but far from complete 
fraction of cellular proteins are detected if a total cell 
lysate is separated by 2-DE. Proteins may not be 
detected in 2-DE gels because they are not abundant 
enough to be visualized by the detection method used, 
because they do not migrate within the boundaries (size, 
pf) resolved by the gel, because they are not soluble 
under the conditions used, or for other reasons. 

A different way to use proteome analysis as a biological 
assay to define the state of a biological system is to take 
advantage of the wealth of information contained in 
2-DE protein patterns. 2-DE is referred to as two-dimen- 
sional because of the electrophoretic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DE pattern. In addition to the two dimen- 
sions used to generate the protein patterns, a number of 
additional data dimensions are contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with other proteins, rate of synthesis or 
degradation indicate the activity state of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing different states is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it must be emphasized that 
in this type of experiment only a fraction of the cellular 
proteins is analyzed. Since many regulatory proteins are 
of low abundance, this limitation is a concern, particu- 
larly in cases in which regulatory pathways are being 
investigated. 

5 Concluding remarks 

In this report we have addressed three main issues 
related to proteome analysis. First, we have discussed 
the rationale for studying proteomes. Second, we have 
assessed the technical feasibility of analyzing proteomes 
and described current proteome technology, and third, 
we have analyzed the utility of proteome analysis for bio- 
logical research. It is apparent that proteome analysis is 
an essential tool in the analysis of biological systems. 
The multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their correct identi- 
ties, their relevant state of modification and/or associa- 
tion and their amounts. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensitivity levels and at a high level of auto- 
mation of the analytical processes. A number of tech- 
nical challenges, however, remain. While it is currently 
possible to identify essentially any protein spots that can 
be visualized by common staining methods, it is ap- 
parent that without prior enrichment only a relatively 
small and highly selected population of long-lived, 
highly expressed proteins is observed. There are many 
more proteins in a given cell which are not visualized by 
such methods. Frequently it is the low abundance pro- 
teins that execute key regulatory functions. 



We have outlined the two principal ways proteome anal- 
ysis is currently being used to intersect with biological 
research projects: the proteome as a database or data 
archive and proteome analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current proteome data- 
bases typically are limited to one cell type and one state 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of proteome analysis as a 
biological assay can provide a wealth of information, but 
it is limited to the proteins detected and is therefore not 
truly proteome-wide. These limitations in proteomics are 
to a large extent a reflection of the fact that proteins in 
their fully processed form cannot easily be amplified and 
are therefore difficult to isolate in amounts sufficient for 
analysis or experimentation. The fact that to date no 
complete proteome has been described further attests to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteome analysis will eventually 
become attainable. 
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Each year, over 182,000 women in the United States are 
diagnosed with breast cancer, and approximately 45,000 die 
of the disease. 1 Incidence appears to be increasing in the 
United States at a rate of roughly 2% per year. The reasons 
for the increase are unclear, but non-genetic risk factors appear 
to play a large role. 2 

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors most predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which tumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease, and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment. 

Prognostic markers currently used in breast cancer recur- 
rence prediction include rumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease- free 
survival. The gene has been shown to be amplified and/or 
overexpressed in 1 0%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which 
HER-2/neu status can be evaluated: immunohistochemistry 
(IHC, HercepTest™) and FISH (fluorescent in situ hybridiza- 
tion, PathVysion™ Kit). Both methods can be performed on 
archived and current specimens. The first method allows visual 
assessment of the amount of HER-2/neu protein present on 
the cell membrane. The latter method allows direct quantifi- 
cation of the level of gene amplification present in the tumor, 
enabling differentiation between low- versus high-amplifica- 
tidti. At least one study has demonstrated a difference in 



recurrence risk in women younger than 40 years of age for 
low- versus high-amplified tumors (54.5% compared to 
85.7%); this is compared to a recurrence rate of 16.7% for 
patients with no HER-2/neu gene amplification. 4 HER-2/neu 
status may be particularly important to establish in women with 
small (< 1 cm) tumor size. 

The choice of methodology for determination of HER-2/ 
neu status depends in part on the clinical setting. FDA approval 
for the Vysis FISH test was granted based on clinical trials 
involving 1549 node-positive patients. Patients received one 
of three different treatments consisting of different doses of 
cyclophosphamide, Adriamycin, and 5-fluorouracil (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
based therapy, while those with normal HER-2/neu levels did 
not. The study therefore identified a sub-set of women, who 
because they did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates that HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage-2 
breast cancer patients. 

Selection of patients for Herceptin 0 (Trastuzumab) mono- 
clonal antibody therapy, however, is based upon demonstra- 
tion of HER-2/neu protein overexpression using HercepTest™. 
Studies using Herceptin 0 in patients with metastatic breast 
cancer show an increase in time to disease progression, 
increased response rate to chemotherapeutic agents and a small 
increase in overall survival rate. The FISH assays have not yet 
been approved for this purpose, and studies looking at response 
to Herceptin c in patients with or without gene amplification 
status determined by FISH are in progress. 

In general, FISH and IHC results correlate well. However, 
subsets of tumors are found which show discordant results; 
i.e., protein overexpression without gene amplification or lack 
of protein overexpression with gene amplification. The clini- 
cal significance of such results is unclear. Based on the above 
considerations, HER-2/neu testing at SHMC/PAML will uti- 
lize immunohistochemistry (HercepTest 0 ) as a screen, fol- 
lowed by FISH in IHC-negative cases. Alternatively, either 
method may be ordered individually depending on the clini- 
cal setting or clinician preference. 
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HER-2/neu via IHC 

88342 (including interpretive report) 

HER-2/neu via FISH 

88271 *2 Molecular cytogenetics, DNA probe, each 
88274 Molecular cytogenetics, interphase in situ hybrid- 
ization, analyze 25-99 cells 
8829 1 Cytogenetics and molecular cytogenetics, interpre- 
tation and report 

Procedural Information 

Immunohistochemistry is performed using the FDA-approved 
DAKO antibody kit, Herceptest 0 . The DAKO kit contains 
reagents required to complete a two-step immunohisto- 
chemical staining procedure for.routinely processed, paraffin- 
embedded specimens. Following incubation with the primary 
rabbit antibody to human HER-2/neu protein, the kit employs 
a ready-to-use dextran-based visualization reagent. This re- 
agent consists of both secondary goat anti -rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextran polymer backbone, thus eliminating the need 
for sequential application of link antibody and peroxidase 
conjugated antibody. Enzymatic conversion of the subse- 
quently added chromogen results in formation of visible 
reaction product at the antigen site. The specimen is then coun- 
terstained; a pathologist using light-microscopy interprets 
results. 

FISH analysis at SHMC/PAML is performed using the 
FDA-approved PathVysion™ HER-2/neu DNA probe kit, pro- 
duced by Vysis, Inc. Formalin fixed, paraffin-embedded breast 
tissue is processed using routine histological methods, and then 
slides are treated to allow hybridization of DNA probes to the 
nuclei present in the tissue section. The Pathvysion™ kit con- 
tains two direct-labeled DNA probes, one specific for the 
alphoid repetitive DNA (CEP 1 7, spectrum orange) present at 
the chromosome 1 7 centromere and the second for the HER- 
2/neu oncogene located at 17ql 1 .2-12 (spectrum green). Enu- 
meration of the probes allows a ratio of the number of copies 
of chromosome 17 to the number of copies of HER-2/neu to 
be obtained; this enables quantification of low versus high 
amplification levels, and allows an estimate of the percentage 
of cells with HER-2/neu gene amplification. The clinically 
relevant distinction is whether the gene amplification is due 
to increased gene copy number on the two chromosome 17 
homologues normally present or an increase in the number of 
chromosome 17s in the cells. In the majority of cases, ratio 
equivalents less than 2.0 are indicative of a normal/negative 
result, ratios of 2.1 and over indicate that amplification is 
present and to what degree. Interpretation of this data will be 
performed and reported from the Vysis-certified Cytogenet- 
ics laboratory at SHMC. 



1. Wingo, P.A., Tong, T., Bolden, S., "Cancer Statistics", 1995;45: 1:8-31. 

2 "Cancer Rates and Risks", 4* ed., National Institutes of Health, National 
Cancer Institute, 1996, p. 120. 

3 Slamon, DJ., Clark, G.M., Song, S.G., Levin, W.J., Ullrich, A., McCuire, 
W.L. "Human breast Cancer Correlation of relapse and survival with 
amplification of the her-2/neu oncogene". Science, 235:177-182, 1987. 

4 Xing, W.R., Gilchrist, K.W., Harris. CP., Samson, W., Meisner, L.R 
"FISH detection of HER-s/neu oncogene amplification in early onset 
breast cancer". Breast Cancer Res. And Treatment 3 9(2): 203-2 12, 1996. 

5 Press, M.F, Bernstein, L, Thomas, P. A., Meisner, L.F., Zhou, J.Y., Ma, 
Y., Hung, C, Robinson, R.A., Hams, C, El-Naggar, A., Slamon, D.J., 
Phillips, R.N., Ross, J.S., Wohnan, S.R., Flom, K.J., "Her-2/neu gene 
amplification characterized by fluorescence in situ hybridization: poor 
prognosis in node-negative breast carcinomas", J. Clinical Oncology 
15(8):2894-2904, 1997. 



Provided for the clients of 



Pathology Associates Medical Laboratories 
PacLab Network. Laboratories 

Tri-Cities Laboratory 
Treasure Valley Laboratory 

For more information, please contact 
your local representative. 



© 1999 by Pathology Associates Medical Laboratories. 



Journal of- _ 

research articles prOTGOITIG 

•research 



Analysis of Genomic and Proteomic Data Using Advanced Literature 

Mining 

Yanhul Hu, Lisa M. Hines, Halfeng Weng, Dongmei Zuo, Miguel Rivera, 
Andrea Richardson, and Joshua LaBaer* 

Institute ofPrvteomlcs. Harvard Medical SchooHBCMP, 240 Longwood Avenue, Boston. Massachusetts 02115 

Received March 13, 2003 

High-throughput technologies, such as proteomic screening and DNA micro-arrays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated literature-mining tool, termed MedGene, which 
comprehensively summarizes and estimates the relative strengths of all human gene-disease 
relationships In Medline. Using MedGene, we analyzed a novel micro-array expression dataset 
comparing breast cancer and normal breast tissue In the context of existing knowledge. We found no 
correlation between the strength of the literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fold; however, a significant correlation was 
observed (r = 0.41; p = 0.05) among genes showing an expression difference of 10-fold or more. 
Interestingly, this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
identified a set of relatively understudied, yet highly expressed genes in ER negative tumors worthy of 
further examination. 

Keywords: blolnformatlcs • micro-array • text mining • gene-disease association • breast cancer 



Introduction 

At its current pace, the accumulation of biomedical literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own immediate fields, let alone cover a broader 
range of topics. For example, to follow a single disease, e.g.. 
breast cancer, a researcher would have had to scan 130 different 
Journals and read 27 papers per day in 1999. 1 This problem is 
accentuated with high-throughput technologies such as DNA 
micro-arrays and proteomics, which require the analysis of 
large datasets involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any microarray experi- 
ment thousands of genes may demonstrate statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret these datasets 
would be enhanced if they could be compared to a compre- 
hensive summary of what is known about all genes. Thus, there 
is a need to summarize existing knowledge in a format that 
allows for the rapid analysis of associations between genes and 
diseases or other specific biological concepts. 

One solution to this problem is to compile structured digital 
resources, such as the Breast Cancer Gene Database 1 and the 
Tumor Gene Database, 1 However, as these resources are hand- 
curated. the labor-intensive review process becomes a rate- 
limiting step in the growth of the database. As a result, these 
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databases have a limited scale and the genes are not selected 
in a systematic fashion. 

An alternative approach is automated text mining; a method 
which Involves automated information extraction by searching 
documents for text strings and analyzing their frequency and 
context This approach has been used successfully in several 
instances for biological applications. In most cases, it has been 
applied to extract information about the relationships or 
Interactions that proteins or genes have with one another, in 
the literature or by functional annotation. 1 * 7 Thus far. few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez-Iratxeta et al. 
automatically examined the GO (Gene Ontology) annotation 
of genes and their predicted chromosomal locations In order 
to Identify genes linked to Inherited disorders. 1 

To obtain a more global understanding of disease develop- 
ment It would be valuable to incorporate information regarding 
all possible gene-disease relationships. Including biochemical, 
physiological, pharmacological, epidemiological, as well as 
genetic. This information would enable comprehensive com- 
parisons between large experimental datasets and existing 
knowledge in the literature. This would accomplish two things. 
First, it would serve to validate experiments by demonstrating 
that known responses occur as predicted. Second, it would 
rapidly highlight which genes are corroborated by the literature 
and which genes are novel in a given context. We have utilized 
a computational approach to literature mining to produce a 
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comprehensive set of gene-disease relationships. In addition, 
we have developed a novel approach to assess the strength of 
each association based on the frequency of citation and co- 
cltation. We applied this tool to help Interpret the data from a 
large micro-array gene expression experiment comparing 
normal and cancerous breast tissue. 



Methods 

MedGene Database. MedGene Is a relational database, stor- 
ing disease and gene information from NCBI. text mining re- 
sults, statistical scores, and hyperlinks to the primary lit- 
erature. MedGene has a web-based user Interface for users to 
query the database (http://hipseq.mediiarvard.edu/MedGene/). 

Text Mining Algorithms. MeSH files were downloaded from 
the MeSH web site at NLM (Nation Library of Medicine) (http.// 
www.nlm.nlh.gov/mesh/meshhome.htmO and human disease 
categories were selected. Locuslink files were downloaded from 
the Locuslink web site at NCBI (http://www.ncbl.nih.gov/ 
LocusLink/). Official/preferred gene symbol, official/preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and URLs for each LocusLink record, were 
collected. Gene search terms were used for literature searching 
and included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gene 
family terms and gene official/preferred symbols, were used 
to Index Medline records. If the official/preferred gene symbols 
did not meet the standards to be an Index, then qualified gene 
official/preferred names were used A local copy of Medline 
records (up to Jury. 2002) was pre-selected. 

A JAVA module examined the MeSH terms and then indexed 
each Medline record with the appropriate disease terms. A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to Index the gene- 
related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were Indexed for both gene and disease 
(double positive hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency table-framework, we applied 
different statistical methods to estimate the strength of gene- 
disease relationships and evaluated the results. These methods 
Included chl-square analysis, Fisher's exact probabilities, rela- 
tive risk of gene, and relative risk of disease* (http:// 
Wpseq.medJiarvard.edu/MedGene/). In addition, we computed 
the "product of frequency", which is the product of the 
proportion of disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, we transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPF) to validate MedGene and 
to use for the analysis with the micro-array data. Spearman 
rank-correlation coefficients were used to assess the linear 
relationship between LPF and micro-array fold change in 
expression level. 

Global Analysis. Diseases with at least 50 related genes were 
selected for clustering analysis, and the LPF scores were 
normalized with total score for each disease. Hierarchical 
clustering was done with the "Cluster" software and the 
clustering result was visualized using TreeViewer" (http:// 
rana.lbI.gov/ElsenSoftware.htm) . 
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Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
samples (79% ER-posirJve) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of histo- 
logical types, grades, and hormone receptor immuno-pheno- 
types of breast cancer. Biotinylated cRNA, generated from the 
total RNA extracted from the bulk tumor, was hybridized to 
Affymetrix U95A oligonucleotide micro-arrays. These micro- 
arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from Affymetrix. and then further analyzed using 
the DNA-Chip Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations In the literature, 
we first complied complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) Index 
of Medline records was utilized. MeSH is a controlled medical 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged In both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance. 910 Twenty-three human disease category 
headings along with all of their child terms (see the Supporting 
Information. Supplemental Table 1, or visit http://hlpseq. 
med.harvard.edu/MedGene/publicatlon/s_Table l.html) were 
selected from the 2002 MeSH index creating a list of 4033 
human diseases. 

No index comparable to the MeSH index exists for genes, 
and thus. It was necessary to apply a string search algorithm 
for gene names or symbols found In Medline text A complete 
list of genes, gene names, gene symbols, and frequently used 
synonyms were collected from the LocusLink database at 
NCBI. 11 11 which contains 53 259 independent records keyed 
by an official gene symbol or name dune 18 th , 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of Italics, if at all. 
For the intended use of this study, this lack of distinction is 
unlikely to have a large effect and may in fact be beneficial. 

Initial attempts to search the literature using these lists 
revealed several sources of false positives and false negatives 
(Table 1). False positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development of filters 
to reduce these errors. The syntax Issues were readily handled 
by Including alternate syntax forms In the search terms. The 
false positive cases, caused by duplicative and unrelated 
meanings for the terms, were more difficult to manage. Where 
possible, case sensitive suing mapping reduced inappropriate 
citations. In many cases, however, this was not sufficient and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For the purposes of data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene. Medline records were indexed with a primary gene key 
when any synonym for that key was found in the title or 
abstract. Case-insensitive string mapping was used for all 
searches except as noted above. No additional weight was 
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error type 
false positive 

false positive 

false positive 

false negative 
false negative 
false negative 



example 



MAO-myelln 

associated glycoprotein 
M^O--maUgnancy-assoclated 

protein 

PyHpallid homologue (mouse), 

paUidin (also abbrev. for Pennsylvania) 
WAS- Wiskott-Aldrich Syndrome 

(also the word •was") 
BAG- 1 Instead of BACI 
PS3 Instead of TP53 
estrogen receptor Instead of 
Estrogen receptor 1 



fitter solution 
eliminate this term 

eliminate this term 

case-sensitive string search 

add dash term 
add all gene nicknames 
add family stem term 



gene symbol/name 
Is not unique 

gene symbol is 

unrelated abbreviation 
gene symbol/name 

has language meaning 
nonstandard syntax 
unofficial gene name/symbol 
nonspecifled gene name 

- ,„ p„» *udU, Medline wa, sea.hed for '0^-£ B= ^™^ ^^r.^^^^ ^ 
^^^i^^^^^^^Z^^ * c~. the flte sCuticn laetf Introduce, 

added for multiple occurrences of a term or the co-occurrence 
of multiple synonyms for the same gene key. 

Medliiie records were searched with all qualified gene 
Identifiers, such as the official/preferred gene symbol, the 
official/preferred gene name, all gene nicknames and all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer to use a 
shortened gene family name. e.g.. estrogen receptor uttteador 
estrogen receptor 1 (ESRI). creating a source of false negatives. 
For this reason, gene family stem terms were created for all 
Kenes that have an alpha or numerical suffix (e.g.. IL2RA.TCFP, 
ESRI. etc.) and then used to search the literature. The family 
stem terms were handled separately from the specific gene 
names so that It would be clear when linkages were made to 
the gene family versus a specific member In that family. 

To improve performance and accuracy, some pre-selectlon 
was applied to the records that were scanned. First, review 
articles were eliminated to avoid redundant treatment of 
citations. Second. non-English Journals were removed because 
the natural language Biters were only relevant to English 
publications. Finally. Journals unlikely to contain primary date 
about gene-disease relationships were also removed (e.g.. Int. 
J Health Educ. Bedside Nurse, and J.Health Earn.). Together. 

these filters reduced the 12 198 221 Medline publications (July 

2002) by 37%. «... 

Ranking the Relative Strengths of Gene-Disease Associa- 
tions. In total, there were 618 708 gene-disease co-citations, 
in which 16% (8297) or all studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, we tested several different statistical 
methods and examined the results. With the exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the gene-disease association 
strengths. However, after comparing the results to other 
databases and after consulting disease experts, the log of the 
product of frequency (LPF) was selected for further analysis 
because It gave the best results overall. 

Validation of MedCene. In developing this tool. It was 
important to minimize the number of missed genes (false 
negatives) and miscalled genes (false positives). However, in 
situations when these goals were In conflict. Inclusiveness was 
prioritized. To determine the false negative rate In MedGene. 
breast cancer was used as a test case because It was associated 
with more genes than any other human disease and because 




Figure 1. Estimation of the false negative rate by comparison 
with hand-curated databases. The breast cancer-related genes 
identified by MedGene were compared with those listed In 
several other databases Including the Tumor Gene Database 
(TGD). 1 the Breast Cancer Gene Database(BCG). 1 GeneCards 
(GC)" and Swlsspc^. 11 Genes were considered false negatives 
If they were represented In at least one of these other databases 
and not In MedGene and their link to breast cancer was sup- 
ported by at least one literature reference. All literature references 
were verified by manual review to confirm their validity. The 
number of genes In each database or shared by more than one 
database Is Indicated. The false negative rate was calculated by 
genes missed at MedGene (26)/total number of nonoverlapplng 
genes In other databases (285). 

there were several public databases that link genes to breast 
cancer. We compared the list of breast cancer- related genes 
from MedGene to these databases, illustrated In Figure 1. 
Among the 285 distinct breast cancer-related genes that were 
supported by at least one literature citation in these hand- 
curated databases. 26 were absent from MedGene. suggesting 
a false negative rate of approximately 9%. To determine why 
these were missed, all literature references for these genes (80 
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papers) were reviewed manually (see the Supporting Informa- 
tion, Supplemental Table 2. or visit http://Wpseq.med. 
harvaitLedii/MedGene/publicatlon/sJTable 2.htrru). Among 
these papers, most false negatives were caused by nonstandard • 
gene terms or gene terms eliminated by our specificity filters. 
Few genes were missed because they were only mentioned In 
review papers (0.4%) or they appeared only in the body of the 
manuscript but not the abstract or tide (1.1%). Of note, 
MedGene identified approximately 2000 additional breast 
cancer-related genes not listed In any other database. 

To assess the false positive error rate, two complementary 
approaches were used: a detailed analysis of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined the false positive error rate and its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF, 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest 
ranked 200 genes and manually reviewed the titles and 
abstracts to determine the verity of the association. Nearly 80% 
of the highest ranked 100 genes fell Into one of the five 
categories that reflect meaningful gene-disease relationships 
(see the Supporting Information, Supplemental Table 3. or visit 
http://hlpseq.med.harvard.edu/MedGene/publlcatlon/ 
s Table 3JitrnI). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
reviewed, there were only two In which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the authors did not 
argue the absence of an/ relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer. 11 " 

The coincidence of some gene symbols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted In most of the false positives (see the Supporting 
Informatioa Supplemental Table 4. or visit http://hipse- 
q med.harvaxd.edu^edGene/pubHcation/s.Table 4.htmQ. em- 
phasizing the Importance of the filters that were added In the 
search algorithm (Table 1). Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For example, among the papers 
about breast cancer, there were only 12 Medline records that 
referred to ESR1 and 10 to ESRZ whereas almost 2000 papers 
mentioned estrogen receptor without specifying ESR1 or ESI&. 
this latter group was detected by the family stem term filter. 

To further validate these results, a global analysis of the gene- 
disease relationships described by MedGene was performed. 
For this experiment. It was reasoned that the more closely 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus. If the relationships defined 
by MedGene accurately reflected the literature, then an unsu- 
pervised hierarchical clustering of the gene data should group 
diseases in a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or Inappropriate scoring of the data. 

To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a dataset for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were indeed logical based upon 
common medical knowledge (see the Supporting Information. 
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Supplemental Figure I. or visit httpy/hipseq.med.harvard.edu/ 
MedGene/pubUcation/s_Flgure I. html). For example, In one 
such cluster shown in Figure 2. diabetes and Its complications 
grouped together and were also closely linked to diseases 
associated with starvation states. 

The number of genes associated with a given disease can 
be estimated by adjusting the MedGene number up by theCalse 
negative rate (-9%) and down by the false positive rate (-26% 
on average). Using this, the average disease has 103.7 ± 45.3 
(mean db s.d.) genes associated with It although the rtfige is 
quite broad with 2359 genes related to breast cancer. 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analysis of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high-throughput experiment. We compared the MedGene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis Identified 
2286 genes that had greater than a l-fold difference In mean 
expression level between breast cancer samples and normal 
breast samples. Using MedGene, we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer In the 
literature by gene term search (first-degree association by gene 
name); 328 genes directly linked by family term search (first- 
degree association by family term); 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association); and 505 genes not previously associated with 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2. or visit http://hipseq.med.harvaid.edu/MedGene/ 
pubUcatlon/sJflgure 2.html.) Among the 505 previously un- 
related genes. 467 were either newly Identified genes or genes 
that had not previously been associated with any disease. 
Among the remaining 38 genes, 9 had been related to other 
cancers, specifically esophageal, coloa uterine, skin, and cervix. 

To determine whether the genes highlighted by the micro- 
array analysis were more likely to have been previously linked 
to breast cancer in the literature, we created a two-dimensional 
plot of the fold change of expression level between breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A). There was a broad spread of expression changes 
among the genes directly linked to breast cancer ranging from 
less than 1-fold change (68%) to over 4r>fold (0.3%). Notably, 
the majority of genes with greater than 10-fold expression 
changes were linked to breast cancer by first-degree associa- 
tion. 

Among all 754 genes directly linked to breast cancer in the 
literature, there was no correlation between LPF and micro- 
array fold change (r« 0.018. p-value » 0.62). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend In correlation (Figure 
3B) suggesting that genes with a more substantial change In 
expression level were more likely to have a stronger association 
in the literature. For genes that had 10-fold change or more in 
expression level, the correlation increased to 0.41 (p-value = 
0.05). 

When we evaluated the micro-array data separately for ER 
positive and ER negative tumors, the trend in correlation 
between fold change and literature score was highly dependent 
on estrogen receptor status. Interestingly, there was a similar 
trend in correlation for ER positive tumors, but no trend in 
correlation for ER negative tumors. 
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harvard.edu/MedGene/pub11cation/s_Figure 1 .html). 



Finally, to validate our findings, we computed similar cor- 
relations between the breast cancer expression data and 
LPF scores generated by MedGene for hypertension, a 



disease unrelated to breast cancer. As expected, we did not 
observe an increasing trend in correlation for hyperten- 
sion. 
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estrogen receptor 

PGR 
ERBB2 
BRCAl 
BRCA2 
EGFR 
CYP19 
TFF1 
PSBN2 
TP53 

CES3 

CEACAM5 

ERBB3 
cyclln 
COX5A 
cathepsln 
ERBB4 

TRAM 

CCNDl 
EOF 
MUCl 

insulin-like 
BCL2 

mucin 
FGF3 



REN 
DBP 
LEP 
ACT 
INS 

kalllkrein 
ACE 

endothelln 

S100A6 

SDK 

DIANPH 
SARS 

PIH 

CD59 

ALB 

CYP11B2 
MAT2B 
angiotensin 
receptor 

AGTR2 

NPPA 

LVM 

DBH 
NPY 

POMC 
neuropeptide 



RA 

TNFRSFWA 
CRP 
AS 
ESRl 

HLA-DRBI 
DRt 

interleukin 

TNF 

IL6 

collagen 
ILIA 

ACR 

TNFRSF12 
02 

CHI3L1 
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Interleukin 1 
matrix 
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interferon 

CD68 
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MMP3 
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Discussion 

The Human Genome Project heralded a new era in biological 
research where the emphasis on understanding specific path- 
ways has expanded to global studies of genomic organfcation 
and biological systems. High-throughput technologies can 
provide novel Insight Into comprehensive biological function 
but also introduces new challenges. The utility of these 
technologies Is limited to the ability to generate, analyze, and 
interpret large gene lists. MedGene. a relational database 
derived by mining the information in Medline, was created to 
address this need. MedGene users can query for a rank-ordered 
list of human gene-disease relationships (Table 2) for one or 
more diseases. Each entry Is hyperlinked to the original papers 
supporting each association and to other relevant databases. 

MedGene Is an Innovative extension of previous text mining 
approaches. Perez-Iratxeta et al. used the GO annotation and 
their chromosomal locations to predict genes that may con- 
tribute to inherited disorders .• MedGene takes a broader view 
and includes aU diseases and all possible gene-disease relation- 
ships. Furthermore. MedGene utilizes co-citatlon to Indicate a 
relationship rather than GO annotation, which is limited to the 
subset of genes that have GO annotation. Our approach Is 
complementary to that taken by Chaussabel and Sher who 
used the frequency of co-clted terms to cluster genes Into a 
hierarchy of gene-gene relationships. 0 

A unique aspect of this tool Is the ability to assess the relative 
strengths of gene-disease relationships based on the frequency 
of both co-citation and single citation. This presupposes that 
most co-citations describe a positive association, often referred 
to as publication bias 15 and Is supported by our observations 



that negative associations are rare (Supplemental Table 3: 
http://hlpseq.med.harvard.edu/MedGene/publicatlori/s_Ta- 
ble 3.html). Of course, relationships established by frequency 
of co-dtation do not necessarily represent a true biological link: 
however. It is strong evidence to support a true relationship. 

Another important feature of MedGene is the Implementa- 
tion of software filters that substantially reduced the error rate. 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest associations were real. 
For this study. aU of the filters that we applied were general 
ones. e.g.. expanding the list of all gene names to address the 
different syntax forms used by different Journals, eliminating 
gene names that correspond to common English words, etc. 
The majority of the remaining search term ambiguities were 
idiosyncratic and difficult to identify systematically without 
causing a significant rise in false negatives. Alternative ap- 
proaches, such as the examination of the nearest neighbor 
terms, need to be considered to further reduce the false positive 
rate. 

It is not uncommon to see expression changes In micro- 
array experiments as small as 2-fold reported In the literature. 
Even when these expression changes are statistically significant, 
it Is not always clear if they are biologically meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease- related genes to 
appear In the altered expression group. MedGene provided a 
unique opportunity to test this notion In the context of existing 
knowledge on a novel breast cancer micro-array dataset. For 
genes displaying a 5-fold change or less in tumors compared 
to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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KRTHB1 

BR53 

DKK1 

ZIC1 

TLR1 

KIAA0680 

CDKN3 

EBI2 

CZMB 

STK18 

CPR49 

MYO10 

LADl 

P0LE2 

HMG4 

BCL2L11 

LRP8 

CCNB2 

CCNE2 

FOB 

KNSL6 

H1F5 

SERPINH2 

YAP1 

LPHB 

TCEA2 

TFF1 

COU7A1 
.POPS 

BPAGl 

PDZKl 

VEGFC 

MUC6 

SERPINA5 

MEIS1 

CAM 



1.0 
1.2 
1.2 
1.9 
1.0 
2.6 
1.0 
4.0 
3.8 
4.7 
1.0 
1.6 
-1.0 
4.2 
4.4 
-1.2 
2.9 
1.0 
4.0 
-4.3 
2.9 
3.0 
4.6 
1.0 
-1.3 
-1.1 
1.3 
-4.1 
1.1 
-4.6 
-l.l 
-2.8 
-1.4 
-1.0 
-1.6 
2.4 



610.8 
89.4 
69.8 
59.6 
38.5 
33.2 
30.6 
27.9 
21.9 
18.6 
14.6 
14.4 
13.5 
13.0 
12.9 
12.3 
12.2 
11.8 
11.6 
11.1 
10.9 
10.2 
10.2 
10.0 
-10.4 
-10.8 
-11.4 
-15.7 
-16.2 
-22.3 
-36.8 
-51.5 
-64.9 
-83.1 
-85.9 
-150.3 



Tabte 3. MedGene Identified a set of relatively understated, yet highly 

these Btnei have either never been co-dted with breast cancer or nave a 
weak association except those marked with an . 

reflects the many genes whose role In breast cancer may not 
involve large changes In expression In sporadic tumors (e.g.. 
BRCAl and BRCA2) and genes whose modest changes In 
expression may be unrelated to the disease. Strikingly, among 
genes with a 10-fold change or more In expression level, there 
was a strong and significant correlation between expression 
level and a published role In the disease, providing the first 
global validation of the micro-array approach to identifying 
disease-specific genes. 

The results derived from MedGene have two implications. 
First a careful hunt for corroborating evidence of a role in 
breast cancer should precede any further study of genes with 
less than 5-fold expression level changes. Second, any genes 
with 10-fold changes or more are likely to be related to breast 
cancer and warrant attention. It Is likely that this threshold will 
change depending on the disease as well as the experiment. 

Interestingly, the observed correlation was only found among 
ER-posltive tumors, not ER-negative. This may reflect a bias 
in the literature to study the more prevalent type of tumor in 
the population. Furthermore, this emphasizes that caution 
must be taken when interpreting experiments that may contain 
subpopulatlons that behave very differently. The MedGene 
approach identified a set of relatively understudied, yet highly 
expressed genes in ER-negatlve tumors that are worthy of 
further examination (Table 3). 
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In conclusion, we have developed an automated method of 
summarizing and organizing the vast biomedical literature. To 
our knowledge, the resulting database is the most comprehen- 
sive and accurate of Its kind. By generating a score that reflects 
the strength of the association. It provides an Important tool 
for the rapid and flexible analysis of large datasets from various 
high-throughput screening experiments. Furthermore. It-can 
be used for selecting subsets of genes for functional studies, 
for building disease-specific arrays, for looking at genes com- 
mon to multiple diseases and various other high-throughput 
applications. In the future. It will be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other biological 
processes and concepts, such as cell division and responses to 
small molecules. 
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Genomic and proteomic analysis of Hie myeloid differentiation program 
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Peter E. Ncwburger, Nancy Berliner, and Sherman M. Weissman 



Although the mature neutrophil Is one of 
the better characterized mammalian cell 
types, the mechanisms of myeloid differ- 
entiation are Incompletely understood at 
the molecular level. A mouse promyelo- 
cyte cell line (MPRO), derived from mu- 
rine bone marrow cells and arrested devcl- 
opmentally by a dominant-negative 
retinofc acid receptor, morphologically 
differentiates to mature neutrophils In the 
presence of 10 nM retfnolc acid. An exten- 

Introduction 



slve catalog was prepared of the gene 
expression changes that occur during 
morphologic maturation. To do this, 3'- 
end differential display, oligonucleotide 
chip array hybridization, and 2-dimen- 
slonai protein electrophoresis were used, 
A large number of genes whose mRNA 
levels are modulated during differentia- 
tion of MPRO cells were Identified. The 
results suggest the Involvement of sev- 
eral transcription regulatory factors ndt 



previously Implicated In this process, but 
they also emphasize the Importance of 
events other than the production of new 
transcription factors. Furthermore, gene 
expression patterns were compared at 
the level of mRNA and protein, and the 
correlation between 2 parameters was 
studied. (Blood. 2001;98:513-524) 
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Studies of norma! myeloid maturation from many laboratories have 
identified genes lhat may play critical roles in myeloid differentia- 
tion. M Cuirent studies suggest that these events are dependent on a 
cascade of molecular changes that involve complex modulation of 
mRNA transcription. Furthermore, studies of acute leukemia have 
suggested Lhat the disease arises from the accumulation of myeloid 
precursors arrested at early stages of differentiation and associated, 
in many cases, witfi chromosomal rearrangements that alter the 
structure of specific transcription factors. 5 Nevertheless, the molecu- 
lar events underlying the production of manire myeloid cells arc 
not well understood and appear to use interacting pathways and 
networks, the elucidation of which requires an extensive descrip- 
tion of the molecular components available to the myeloid cell. 

An extensive body or information is accumulating with respect 
to gene expression profiles of mammalian cells. However, much of 
the information available in public databases has been accumulated 
by the use of techniques such as single oligonucleotide chips or 
cDNA arrays that measure fewer than 6000 of potentially 30 000 to 
1 20 000 transcripts. The more limited range of analyses reported by 
the serial analysis of gene expression (SAGE) 6 ' 7 technique accu- 
rately estimates changes in levels of the more abundant mRNAs but 
requires extensive redundant analyses to measure changes in the 
patterns of expression of scarce mRNAs. We have used a modified 
polymerase chain reaction (PCR)~based cDNA differential display 
(DD) method in which single restriction fragments derived from 
the 3' end of cDNAs are separated on a sequencing gel.*- 9 Bands 
from the gel can be identified initially by sequencing, but then 



comparison of patterns from different samples can he made without 
further sequencing. This sensitive and reproducible method detects, 
in principle, most cDNAs regardless of whether they are repre- 
sented in existing databases. 

Systematic analysis of the function of genes can also be 
performed at the protein level. This approach has the advantage of 
being closest to function, because proteins perform most of the 
reactions necessary for the cell. The most common method of 
protcome analysis is the combination of 2-dimensional gel electro- 
phoresis (2DE) to separate and visualize protein and mass spectrom- 
etry (MS) for protein identification. 10 Several such analyses of 
yeast and of normal or malignant mammalian cells have been 
performed. To date, however, there have been few studies in which 
both mRNA and protein have been compared by applying analyses 
to the same samples. The studies of Anderson" and Gygi 1J showed 
that there is not a good correlation between mRNA and protein 
levels, in yeast or human liver cells. However, other analyses 
disagree with this conclusion (Grcenbauin et al, manuscript 
submitted, and Futcher et al 14 ). Furthermore, global correlations 
between changes in mRNA and protein levels have not been 
examined during the execution of any developmental program. 

The MPRO cell line was derived by transduction of a dominant- 
negative retinoic acid receptor construct into nomiai mouse bone 
marrow cells. It is a granulocyte-macrophage colony-stimulating 
factor (GM-CSF)-dependcnt line arrested at a promyeiocytic stage 
of development 1$ . 16 After treatment with aiUrans retinoic acid 
(ATRA) most of the cells acquis the morphology of mature 
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neutrophils and begin to produce neutrophil lactoferrin and gelati- 
nase, 2 proteins characteristic of neutrophil secondary granules.' 7 
As such, it offers a valuable model for studying neutrophil 
differentiation in vitro. 

We now report the analysis of mRNA expression changes 
during the process of MPRO cell maturation to neutrophils and 
compare the results with a limited analysis of cellular protein 
composition. mRNA expression changes were studied by combin- 
ing the use of oligonucleotide arrays and DD. A database (dbMC) 
with comprehensive genomic information for myeloid differentia- 
tion program was constructed (accessible at http://www.bioinfo.mbb. 
yale.edu/expression/neutrophil). We have grouped the changes in 
mRNA levels of a large number of genes into 6 patterns, with 
implications for the genetic program of myeloid differentiation. 

We also compared 2-dimensionai high-resolution gel electro- 
phoretograms from control cells and cells differentiated for 72 
hours in the presence of ATRA. Fifty protein spots whose relative 
intensity changed prominently during differentiation were exam- 
ined by mass spectrometry. The results suggest a poor correlation 
between mRNA expression and protein abundance, indicating 
that it may be difficult to extrapolate directly from individual 
mRNA changes to corresponding ones in protein levels (as 
estimated from 2DE). 



Materials and methods 

Cell lines 

MPRO cells and HM-5 cells provided by Dr Schickwann Tfeai (Fred 
Hutchinson Cancer Research Center, Seattle, WA)' 5 were used throughout 
the study. The cells proliferated continuously as a GM-CSF-dcpendent cell 
line at 37" C in Tscoves modified Dutbecco medium (Gibco BRL, Grand 
Island, NY) supplemented with 5% to 10% fetal calf serum (Gibco BRL) 
and 10% HM-5 -conditioned medium as a source of GM-CSV. Morphologic 
differentiation of the blocked MPRO promyelocytes was induced by 
treatment with 10 u,M ATRA (Sigma, St Louis, MO). Controls were 
cultured in the absence of ATRA but with the same volume of ve- 
hicle Methanol). 

RNA Isolation and differential display 

After exposure to 10 u,M ATRA for 0, 24, 48, or 12 hours, total cellular 
RNA was isolated from MPRO cells using TRIzol reagent (Life Technolo- 
gies, Gaithersburg, MD). cDNA was then synthesized using a T-7 Sal-Oligo 
d(T) 32 primer as described previously."*'* The double-stranded cDNA was 
digesKd with 1 of 9 different restriction enzymes (Apal, BgM y BamHU 
Eagl, EcoRl, Hindll^ XhcA, Kpnl, and SphV) and ligated to Y-shaped 
adaptors with a complementary overhang. DMA fragments were then 
amplified by PCR as described previously.*' K PCR produce were separated 
on a sequencing gel of 6% polyacrylamide with 7 M urea, lhe gel was dried 
and exposed to x-ray film. Genes from differential display gels, whose 
maximum intensity changes equaled 2+ on a scale of 1 1 to 8 + , were 
recorded as significantly changed. 1 * Individual DNA bands were recovered 
from the gels, amplified by PCR, and sequenced. 

Oligonucleotide chip analysis of RNA samples 

Ten micrograms total RNA from each sample (0, 24, 48, or 72 hours) was 
used U> prepare cDNA. This cDNA was transcribed with T7 RNA 
polymerase to prepare a fluorcsccntly labeled probc. i(ul Each sample was 
hybridized to mouse urruy chip (Mul IK Array; Aflymctrfrc, Santa Clara, 
CA) containing oligonucleotide probe sets cor responding to approximately 
7000 known genes or ESTs represented by UniGcnc clusters. 22 cDNAs 
were considered present it" their probe set results were rated as such by the 
GeneChip software ( Asymetrix) and if the average difference (AD) 
between perfect match and mismatch probe pairs was not less 100 U. If a 



gene was represented by more than one array probe set, the average of all 
probe seta for the gene was taken. Genes with AD values between 100 and 
200 were considered unchanged because of their low expression levels. 
Those genes with AD value* equal to or more than 200 U at one time point 
were further studied by reseating, threshold, aud normalization methods 
described in the MIT Center for Genome Research Web site. 1 3 A value of 20 
was assigned to any gene with an AD below 20 at seme time point. 

BloinformaticG and database development 

All the sequences or gene fragments were searched using Blast against 
GenBank and TIGR gene indices. A database of genes or ESTfc whose 
expression levels changed during myeluid differentiation was constructed 
containing information for each band or genc. This included GenBank 
matches. Locus Unlc or Unigcne clusters, expression patterns, tissue 
distribution, synonym(s) protein name, gene name(s), notations of possible 
functions, poly A signal and sequence quality, and hyperlinks to the 
database searches, sequence trace files, and related references. All gene data 
were then gathered into a cluster tile. Supplementary information is 
available at http://bfoinfo.mbb.yate.edu/expression/neuUophil. 

Classification and analysis of DN A fragments 

Sequences from differential display analyses were classified as representing 
known genes, ESTs, genomic sequences, or novel genes as described. 
Known genes from both differential display and arrays were clustered into 
27 functional categories aud searched against SWISS-PROT (httpj/ 
www.expasy.cbr.nrc.ca/cgi-biii/sprot-search-ful) or P1R (http://www.pir. 
georgctown.edu/). Information such as function, subcellular location, 
family and super family clarification, map position, similarity, syniwym(s) 
protein name, gene name(s), and so on was recorded in a variety 
of databases. 

Northern blot analysis 

Thirty micrograms total cellular RNA per lane frwm time-course MPRO 
cells were loaded onto 1.2% formaldehyde-agarose gels, then transferred to 
Hybond-N+ membranes (Amersham Pharmacia Biotech, Uppsala, Swe- 
den). After standard prchybridization, membranes were hybridized over- 
night at 65°C with radiolabeled cDNA probes (ordered from Research 
Genetics according to their dbEST Tmagc ID). Membranes were washed at a 
final stringency of 60°C in 0. t x SSC. 

Immobilized pH gradient 2-dimenslonal gel electrophoresis 
and mass spectrometry 

fnduccd MPRO cells collected at 0 and 72 hours were lyscd with lysis 
buffer (S40 mg urea, 20 my Aithinthrcitol, 20 (iLPharmaJyu: [3-10], 1 .4 mg 
phenyhnethylsulfonyl fluoride, 1 u.g each aprotinin, leupeprin, pepsiatin A, 
and anttpain 50 itg TLCK, and 100 p.g TPCK/i mL). Wc applied (00 u,L 
each MPRO cell rysate (2.5 X lO^ceUs/iOO u,L) to immobilized pH 
gradient (IPG) suips (pH 3-10 L; Amersham Pharmacia Biotech), and IPG 
electrophoresis was conducted for 16 hours (20 100 Vb) using an immobi- 
linc Drystrip Kit (Amcrsham Pharmacia Biotech). Electrophoresis in the 
second dimension was carried out in a 12% sodium dodecyl sulfate- 
polyacrylamide gel electrophoresis (SDS-PAGE) gel with the Laemmli- 
SDS continuous system m a Protean II xi 2-0 cell (Bio-Rad) run at 40 mA 
constant current for 4.5 hours. Proteins were detected by Brilliant Blue 
G-colloidal staining." Protein spots were excised from the gel and digested 
with trypsin. ACTH clip (average [M.+l{\ 2466.70) and bradykinin 
(average {M + H] 1061.23) were used for ca libra! ion uf peptide masses. One 
microliter sample digest was mixed with 1.0 *lL a-cyano-4 -hydroxy 
cinnamic acid (4.5 mg/mLin 50°/ 0 CFTjCN. O.OSliTFA) matrix Kolurion and 
1 tiL calibrants (100 miol) each. The spectra of the peptides were acquired 
in reflector/delayed extraction mode on a Voyagcr-DE STR mass spectrom- 
eter (Perseptive Biosystems, Foster City, CA). Peptides were identified 
using the ProFound search engine. - w 
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Results 

Differentiation of MPRO cells 

Figure t illustrates the morphologic changes in an MPRO cell 
population representative of those used for RNA expression 
analysis. Undifferentiated MPRO cells resembled promyelocytes 
under the light microscope (Figure I A). After induction withATRA 
for 24 hours, the cells morphologically differentiated into metamy- 
elocytes (Figure IB), At 48 hours, the cells further developed 
into metamyelocytes and band neutrophils (Figure IC). At 72 
hours, nearly 100% of MPRO cells became mature neutrophils 
(Figure ID). 

Identification of mRNAs by differential display assay 

MPRO cellular mRNA was analyzed at 0, 24, 48, and 72 hours after 
ATRA treatment. Nine restriction enzymes were used in a 3 '-end 
DD approach. During MPRO differentiation, 1109 fragments 
corresponding to 837 transcripts were found to change substan- 
tially in expression levels (Figure 2). These represented approxi- 
mately 279 known genes, 112 ESTs, and 59 putative new genes, 
each with a perfect or fair polyadenylation signal at an appropriate 
distance from the otigo-dT priming site. The gene information 
detected by DD was collected in database dbMCd. 

Identification of mRNAs by oligonucleotide chip assay 

We used an oligonucleotide chip containing 13 179 probe sets 
corresponding to approximately 7000 murine genes to analyze 
patterns of mRNA expression in the same RNA samples used for 
DD. The information obtained by oligonucleotide arrays was 
collected in the database dbMCa. 

We clustered the genes by their similarity to idealized 
expression patterns. For instance, the expression pattern of an 
ideal gene that is ovcrexpresscd (high) at time 0 and un del ex- 
pressed (low) at 24, 48, and 72 hours, would be high-low-low- 
low (HLLL). Overall we have (2 4 -2) idealized patterns exclud- 
ing HHHH and LLLL. Pearson correlation was used as the 
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Figure 1. Morphology of MPRO celts during differentiation. MPRO cells wore 
induced as describud fn 'Materials and methods' concentrated by cytospln, and 
Wright-Giemsa stained. (A) Uninduced MPRO cells. (D) MPRO cells induced with 
ATRAfcr 24 hours. (C) MPRO cells Induced wRh ATRA for 4 8 hours. (D) MPRO cells 
Induced vrith ATRA for 72 hours. 



Figure 2. Distribution of genes obtained by OD essay. MPRO cell mRNA was 
analy2ed at 0. 24. 48, and 72 hours after ATRA treatment; 1109 fragments 
corresponding to 637 transcripts were found to change substantially in expression 
levels. The total 837 transcripts were classified into 6 categories according to the 
Woinformatic analysis. Percentages show the gene distributions in these 6 catego- 
ries. Information for each transcript was collected In database dbMCd. 



measure of similarity of each gene expression pattern, 
* = (x ir X2>x iw x 4 ) to each of the 14 idealized patterns 
)' = (yiJi&y*). The 4 entries of x and y corresponded to the 
4-dimensioual gene expression levels at 0, 24, 48, and 72 hours, 
respectively. Each gene was assigned to a cluster laheled hy the* 
idealized pattern that had the maximal correlation with that 
gene. We selected only genes that hybridized well compared 
with the background (considered "present" by GencChip soft- 
ware) and had maximal AD amplitude greater than 200 U in at 
least 1 of the 4 stages. We further tabulated the 14 patterns 
according to whether the gene expression changed at early 
C0-hour), intermediate (24- and 48-hour), and late (72-hour) 
time points and whether gene expression monotonically in- 
creased (up-regulated), rnouotomcally decreased (down-regn- 
latcd), or was not rnonotonic (transient). Tabic 1 shows 8 
clusters of 104 genes that had significant changes of mRNA 
levels, arranged according to the temporal stage and the 
mono tonic/transient changes of expression levels. 

Principal component analysis determined whether we could 
comprehensively present multidimensional data (4-dimcnsional in 
our case) in a simple 2-dimensional graph. First, we found the 4 
principal components, which were the axes of the most compact 
4-dimensional ellipsoid diat encompassed the 4-dimensional cloud 
of data. Each axis was a different linear combination of the original 
4 variables. Then we verified that the first 2 principal components 
(the first 2 largest axes of the ellipsoid) captured most (95.2%) of 
the variauon of the data. Therefore, the data could he faithfully 
projected (with a. minor loss of information) into a 2-dimerisionai 
graph, with the 2 largest principal components as the x- and y-axes. 
As shown in Figure 3, genes tend to coalesce in clusters, according 
to their labels determined by their similarity to an ideal expression 
pattern. In sunimary, a genomic' (global) picture of the distribution 
of genes according to their similarity to predetermined idealised 
multidimensional expression patterns is concisely displayed in a 
2-dimensional graph. 
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Table 1. Genes differently regulated during tho different stages of mouse promyelocyte ce ll line differentiation p 

, ' Timing 



Category 



Early 



Middle 



Up-regulation 


LHHH (n « 10) 


LLHH (n - 6) 


uaie 

ULH(n~ 13) 

H1a CsfTrli Os/SIOOaB L-CCR Oss 




KtftH P?rv\ ttriti'J IHrl I #-n*> /l.,,C 
'™u t *igD« ttin LCnc llpij 

Cebph H2-DEtohi6Zyx 
HLLL (n = 11) 

Tcrg-V4 LyG4 Ctsg Spi2-1 McptS 
Myc Myb Tlr4 Npml Erh Hsp60 


Pfral Cybb Pfc PiraS Cd53 Ifngr2 


Down-regulation 


HHLL (n - 1) 
Mpo 


AJdol Rac2 Fprl Ct«d Ubb Plmb4 
HHHL(n « 37) 

Adx W2 EL2 RpM 9 Antb Ly6« Atf 1 H1&I2 
PGma2 Gnas Zfp36 tHni LforSWdgl 
Max Rps8 Csf2rbl Slpi Td«1 Tpl 3tf3 
Cn/fGys3Slct0a1 asbSeppI Rm3 
Ccnb2 S10039 Cf 1 1 H)st$-2ax RHa 


Transient 




LLHL (n = 9) 

Seil K7I2 PliaB Plrb Lstl Uf Sema4d SJa/6" Mtnp9 
LHHL(n» 17) 

Cebpa Lyze Fcgrt Arf5 Ump1 Siat3 Csf2ra Osl 
Actg Sfpil Gpx3 PtprcPrtn3 trfl RpsBkal 
Ltb4rMyln 


Copa Gstml Gnb2»re1 Gm RPL6 



" ■ w / \" • fy \<.inivui anufO-iiUUIJ, OTHJ UUB \l l-flOUT) SkuQ^S. 

AD indicates average difference; gene symbols ere expanded In an Appendix at the end of this article. 




Figure 3. Gens duster* In the fin* 2 principal component spaces. Principal 
component analysis allowed us to present Ihe multidimensional data (In mis case, 
4-dlmenslonal data of each gene expression pattern) In a simple 2-dlmenslona! 
graph. We derived the 4 principal components, which are a linear combination of the 
standardized expression Intensities (zero mean and unit variance) at 0, 24, 48. and 
72 hours. Tho first 2 principal components captured most of the variation of the data 
(approximately 85%). Therefore, the data can be displayed (wfth a minor loss of 
information) h a 2-dimensteriaJ graph. The first and second principal components, d and 
C2, aiB gtven by the Rnear combtarJons o, = 0.747 • n1 - 0.11 - n2 ~ 0.658 * n3 + 0 ■ 
n4 and Oj = 0278 - n1 + 0.353 ■ n2 +■ 0.233 - o3 - 0.B63 - n4, where n1, n2, n3, 
and n4 are the reseated and standardized expression levels at Q, 24, 48, and 72 
hours, respectively. The exes legends d and c2 stand for the first 2 principal 
components. In this paper we used the Pearson correlation to measure the similarity 
of each gene with the idealized expression patterns, as opposed to the Euclidean 
distance we used In a previous wont," because dusters were better separated using 
this measure. In both cases, we presented the data In the 2-d!menstonaf space of the 
lowest principal components. The data had a tendency to be drcularly distributed 
when we used the Pearson correlation as a distance measure. 



Correlation between array and DD analyses 

We have previously demonstrated a correlation coefficient of 0.93 
between visual estimates of changes in band intensity on DD and 
Phosphorimager System (Molecular Dynamics, Sunnyvale, CA) 
estimates of band intensity and a correlation coefficient of 0.88 
between hybridization intensity changes of mRNA on Northern 
blot analyses and changes in band intensity on DD. 1 * In a few cases 
there were clear discrepancies in the pattern of expression of a 
gene, as estimated by DD and by oligonucleotide chip analysis. We 
chose the 6 most extreme cases and examined the levels of mRNA 
change for these genes by Northern blot analysis (Figure 4). In 5 
cases, the Northern blot results agreed with the results of the DD 
analysis, whereas the results of Gnb2-rsl disagreed with the 
oligonucleotide array but duplicate bands from DD showed a 
relatively high level of expression in the 0 time sample thai did not 
correlate with the Northern blot (Table 2). One possible explana- 
tion for these findings was the change in the relative use of different 
polyadenylation sites after the addition of ATRA to the MPRO cells. 

Constructing a database for mRNA level changes during 
myeloid differentiation 

Based on the data obtained above, an in-house database (dbMC) 
was constructed that included 2 subdatabases, dbMCd and dbMCn, 
for collecting gene information from DD or oligonucleotide arrays, 
respectively. Each entry in dbMC is accompanied by a so-called 
executive summary. The linkage between dbMCd and dbMCa was 
established by UniGene ID and cluster ID. dbMC contains the 
temporal expression patterns of genes during the MPRO cell 
differentiation process, including not only products represented in 
public databases but also novel transcripts. 
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Figure 4. Northern blot analysis of selected mRNAa. Equivalent amounts of RNA 
from MPRO cells Induced byATRAst different time points (0 hour, ?A hours, 46 hours, 
and 72 hours) were resolved by formaldehyde-agarose gel electrophoresis, stained 
to verify the amount of loading. Eleven genes were separately probed on the RNA 
fiSere. The gene symbol of each probe was listed at the left of a related Northern blot 
result Detailed Information on these 11 probes was listed In Table 5. One of the 
RNA-btetted membrane photographs Is shown with methylene blue-stained 23S and 
18S RNA subunlts demonstrating the quality and quantity of RNA loaded In 
individual lanes. 



Analysts of gene expression patterns during MPRO 
differentiation 

Many of the genes identified in this study were found in myeloid 
cells or were implicated in myeloid development for the first time. 
We detected 8 cytokines 25 and chemoJcines whose mRNA levels 
changed more than 5-fold by arrays and 2-fold by DD during the 
maturation of MPRO cells (see our Web site, http://bioinfo.mbb. 

Table 2. Expression patterns of genes detected by Northern blot analysis 



yale.edvi/expression/neutrophil). Among these were 2 members of 
the CC chemokine family. Interleukm-la (IL-la) was up-regulated 
at the late stage of differentiation (LLLH pattern, Table i). 

mRNA for approximately 52 receptors was detected by one or 
the other method. A number of the receptors known to be present on 
mature neutrophils showed late induction of mRNA, and their 
levels of induction were high, indicating that the expression of 
these products is a prominent event late in neutrophil maturation 
(Table 3). Rarely was mRNA for receptors down- regulated, 
consistent with myeloid maturation being accompanied by increas- 
ing responsiveness of the cell to a variety of external stimuli. 

expression of mRNA for granule proteins 

Neutrophils contain several types of granules that develop at 
different stages of myeloid maturation. 3 ' 17 ' 26 Levels of mRNAs 
encoding secondary granule proteins, such as lactoferrin, increased 
as the cells matured (Table 4). The level of mRNA for Mmp9, 
reported as a tertiary granule protein, increased markedly between 
24 and 48 hours after the induction of differentiation, whereas 
mRNAs for secondary granule proteins either increased less 
markedly or showed a maximum increase by 24 hours. mRNAs for 
several primary granule constituents, such as myeloperoxidase and 
cathepsin G, were present in unstimulated cells and decreased as 
the cells matured. There was a discrepancy m the measurements of 
proteoglycan mRNA by DD and oligonucleotide chips, but North- 
ern blots showed that it reached a peak at 48 hours and then 
declined (Figure 4). Cathepsin D is reported as a primary granule 
protein, but its pattern of mRNA expression more closely re- 
sembled that of secondary granule constituents. In addition to 
known granule components, mRNAs for several other cathepsins 
were up-regulated during myeloid differentiation, in parallel with 
or later than the tertiary granule protein mRNAs. 

mRNAs for transcription factors 

Transcription factor genes, including several identified at the sites 
of consistent chromosome rearrangements in acute myeloid leuke- 
mia, have been implicated in normal myeloid differentiation and in 
the expression of neutrophil proteins. 2 ' 5 * 27 However comprehensive 
information concerning the expression of these transcription fac- 
tors during myeloid development is not readily available. There- 
fore, we compared gene names and identifiers in our databases to 
those of the transcription factor database Transfac (http:// 



Gene 
symbol 



Gone 
secession 



AD value by array 



Intensity by OD 



Oh 



24 h 



4Bh 



72 h 



Oh 



24 h 



Cebpa 


M62362 


33 


212 


182 


44 










Cebpb 


X62600 


390 


1248 


1380 


1S03 










Cehpd 


X61800 


157 


262 


168 


*30 










Cebpe 




















Myb 


M12848 


892 


356 


230 


435 










SipJ 


U73004 


617 


501 


783 


402 


1 


2 


3 


3 


Pro3 


W45K34 


153 


259 


339 


345 


5 


1 


1 


2 


Gnb2-rs1 


X75313 


4231 


3623 


3215 


3403 


4 


4 


i 


4 


Ly6e 


U0426B 


3061 


5391 


2844 


1262 


3 


2 


1 


1 


Up1 


M90316 


65 


376 


840 


28 


2 


3 


5 


6 


Actb 


X03765 


3095 


3588 


3976 


2434 


1 


2 


3 


2 



Gem symbol end gene accession refer to National Center for Biotechnology Information databases end, In particular, to Locus Link. AO vetue b the average Oiflerence In 
the vafce of hybridization intens&y between the eel of perfectly matched oBgonuc'eotides and the Bet of mismatched oligonucleotide in the oligonucleotide array 3and 
.ntensft.ee from DD were semiquaniified on a scale from 1 (-r Mo 8 + + + + + + +,. These estimates are shown as boldface numbers in this table.'* Doth AD value end 
•nlenslty of genes were studied at 4 time points corresponding to MPRO ceils induced for the Indicated times. 

DD Indicates differential display; MPRO, mouse prcmyelocytic cell itne; for gene symbols, see the Appendix at the end of this article. 
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Table 3. Receptors expressed during myeloid differentiation process 



Maximal fold change 



Gene symbol 


Gene accession 




AD value by array 




Oh 


24 h 


48 h 


72 h 


Bzrp 


D21207 


641 


658 


S81 


887 


Cmkar4 


X995M 


508 


447 


378 


684 


Cny 


M34173 


433 


384 


506 


506 


Csf2rh1 


M34397 


31 B 


345 


410 


241 


HtrSa 


218273 


186 


272 


273 


339 


M6pr 


X 64068 


536 


409 


408 


649 


MPPIR 


AA1 16789 


232 


64 


63 


<3Dl 


TCRGB 


M28053 


165 


212 


244 


299 


Trtfrsfla 


M59377 


0 


1 


1 


1 


Cmkbrl 


U28404 


221 


244 


504 


638 


Crhr 


X72305 


121 


200 


250 


355 


Csf2ra 


MB5078 


171 


372 


402 


254 


l=M3 


AF013114 


187 


270 


428 


140 


Gfidl 


D10171 


128 


164 


150 


257 


(fngr 


J05265 


141 


263 


327 


251 


Ii2rg 


U21795 


205 


184 


231 


477 


Ldir 


X 644 14 


1399 


1653 


1665 


3968 


P40-8 


J02870 


649 


677 


3B1 


640 


Ptaur 


X62701 


312 


443 


476 


734 


Rorg 


M34476 


102 


113 


114 


218 


Srbi 


U37799 


126 


232 


132 


258 


Cr2 


M29281 


83 


138 


243 


77 


Csf2rb2 


M29B55 


203 


249 


437 


111 


Fcerlg 


J05020 


2398 


2766 


3365 


8751 


Fcgr2b 


X04648 


1703 


1652 


1431 


4605 


tfngr2 


U69599 


1 


2 


2 


3 


Nr4e1 


XI 6995 


96 


168 


202 


401 


I11r2 


X5B769 


482 


1796 


2872 


3616 


C5r1 


LQ5630 


185 


434 


60B 


1078 


Drd2 


X55674 


0 


0 


0 


219 


Fcgr3 


M14215 


1 


1 


1 


Z 


Fprl 


L22161 


0 


69 


141 


671 


GCR 


AA240711 


2 


0 


0 


0 


L-CCR 


AA034646 


48 


»75 


314 


2056 


NMDARGB 


AAB20211 


2 


2 


0 


0 


P2rx1 


X64B9B 


79 


346 


530 


744 


Plral 


U96682 


0 


43 


172 


378 


Pira5 


U96666 


274 


391 


954 


1874 


PiraO 


U9G687 


122 


635 


2014 


1716 


Ptrb 


U36689 


191 


445 


966 


747 


Sell 


M25324 


46 


104 


570 


20 


. Tcna-V4 


M54998 


1650 


78 


65 


315 



2 or more, less than 3 



3 or more, less than 4 



4 or more, less than 5 

5 or more 



R^ptcre are identified as present whose maximal AD values were more than or equal to 200 U in thiG study. Genes were sorted by their expression patlrms as follows: 
Irsir^eavemged^ 

Center fori Biotechnology Inform*™ database where available. Numbers In bold denote those gene expression patterns obtained by differential display JZttoZZ 
ofigonucleoude array assays. The other information is presented as in the legend to Table 2. ^ 
AD Indicates average difference; gene symbols ore expanded in an Appendix at the end of this article. 



www.transfac.gbt-braiinscliweig.de/TRANSFAC) and determined 
which factors contained in this database were present at detectable 
levels in MPRO cell mRNA, using Affymctrix software for the 
criteria for inclusion of mRNAs from approximately 200 murine 
transcription factors probe sets on the oligonucleotide chip. Of 
these, 5-1 were expressed and 1 3 showed changes of 3-fold or more 
in chip signal (Table 5). 

The changes in certain transcription factors, such as the moderate 
down-regulation of myh and myc and the up-regulation of the Max 
dimerizalion protein MAD, were consistent with the shift of the cells 



from a proliferative to a dtferentiated state. 2 * Some changes are more 
difficult to explain, such as the up-regulation of DPI, a partner for E2f 
factors in the regulation of S-phase genes, and the mild up-regulation of 
the Id genes, commonly associated with an inhibition of differentiation 
by competition with bHLH traascriptianal activators. 2 * 

The C/EBP family has been extensively studied with respect to 
myeloid differentiation. wo Absolute levels of the C/EBP a and 8 
mRNAs were low, probably at the borderline of significance for the 
oligonucleotide chip assay, whereas the level of C/EBP P appeared 
higher. In addition, there were discrepancies between the chip 
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Table 4. Granule constituents axpnissed during mouse promyelocyte cell line cell differentiation 



Granule constituent 



Gene symbol 



Gene accession 



AD value by array 



24 h 



Azurophil (primary) granulns 



Possible granule proteins 



Specific secondary granules 



Tertiary granules 



Man2c1 
Ctab 
Ctsd 
Cteg 

ei2 

E1a2 

Gus-s 

Lyzs 

McptB 

Mpo 

Prg 
asc 

Ctse 
Ctsh 
Ctsl 
Ctss 

Cpa3 

Cd36l2 

Ctitp 

Cybb 

Eor2 

Fpr1 

Itgb2 

Lcn2 

Ltf 

M3P 

Mmp13 

nop 

Mmp9 



AA16ie69 

M65270 

X528B6 

M96601 

U04362 

AA6890ie 

M63836 

M2 1050 

X7S545 

X15378 

X16133 

AA144BB7 

X97399 

U06119 

X06086 

AA089333 

J05118 
A80085S3 
X94353 
U43384 

L221B1 
X148S1 
W1316B 
J03298 
W45B34 
X66473 
1.37297 

227231 



178 
442 
214 
1509 
658 
47 
544 
0 
831 
378B 
2B21 

252 
1 

45 
16 



621 
113 



178 



10 



2661 



134 
480 

1087 
405 

1273 
159 
226 
1 

258 
3009 
2653 

194 
3 
124 



270 
93 
479 



220 



3S13 
162 
1 
43 
4782 



595 
1828 
46 
B43 
134 
266 
1 
66 
776 
2920 



195 
31 



90 
157 
704 



235 



3931 
333 



72 
2311 



164 

3B9 
2784 
286 
157 
163 
254 
3 
491 
692 
9B59 

576 
5 
156 
237 
463 

601 
187 
626 
128 
2 

645 
2 

6036 
13B 
2 

176 
69)2 



AO .ndicates average drfference; gene symbols are expanded in an Appendix at tfm end of this article. 3 



estimates and the mRNA levels observed by Northern blotting with 
specific probes for these genes. In particular, the latter method, 
more sensitive and specific, showed that C/EBP a began to decline in 
the most mature cells, whereas C/EBP 8 mRNA declined progressively 
beginning at 21 hours after (he onset of differentiation. 

C/EBP e is a more recently cloned C/EBP family member. Previous 
studies indicated it is expressed in a large array of human leukemia cell 
lines blocked at various stages of differentiation and that it is up- 
regulated during granulocytic differentiation. 31 A C/EBP c probe was 
not included in the oligonucleotide chips, and this mRNA was not 
detected by DD. Therefore, we examined the C/EBP c expression 
patterns by quantitative PCR and Northern blot analysis (Figure 4). 
C/EBP € exon I was PCR amplified from MPRO RNAs using primers 
RY48 (AQXCCOGACA<XXXiYjATGA) and RY49 (TGGCACACT- 
GCGGGCAGACAG). 32 The results showed that C/EBP e is expressed 
(hroughoul myeloid differentiation, with expression levels increased 
moderately in the later stages. 

We detected a number of other transcription factors that are 
broadly expressed or that have been reported in other studies of 
hematopoiesis (Table 5). Some of the factors that were most 
strongly induced during differentiation have been studied in other 
contexts but not previously implicated in hematopoiesis, such as n 
mammalian homologue to the Drosophila enhancer of split gene, a 
transcriptional silencer. The mammalian gene is expressed at 
relatively high levels as measured by the oligonucleotide chip and 



is a candidate for mediation of the silencing of growth-related 
genes in the tmtfuring neutrophil. Another candidate transcriptional 
silencer, Tiflb, may serve as a corepressor for the KRAB domain 
fomily of zinc finger transcription factors and also may mediate 
binding of the heterochromatin protein HP! to DNA. 33 

There were 26 transcription factors whose mRNAs showed no 
significant changes by oligonucleotide chip analysis and were not 
identified as differentially regulated genes by differential display 
assays. PU.l, a factor necessary for the production of neutrophils 
and the expression of several neutrophil genes,** showed less than a 
3-fold increase in mRNA, below the threshold for a significant 
change. Other candidate hematopoietic transcription (actors such 
as PEBPUB2 (AML1). GATA-1, and SP-2, were represented on 
the oligonucleotide chips, but their mRNA levels were so low that 
they were reported as absent in this study. The possibility that small 
changes in the levels or ratios of some transcription factors could 
produce marked changes in transcription potentially limits the 
ability of data generated by present methods to explain transcrip- 
tional changes during differentiation. 

Protein expression patterns of MPRO cells during 
ATRA induction 

We visually compared the 2DE patterns from MPRO cells at the 
same time points used for mRNA analysis. In most cases the 
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TableS. Transcription modulators presented during myeloid differentiation 



Maximal fold change 



Gene symbol 



Gene accession 



AO value by array 



46 h 



2 or more, loss than 3 



3 or more, less than 4 



4 or more, less than S 



5 or more 



ZfpH-8 

Btf3 

Gata2 

Hmgl 

Idb1 

Max 

Nfatc2 

Pm1 

Rarg 

Reia 

Sox 15 

Ybx1 

Zfp162 

Cebpd 

Idb2 

Jundl 

Lyt1 

Nfe2 

Nfkbl 

Pbx1 

sfpM 

Trflb 

Trp53 

Usf2 

Ybx3 

2fp216 

lrf1 

KK 

Myb 

Stat3 

Tfdpl 

Cebpb 
Stral4 

Cebpa 

Grg 

Mad 

Myc 

Etohi6 

TBX1 



AB020542 

W13502 

AB000096 

J04179 

M31BB5 

M63903 

AA560093 

U33626 

M34478 

M61909 

W53527 

M62B57 

Y 12838 

X6180D 

M69293 

W29356 

X57687 

L09600 

L28117 

AF020106 

A34683 

U67303 

P10361 

U12283 

L35549 

AA510137 

M21QS5 

U2SC96 

M12848 

AA39K023 

Q08639 

X62600 
Y0783B 

M62362 
X73359 
XB31C6 
L00039 
W89667 
AA54222Q 



2630 
3 

562 
337 
455 
256 
2313 
173 
102 
297 
419 
643 
671 

157 
244 
1274 
399 
458 
953 
611 
375 ^ 
673 ' 
259 
120 
96 
62 

65 

62 
892 
464 
307 

390 
223 



0 
314 
169 

0 



Shown are the transcription factors identified os present by the oligonucleotide array analysis whose maximal 
sets was greater than or equal to 200 U In tNs study. Data are presented as described in the legend to Table 3 
lruficate s average differer.ee; gene symbols are expanded In an Appendix at the end of this article 



2969 
3 

770 
348 
787 
224 
3216 
2B1 
113 
260 
461 
469 
734 

262 
210 

2002 
342 
743 

2044 
303 
784 
659 
149 
185 
169 
151 

207 
66 
356 
1057 
560 

1248 
383 

212 
565 
111 
112 
366 
0 



2795 
2 

472 
177 
721 
312 
2396 
329 
114 
304 
484 
472 
720 

168 
310 
1434 
347 
1042 
1876 
345 
891 
420 
125 
285 
210 
204 

278 
246 
230 
1012 
505 

1380 
510 

1B2 
916 
167, 
62 
313 
1 



2515 
1 

730 
232 
637 
172 
2542 
306 
21S 
244 
637 



430 
604 

3085 
«91 
505 

2034 
212 
529 
663 
361 
192 
119 
106 

198 
77 
435 
280 
1093 

1903 
935 



1005 
327 
173 

1003 
2 



AD between perfect match and mismaich oligonucleotide * 



peptides identified for a given protein weie deri ved from regions 
along the entire length of the protein, indicating the observed 
products were not the result of proteolytic degradation. These 
data must be considered with several caveats: membrane and 
other hydrophobic proteins and very basic proteins are not well 
displayed by the standard 2DE approach, and proteins present at 
low levels will be missed. 35 In addition, to simplify MS analysis, 
we used a Coomassic dye stain rather than silver to visualize 
proteins, and this decreased the sensitivity of detection of minor 
proteins. The MS method we used was sufficiently sensitive to 
identify proteins that could barely be visualized by colloidal 
blue staining. However, a limitation of the method foT the mouse 
is that the current database lacks predicted amino acid sequences 
for a substantial fraction of murine genes. In addition, very 
small proteins give only a few peptides, making statistically 
confident identification difficult. 



Figure 5 shows the analytical colloidal blue-stained 2DE IPO 
reference maps of diflerentiated MPRO cells. Expression patterns 
of more than 500 protein spots were detected and observed through 
the entire series of gels. Protein spots could easily be cross- 
matched to each other, indicating the reproducibility of the method. 
As marked on the gel pictures (Figure 5), 50 proteins with a wide 
range of molecular weights ( I to 200 kd). isoelectric points (4 to 9), 
and abundances were subjected to MS protein identification. The 
results are presented in Table 6. 

Comparing the theoretical value of the molecular weight and pi 
of each protein to that of the observed value, we confidently 
identified 28 proteins in the expected position on the gels (spots I to 
28). Some of the other proteins with strong matches to the murine 
databases migrated to a somewhat unexpected pi position. Nine 
spots gave clear peptide peaks on mass spectroscopy but did not 
match any known gene. Their identification will require amino acid 
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Figure S. 2DE elect rophorctog aims of MPRO cells. 
MPRO cell lysate (2.5 X 10° cell/sample) was loaded for 
2DE analysis. Gcte were stained with brilliant blue G-col- 
loicia! dye. (A) 2DE map of uninduced MPRO cell (0 hour). 
(B) 2DE map of matured MPRO cells (72 hours). Protein 
spots marked In the rrtaps were considered differentially 
exprefised and were subjected to MS analysts. The 
resutlant protein information is listed in Table 6. 



IPGs 
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sequence analysis or availability of more extensive murine data- 
bases. We searched for the expression patterns of the genes cognate 
to the expressed proteins in dbMC (Table 6). Nineteen genes were 
found in dbMC, the mRNA for 5 genes was reported as absent, and 
13 genes were present during NfPRO differentiation. Comparison 
of the expression patterns showed only 4 genes of IS present on the 
oligonucleotide chips whose expression was consistent at the RNA 
level and protein level. None of these was on the list of the genes 

Table 6. Correlation of expression patterns between mRNA level and protein level 



that were differentially expressed significantly (5-fold or greater 
change by array or 2-fold or greater change by DD). 



Discussion 



We explored the temporal patterns of gene expression during 
myeloid development A database has been developed to provide a 



Spot 



Protein definition 



Gi number 


Predicted 
value 


Percentage 
<%> 


2DE pattern 


cOMA expression 
pattern 


A(J 


kd 


Pi 


Oh 


72 h 


Oh 


72 h 


2505545 


72.4 


5.1 




1 


3 


1321 


1043.3 


N 


6752954 


41.77 


5.3 


40 


3 


6 


0 


2 


Y 


2494703 


22.63 


4.3 


. 33 


3 


3 


341 


441.6 


Y 


7242171 


26.77 


4.7 


42 


1 


0 


544 


430.9 


Y 


403&346 


69.B 


7.1 


24 


2 


1 


43 


50.7 


N 


6755074 


57.9 


7.2 


4B 


6 


4 


3047 


5seo.3 


N 


6671509 


41.72 


53 


39 


1 


3 


2539 


341.3 


N 


6379937 


35.79 


B.7 


39 


8 


7 


3073 


5742.3 


N 


461911 


10.99 


5.9 


48 


O 


4 


N/A 


N/A 




66B0047 


35.06 


7.9 


21 


4 


2 


139 


303.1 


N 


6678413 


26.69 


6.9 


26 


3 


3 


3312 


2650.1 


Y 


1196524 


17.19 


7 


61 


2 


3 


152 


126.9 


N 


7949121 


16.59 


68 


25 


1 


0 


626 


812.4 


N 


6681019 


62.16 


6.4 


36 


2 


0 


Absent 


Absent 


N 


220474 


47.52 


6.6 


35 


2 


0 


Absent 


Absent 


N 


52783 


35.62 


4.8 


29 


3 


0 


Absent 


Absent 


N 


5931565 


31.3 


6.7 


30 


1 


2 


Absent 


Absent 


N 


316B662 


26.6 


7.1 


39 


1 


2 


N/A 


N/A 




739346 


6.04 


6.4 


68 


0 


2 


Absent 


Absent . 


N 


5932010 


158.7 


6 


17 


1 


0 


N/A 


N/A 




675501 B 


74.46 


7.2 


21 


1 


3 


N/A 


N/A 




3212625 


66.45 


5.7 


24 


0 


6 


N/A 


N/A 




6576815 


62.53 


7.2 


33 


2 


1 


N/A 


N/A 




1730203 


48.22 


7,2 


31 


3 


1 


N/A 


N/A 




1730519 


44.54 


6.3 


47 


S 


4 


1083 


1402.3 


N 


6754976 


22.16 


8.6 


53 


3 


1 


N/A 


N/A 




3913065 


17 


7.6 


55 


0 


3 


N/A 


N/A 




2137430 


12.1 


5.2 


38 


0 


1 


N/A 


N/A 





1 GRP76 

2 Art In, gamma, cytoplasmic 

3 RHO GDI 2 

4 Proliferating cell nuclear antigen 

5 APS kinase 

6 Pyruvate kinase 3 

7 Melanoma X-actin 

6 Gtyceraidehyde-3-phosphate dehydrogenase 

9 Stefln 3 

10 Guanine nucleotide binding protein, beta -2, 

related sequence 1 

11 Trbsephosphale iscmerase 

12 Testis -derived c-abl protein 

13 RNA binding motif protein 3 

14 Coliapsln response mediator 

15 Lamln A 

16 47-kd keratin 

17 sid476p 

18 MHC class II H 2- 1 A- beta- 5 

19 Ar.drogeobinding protein: subunH alpha 

20 Neuronal apoptcsis inhibitory protein 

21 PAD type fV 

22 Human serum albumin homologcue 

23 syncrip 

24 Trsnsamidinace 

25 PGK crigr phosphogiycerele 

26 Proliferation-associated gene A 

27 Puletuve peroxisomal antioxidant enzyme 

28 IgE chain C2 region 



The proteins listed here are represented by the spots marked in the electrophoretograms shown in Figure 5 

Protein definition , GI number, and predicted value refer to the protein name, accession number, and properties derived from the National Center for Biotechnology 
Information database. The column labeled % shows the porrentage of peptides predicted from the protein sequence that were dctodoTby r^ss icZI/S 
i m I! !LT£? ISIF*? T" «■ illduced ** °^™retinote «* foro hoursand 72 hour, (Figure 5) were scored on a s4e of 

th ^hiun t" h pattern column. The cDNA expression patterns of the cognate mRNAs are listed in the cDNA expression pattern column abstracte^Jfrxtm 

the dbMC database. The ger^ not represented or, the ongonudeolWe array* were 

Y ■nd.cates agreement and N dKcrcpancy between changes m cDNA and protein spot intensity. The numbers in bold were obtained with OD. 2DE indicates Mmon tonal 
gel electrophoresis; IgE, immunoglobulin E; DD, differentia! display. woicaies z-oimonsionaf 
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reference for later research on the molecular mechanisms underly- 
ing normal myeloid development. 

The MPRO cell system morphologically mimics normal myeloid 
differentiation and biochemically proceeds further toward mature neutro- 
phils than most other in vitro systems. Because the arrest in differentia- 
tion of MPRO cells growing in the absence of ATRA is not physiologic, 
there is a theoretical risk that gene expression in these ceils is not 
coordinated in the way that it is in normal differentiation. It Is 
encouraging that, for the most part, the timing of expression of genes for 
proteins of the various neutrophil granules is consistent with the timing 
of mc morphologic and biochemical appearance of these granule 
components during norma] myeloid differentiation 

The DD technique provides certain advantages for detecting 
and comparing mRNA levels in different samples. First, the method 
is, in principle, similar to competitive RT-PCR, and, with the use of 
stringent PCR conditions, is expected to be about as reliable. 
Second, display patterns are reproducible. Third, the method 
detects the levels not only of RNAs already represented in the 
database but also of unknown RNA species that may represent 
'"new" genes. Fourth, closely related genes can be distinguished 
regardless of cross-hybridization, provided there are some single 
nucleotide differences in the 3' end sequence. Limitations associ- 
ated with this technique are that numerous gels are necessary to get 
complete information and that comparison of the levels of different 
. rnRNAs is only approximate because of the differential amplica- 
tion of bands of different size or sequence. 

Oligonucleotide chip analysis is a fast and effective means of 
accessing mRNA expression patterns. 20 Cluster analysis of groups 
of samples by this approach is effective. However, the present 
results indicate that alternative methods of verification are desir- 
able before the data on an unexpected change in a particular gene 
are definitively accepted. 

To obtain the broadest range of information from the myeloid 
differentiation process, both differential display and oligonucleotide 
chip techniques were applied in the current study. As a result, 65.3% of 
the observed changes in mRNA levels came from the differential display 
method and 41 .5% came from oligonucleotide chip assays. 

Our data showed in general that changes in expression pattern 
by the 2 methods agreed qualitatively but that there was some 
quantitative variation. Our results indicate that DD may be a more 
accurate way to detect changes in levels of gene expression than the 
oligonucleotide chip assay. However, improvements in the types of 
oligonucleotides used in arrays may close this gap in the future. 

The rnRNAs for a limited number of transcription factors vary in a 
pattern correlating with that of the rnRNAs for primary or secondary 
granule proteins. However, more detailed infonnation is needed, and the 
underlying mechanisms of granule gene regulation remain unclear. The 
number of potential positive and negative regulatory factors found here 
is sufficiently small as to make it feasible to perform in vivo studies, 
such as chromatin immimoprecipitation. 

The oligonucleotide chip used in this study focused on known 
genes, whereas the DD method samples ail polyadenylated tran- 
scripts. The latter method generated a large number of products not 
associated with known genes, in part because the mouse genome is 
not as well represented in the database as the human genome. 
However, our experience with DD and human rnRNAs indicates 
that substantia] fractions of the products represented as ESTs or not 
represented at all in the public databases are cDNA copies from 
introns, hnRNA, or other RNA with internal A runs. 

Approximately 59 sequences obtained from gel-display bands 
had significant changes in the level of expression and a sequence 
that did not match that for any named gene in the public databases. 



Of these, 38 had plausible or excellent polyA signals. This is only 
an approximate estimate of the number of new genes found 3 * 5 
because a fraction of the rnRNAs for known genes still had poor 
polyA signals. In addition, the full 3' untranslated region is often 
not known for characterized genes, and in some cases these new 
genes may prove to be identical to products identified by the 
oligonucleotide chips when more complete sequences are obtained. 
At the least, their presence indicates that a substantial fraction of 
the regulatory or functional circuitry of maturing myeloid cells 
remains unexplored and that valuable tools for their investigation 
will emerge from a combination of RNA expression studies and 
analysis of emerging genomic sequences. 

The desired end point for the description of gene expression in a 
biologic system is not only the analysis of mRNA transcript levels 
but also the accurate measurement of protein abundance. The 
developments in 2DE and new MS instrumentation make it 
possible to accomplish this work rapidly and efficiently. In this 
study, we attempted to identify a number of the proteins differen- 
tially expressed between uninduced and ATRA-differentiated MPRO 
cells and to examine the relation between mRNA and protein expression 
levels for these genes representing the same state. 

For protein levels based on estimated intensity of Coomassie dye 
staining in 2DE, there was poor correlation between changes in mRNA 
levels and estimated protein levels. Other groups have studied the 
correlation between mRNA and protein levels in yeast and liver 
cells. 11 * 12 - 14 In the liver cell (sxperiments, 11 - 12 correlation coefficients of 
0.4 to less than 0.5 were observed. In an extensive study in yeast," -12 the 
correlation coefficient was high if the most abundant rnRNAs and 
proteins were considered. If a handful of these products was omitted, the 
remaining correlation coefficient was 0.4 or less. However, one 
could restore some of the correlation by averaging individual 
data points into broad protcomic categories.- 17 

The discrepancies between mRNA and protein levels in MPRO cells 
appear to be substantially larger than those observed for yeast Possible 
causes for the discrepancies include translational regulation, differential 
expression of certain rnRNAs at various stages of cell growth m vitro, 
post-translational protein modification that varies with the stage of 
maturation of the cells, and selective degradation or excretion of proteins 
in vivo. Finthermore, here we are focusing on a developmental 
time-course, whereas the yeast study concentrated on the organism in 
vegetative growth. New techniques, equipment, and bioinformatic 
analysis tools must be developed to make such systematic, global, and 
quantitative analyses feasible. 

The initial studies of protein expression presented here provide a 
cautionary note for efforts to interpret cell composition and function in 
relation to mRNA levels. Discrepancies we observed between gene 
expression and protein abundance suggest that selective post-trans crip- 
tional controls may be at least as important as changes in mRNA levels 
in determining the protein composition of neutrophils and that they arc 
phenomena less well explored than transcriptional control. Analysis of 
mRNA expression patterns is itself only a small beginning toward a 
genomo-wide description of cellular components^ 
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Appendix ' 

Gene symbols used in tables: Actb: actin, beta, cytoplasmic; Actg: actin, gamma, 
cytoplasmic; Actx: melanoma X -actin; Aldol: aldolase 1. A isotbim; Arf5: 
ADP-ribosyiaiion factor 5; Atl'l: actteifog transcription lactor I; Atf2: activating 
traiisaiptiou factor 2; Bu3: basic transcr^ition factor 3a; Bap: paiphcral-type 
boozodiazepiiK receptor, C5ii : complement component 5, receptor 1/G protein- 
coupled receptor (C5a); Ccnb2: cyciin B2; Cd36l2: CD36 antigen (collagen type 
I receptor, thrum bos pondin receptor)-like 2; Ctl53: CD53 antigen; Cebpa: 
CEAAT/cnhancer binding protein CHIP, alpha; Ccbpb: CCAAT/Vnhanccr 
binding protein (C/EBP), beta: Cebpd: CCAAT/enhancer binding protein (C/ 
EBP), delta; Cebpc; CCAAT/enhancer binding protein (C/EBP), cpsilon; CB1: 
cofilin !,nonmuscic; Cmkai4: chemokine (C-X-C) receptor 4; Cmkbrl: chemo- 
kinc (C-C) leeeptor 1/NIipla lcceptor; CWp: cathclin like protein; Cutt*: ciliary 
neurotropic factui//inc finga protein PZF; Copa: coalomcr pax>tcin complex 
subunit alpha; C(w3: cmiiox>7wntidasc A3, mast cell; Cr2: complement rcccjrtor 
2; Ottr: unttcocroptn releasing hormone receptor. Crry: complement rcccpUT- 
rclauxl protein; Csflr GSF 1 (M-CSF) rcccptor/c-frns/CDI 15; Csf7.ra: CSF I 
(GM-CSF) receptor, alplia, luw-aiTini(>'/CDI 16; Csfirbl: CSF 2 (GM-CSF) 
reccpfor, beta % low-atfimty/U. 3 receptor-like protein (AIC2ByCDwl31; 



Cst2rt>2: CSF 2 (GM-CSF") receptor, beta 1, low-arfiniry/TL-3 receptor (A1C2A); 
Ctsb: catbepsin B; Ctsc: cathepsin C; Cud: csthepsin i>, Ct&e: cothepsin L; Ctsg: 
cathepsin G; Ctsh: cathepsin H; Ctsl: caihepsin L; Oss: cathepsin S; Cybb: 
cytochrome b-245, beta; Did2: dopamine receiitoi' 2; E21T: H2I : transcription 
factor 1; Ear2: eoGuiophuVassociatcd hbonuclease 2; £bi3: lZpstein-Barr virus -- 
induced gene 3/cytokme receptor-like molecule (EBD); E12: Balb/c neutrophil 
elastase; FJa2: e1asta.se 2; Erh: enhancer of rudtmentary homofog (Drofophlla); 
Etoht6: ctlianol induced 6Vstcrol regulatory clement binding transcription factor I 
(SKEBFI) homolog; F2rl2: coagubu'on factor TT (dirombin) rcccptor-likc 2; 
Fcer lg: Fc receptor, IgE, higb ariinhy I, gamma porypeptidc; Fcgr2b: Fc receptor, 
IgG, low alliniry lib; Fcgr3: Fc receptor, IgG, low afliniry 111; Fprl: I'onnyl 
pcpddc receptor 1/1MLP receptor, Gabpbl: GA repeal binding protein (GABP- 
bctal subitait); Gata2: GAXA-binding potein 2; Gnas: guanine nucleotide 
binding protein, alpha stimulating; Gnb2-rs 1 : guanine nucleotide binding protein, 
bcta-2, related sequence 1; Gpx.l: gtutalhtunc peroxidase 3; Grg: rclulcit tn 
Dn>sophila grondto gene; flridh ghitumalc receptor clianncl subiinil dclfJ! I ; 
Gm: gntnulin; Gstml: gtuuithionc-S-inmsfcrasc, mu I; Gus-s: bcta-ghicuruni- 
dasc structural; Gys3: glycogen synthase 3, brain; H2-D; histoco.-urjalibilify 2, D 
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region locus 1; Hist2: bistone gene complex. 2; Hist5--2ax: H2A hist one family, 
member X; Hingi: high mobilhy grotip protein I; Hsp60: heat shock protein, 60 
kDa; Htr5a: 5-hydroxytryptamine (serotonin) receptor 5A; Idbl: inhibitor of 
DNA binding l/helix-loop-helix DNA binding protein regulator (Id); Idb2: 
inhibitor of DNA binding 2; Imgr. interferon gamma receptor; Ifhgr2: interferon 
gamma receptor 2; lir la-associated invariant chain; III a: 1LI alpha; If Lr2: DL 1 
receptor, type II; I12rg: IL2 receptor, gamma chain; IWra: IL4 receptor, alpha; 
111 Orb: ItJO receptor, beta; II llr IL17 receptor; Irfl: uucrfcron regulatory factor 
1; Ir£2: interferon regulatory faetor-2; Itgb2: integrinbeta2 (Cdl8); ItpfS: inositol 
l,4 ? 54risphrisphate receptor (type 2); Jundl: Jun proto-oncogcue-related gene 
dl/transcription factor JUN-D; KII2: Kruppel-like factor LKLF; L-CCR: Upopoly- 
saccharidc inducible C-C chemokine receptor-related; Lcn2: lipocalin 2; Ldtr: 
low density lipoprotein receptor; Lspl: Lymphocyte-specific l/S37/pp52; Lstl: 
leiicocyte-specific transcript 1 ; Ltb4r leukotriene B4 receptor; Lthr lymphotoxin- 
beta receptor; Ltf: l&ctotransfenin; L>64: lymphocyte entigen 64; Ly6e: lympho- 
cyte antigen 6 complex, locus E; LylJ : lymphohlatfomic leukemia/bHLH factor; 
Lyzs: lysozyme; M6pr. mannose-6-phosphate receptor, cation dependent; Mad: 
Max. dimenzatton protein; Man2c 1 : mannosidase, alpha, class 2C, member 1; 
Max: Max protein; Max MYC-associated zinc ringer protein (purine-binding 
transcription factor); MBP: eosinophil granule major bask protein precursor; 
McptS: mast cell protease 8; Mil: myeloid/lymphotd or nuxed-Uneage leukemia; 
Mmp 13: matrix. meuuJr/proteinase 1 3/collagcuase; Mmp9: matrix metaubpjotein- 
ase 9/gelatiuase B; Mpo: myeloperoxidase; Myb: myeloblastosis oncogene; 
Mybl2: myeloblastosis oncogene -like 2; Myc: myclocytomatosis oncogene; 
Myln: myosin light chain, alkali, nonmuscle; Nfatc2: nuclear feet or of activated T 
cells, cytoplasmic 2; Nfe2: nuclear factor, eiythrokt-rfenved 2, 45 kDa; Nfkbl : 
NF-Juppa-B (p 105); Ngp: neutrophilic granule protein; NMDRGB: N-methyl-D- 
aspariate receptor gliitamcte-Wnding chain homolog; Npml: nudeophosmui 1; 
Nr4al: nuclear receptor subfamily 4. group A, member I; Osi: oxidative stress 
induced; P2rxl: puriuergic receptor P2X, lig&nd-gated ion channel, 1; P2ry2: 
piirinergk receptor P2Y, G-protein-coupled 2; P40-8: P40-8, fundwnal/laminin 
receptor; Pbxl: pre B-cell leukemia transcription factor I; Pfc: properdin factor, 
complement; Piral: paired-Ig-like receptor A I; Pira5: paired-Ig-like receptor 



A5; Pira6: paired-Ig -like receptor A6; Pirb: paireoVIg-like receptor 0; Plaur: 
urokinase plasminogen activator receptor; PML putative receptor protein (SP: 
P 171 52 ); Pml: promyelocytic leukemia; Prg: proteoglycan, secretory granule; 
Pm3: proteoglycan 3/cosinophit major basic protein 2; Prtn3: proteinase 3; 
Psma2: proteasome (prosome, macropain) sub unit, elpha type 2; PtrnM: protfay- 
mosm beta 4; Ptprc; protein tyrosine phosphatase, receptor type, C; Rnc2: 
RAS-related C3 botulinum substrate 2; Rarg: retinok acid receptor, gamma; 
Rela: avian reuciuwrulotheUosw viral (v-rel) oncogene homolog A/NF-kappa-B 
p65; Rpli9: nbosomai protein L19; RPL8: ribosomal protein L8; Rps6kal: 
ribosomal protein S6 kinase polypeptide 1; Rps8: ru^somftl protein S8: Rin3: 
reticuion 3; S100a8: SI 00 calcium binding protein A8 (caigranulin A); S100a9: 
SI 00 caichtnvb hiding protein A9 (calgntuulm B); Sdfr2: stromal cell-derived 
tactor receptor 2; SelL selectin L (lymphocyte adhesion molecule I); Sema4d: 
semaphorin 4D; Seppl: sdenoprotein P, plasma, 1; Sfpil: SFFV pro viral 
integration 1; Shfdgl: split hand/foot deleted gene 1; Sic 10a 1: solute carrier 
family 10 (sodium/bile acid co transporter family), member 1; Slpi: secretory 
leukocyte protease inhibitor; Soxl5: SRY-box containing gene 15; Spi2-1: serine 
protease inhibitor 2-1; Srbl: scavenger receptor class B 1 ; Stal3: signal transducer 
and activator of transcription 3; Stat5a: signal transducer and activator of 
transcription 5A; Stat6: signal transducer and activator of transcription 6; StraI4: 
basic-helix-loop-he lix protein-retinoic acid induced; Thxi : TBX1 protein/LPS- 
induccd TNF-alpha factor homolog; Tcrgb: T-ccll -receptor germJine beta-chain 
gene constant region; Tcrg-V4: T-cell-feceptar gamma, variable 4; Tctexl: 
t-complex testis expressed I; Tfdpl: tnuiscription factor Dp I; TUlb: tmoscrip- 
tional intermediary feet or 1, beta; Tlr4: toll- like receptor 4; Tnfhfla: TNT 
receptor superfamily, member la; Tnftsflb: TNF superfamtty, member lb; 
Tbmm70a: transkK&se of outer mitochondrial membrane 70 (yeast) homolog A; 
Tpu trbsephosphate isomerase; 1>p53: transformation-related protein 53; Ubb: 
ubiquitin B: Usf2: upstream transcription factor 2; Yhxl: Y box transcription 
factor; Yhx3: Y box binding protein; Zfpl !-<>: zinc finger protein sll~6; Zfp IS: 
zinc finger protein 1 8 homolog; Zfp?6: zinc finger protein 36; Zfpl62: zinc finger 
protein 162; ZJp2l6: zinc finger protein 216; Zfpml: zinc finger protein, 
muttitype 1 ; Znfiil al : zinc finger protein, subfamily LA, 1 (fltaros); Zyx: zyxin 
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Bacterial lipopolysaccharide (LPS) evokes several 
functional responses in the neutrophil that contribute 
to innate immunity. Although certain responses, such as 
adhesion and synthesis of tumor necrosis factor-**, are 
inhibited by pretreatment with an inhibitor of p38 mi- 
togen-activated protein kinase, others, such as actin as- 
sembly, are unaffected* The aim of the present study was 
to investigate the changes in neutrophil gene transcrip- 
tion and protein expression following lipopolysaccha- 
ride exposure and to establish their dependence on p38 
signaling. Microarruy analysis indicated expression of 
13% of the 7070 Affymetrix gene set in nonstimulated 
neutrophils, and LPS up-regulation of 100 distinct 
genes, including cytokines and chemokines, signaling 
molecules, and regulators of transcription. Proteomic 
analysis yielded a separate list of up-regulated modula- 
tors of inflammation, signaling molecules, and cytoskel- 
etal proteins. Poor concordance between mRNA tran- 
script and protein expression changes was noted. 
Pretreatment with the p38 inhibitor SB203580 attenu- 
ated 23% of LPS-regu!atcd genes and 18% of LPS-regu- 
lated proteins by ^40%. This study indicates that p38 
plays a selective role in regulation of neutrophil tran- 
scripts and proteins following lipopolysaccharide expo- 
sure, clarifies that several of the effects of lipopolysac- 
charide are post-transcriptional and post-translational, 
and identifies several proteins not previously reported 
to be involved in the innate immune response. 



Lipopolysaccharide (LPS), 1 a component of the outer cell wall 
of Gram-negative bacteria, evokes a variety of functional re- 
sponses in the human neutrophil (PMN) after binding to a 
plasma membrane receptor complex that involves the Toll-like 
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receptors (TLRs) (1-5). These "immediate" functional re- 
sponses, including actin assembly, adhesion, activation of nu- 
clear factor-kappa B (NF-kB), and priming for an enhanced 
secretory response and for release of reactive oxygen interme- 
diates, appear to be central both to the innate immune re- 
sponse and to the pathogenesis of several inflammatory human 
diseases, including sepsis and the acute respiratory distress 
syndrome (6). p38 mitogen-activated protein kinase (p38 
MAPK) has been shown to mediate LPS-induced PMN adhe- 
sion, NF-kB activation, and TNF-a and ILt8 translation and 
release (7), and its blockade attenuates LPS-induced PMN 
accumulation in the airspace (8). However, other cascades al- 
most certainly lead to downstream effectors of the LPS signal; 
for example, actin assembly appears to be p38 MAPK-inde- 
pendent (9). An improved understanding of the transcriptional 
and translational responses of the neutrophil to LPS and the 
modulation of these responses by p38 MAPK might carry 
pathogenetic and therapeutic implications. 

Historically, it has been believed that the downstream PMN 
transcriptional response to LPS is static and that PMN func- 
tional response to LPS that depend on de novo protein syn- 
thesis are primarily limited to the release of cytokines (10). 
However, recent studies indicate a robust transcriptional re- 
sponse (11). To date, most studies have relied upon and re- 
ported a short list of functional assays of the LPS-exposed 
PMN; therefore, no exhaustive investigation of either the tran- 
scriptional response or protein synthetic repertoire of the PMN 
has been reported. Although several techniques have been used 
to evaluate transcripts, the screening of global changes in 
mRNA by microarruy analysis has only recently become possi- 
ble. In this way, thousands of genes can be screened in an 
unbiased fashion for transcript abundance. Such genomic* 
Bcrecns in mammalian cells have previously been applied to 
define altered expression profiles in response to agonists (12) 
and to drug action (13) and during cell cycle progression (14). 

Although DNA microarray technology is expected to provide 
insight into the response of the human PMN to LPS (15), 
inliibition of LPS-stimulated IL-1 and TNF-« production by 
p38 MAPK inhibitors in THP-1 cells (16) and of TNF-« synthe- 
sis in human PMNs (9) occurs at a translation^ level and 
would therefore not be detected by DNA microarrays. Further- 
more, in other systems, such as yeast and human liver, mRNA 
and protein levels show poor correlation (17, 18). Proteomics is 
a complementary tool for assessing global changes in cellular 
protein expression, thereby providing additional insight into 
cellular signal regulation. A proteomic approach has proven 
useful in different systems for dissecting signal transduction 
cascades and describing their output (19, 20) and has even 
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recently been used to detect novel upstream messengers in- 
volved in LPS signal transduction (21). We have npplied DMA 
microarrays and proteomicB to define and compare transcrip- 
tional and post-transcriptional alterations in the LPS-exposed 
PMN and to establish the dependence of these alterations on 
p38 MAPK signaling. 

EXPERIMENTAL PROCEDURES 

Afateria/.s^Endotoxin-free reagents and plastics wore used in all 
experiments, Aprotinin, leupeptin, AEBSF, E-64, pepstatin, and besta- 
tin protease inhibitors, spermine HC1, and a-cyano-4-hydroxycinnamic 
acid (CHCA) were all purchased from Sigma Chemical Co. (St. Louis, 
MO). SB203580, a p38 MAPK inhibitor, was purchased from Calbio- 
chcm-Novabiochem Corp. (San Diego, CA). For two-dimensional PAGE, 
rehydration buffer, equilibration bufFers, vertical electrophoresis solu- 
tions, and 10% homogeneous polyacryl amide slab gels were purchased 
from Genomic Solutions, Inc. (GSI, Ann Arbor, MI). Sequencing grade 
porcine trypsin was purcliased from Promega (Madison, WI). 

LPS Incubation— PW^s were isolated by the plasma Percoll method 
(22), a technique that yieLds less than 5% monocytic contamination, and 
resuspended at a concentration of 15.4 X 10°Anl in RPMI 1640 culture 
medium (BioWhittaker, Walkers ville, MD) supplemented with 10 mM 
HEPES (pH 7.6) ai\d 1% heat-inactivated platelet-poor plasma. After 
addition of 100 ng/ml Escherichia coli 0111:B4 LPS (List Biological), 
incubation was earned out with continuous rotation (4 h, 37 *C) both in 
the presence and absence of SB203580. Both Affymetrix analysis and 
proteomic analysis utili?.ed 75 X 10 6 celJa. For microarray analysis, 
nonstimulated and 4-h-treated PMNs were collected from Uu*ee sepa- 
rate donors. A more detailed time course following LPS exposure was 
performed using polymerase chain reaction. For proteomic analysis, 
LPS incubations from separate donors (n - 6) were performed and tlien 
analyzed individually. Control and post-LPS incubation PMNs were 
washed (0.34 M sucrose/1 mM EDTA/T0 mM Tria) and then tysed in a 
modified rehydration buffer (GSI, Ann Arbor, MI) supplemented with 2 
M thiourea, 50 mM ditbiothreitol (DTT), 22.5 mM spermine HCl, and a 
mixture of six protease inhibitors (10 n£/m\ aprotinin, 10 /tgfrnl leupep- 
tin, 2 mM AEBSF, 6 jiM E-64, 1 pepstatin, 10 u-M bestatin). DN A was 
pelleted by centrifugation at 260,000 X g for 60 min (23). 

Affymetrix Oligonucleotide Array— Five micrograms of total RNA was 
isolated with TRIzoI (Invitrogen) and RNeasy columns (Qiagen) and 
subsequently labeled with biotin as descril>ed by Affymetrix. Briefly, 
first-strand synthesis was accomplished with Superscript II reverse tran- 
scriptase (Invitrogen) using a T7-oligo(dT) 24 primer for 1 h at 42 a C 
followed by second-strand synthesis using B. coli DNA polymerase I and 
RNase H (faivitrogen) at 16 °C for 2 h. Double-stranded DNA was used as 
a template for in vitro transcription with T7 RNA polymerase in the 
presence of biotin-labeled UTP and CTP using the BioArray High Yield 
RNA transcript labeling kit (Enzo). Fifteen micrograms of cRNA was 
fragmented and used for hybridization to A/Iymetrix HuGene 6800FL 
Genechips. Each sample was hybridized initially using a Test2 Genechip 
to test for sample degradation and full-length in vitro translation. Data 
were analyzed using Asymetrix Genechip software. Results from three 
separate donoira were analyzed. 

Reverse Transcription and Polymerase Chain Reaction— cDNA was 
prepared by reverse transcription using 2 /ig total RNA, derived from 
20 X 10* cells that were treated as indicated. Polymerase chain reac- 
tions were performed usiixg specific primers for Mx-l, TNF-au, MCP-1, 
p&5, S1Q0A4, and gJyceraldehyde-3-phoaphate dehydrogenase. 

Two-dimensional PAGE— The protein concentration of the lysates 
was measured as described by Bradford et al. (24). Poor isoelectric 
focusing (IEF) results wore encountered unless the polycationic sperm- 
ine was diluted (data not shown); therefore, lysates were diluted with 
rehydration buffer (GSI, Ann Arbor, MI) to achieve a final spermine 
concentration of 6 mM. Equal protein loads (1.6 mg) of control and 
LPS-stimulated neutrophils were used to rehydrate IEF gels overnight 
(18 cm, pH 3-10 nonlinear Immobitine DryStrip IEF gels, Amersham 
Biosciences; Piscataway, NJ). IEF was performed at 20 °C to 100-kVh 
(Phaser, GST) under mineral oil, followed by two 10-min SDS equilibra- 
tion steps (DTT and then iodoacetanudeKiontaining equilibration buff- 
ers, GSI) and then by vertical electrophoresis on 10% homogeneous 
polyacrylamide slab gels (GSI) at 500 V. Protein spots were visualized 
by agitation in colloidal Coomasaie Brilliant Blue G-2G0 (16 h) (26), 
followed by destaining in deionized water (20 h). In separate experi- 
ments, control and LPS-stimulated PMN lysates from three donors 
. were pooled and then analyzed by two-dimensional PAGE using over- 
lapping narrow isoelectric point (pi) ranges (18 cm, pH 6.0-6.0, 6.5- 



6.7, and 6-11, Amersham Biosciences, Piscatawny, NJ). Identical IEF 
and vertical electrophoresis parameters were used for all gels. 

Image Analysis of Two-dimensional Ge/s— Colloidal Coomasfrie- 
stained gels were digitized using a Powerlook II (UMAX Data Systems, 
Inc., Taiwan) flatbed scanner with 6-bit dynamic range and 150-dpi 
resolution. Biolmage (GSI, Ann Arbor, MI) 2D-Analyzer software was 
used to locate, quantitate, and match protein spots on the control and 
LPS gel images. Analysis was performed by assigning 60 common 
anchor spots between paired images; the remaining spots were com- 
pared by a constellation- matching algorithm. All data were then care- 
fully reviewed by the operator to account for any discrepancies. Protein 
loading between control and experimental gets may have varied be- 
cause of inconsistencies in rehydration of the different IEF gel strips; 
therefore, gel images were normalized so that the sum of the integrated 
intensities of all matched spots on paired gels was made equal Control 
and LPS-stimulated gel images from individual donor experiments 
were matched to generate composite images; composite images were 
then matched into a master composite image to track the LPS response 
of protein spots among different donora (26). Only those spots that were 
common (image-matched) to all original 12 (pH 3.0-10.0) gels were 
considered for further analysis. For these spots, the LPS-induced 
change in integrated intensity in the six experiments was subjected to 
statistical analysis with » two-tailed Student's f test, and those spots 
withp < 0.06 were identified by peptide mass fingerprinting (described 
below). For the narrow range (pH 5.0-6.0, 5.5-6.7, and 6-11) two- 
dimensional PAGE experiments using pooled donora, only those spots 
with concordant regulation exceeding 1.6-fold or that appeared de novo 
in the LPS gel in two repeat experiments were further analyzed. 

In-gel Tryptic Digestion— In-gel digestion of protein spots was per- 
formed with sequencing grade porcine- modified trypsin using the 
method of Hellman et al. (27). TVyptic peptides were then extracted (60 
til of 50% ncetonitrile/5% trifluoroacctic acid, 2 h), and the supernatant 
was token to dryness in a vacuum centrifuge and then redissolved in 
trifluoroacctic acid (20 /d, 0.6%). Peptides were then purified and con- 
centrated using ZipTip cl8 pipette tips (MilUpore, Bedford, MA). 

MALDI-TOF Mass Spectrometry— Analyses were performed on an 
Applied Biosystems matrix-assisted laser desorption ionization time-of- 
flight (MALDI-TOF) Voyager-DE PRO mass spectrometer (Framing- 
ham, MA) operated in delayed extraction mode. Samples (0.5 jd) were 
spotted onto a sample plate to which matrix (0.5 jd of 10 mgfrnl CHCA) 
was added. The sample-matrix mixture was dried at room temperature 
and then analyzed in reflector mode. CHCA was also spotted alone as a 
negative controL Spectra were the stun of 100 laser shots, and those 
peaks with a signal-to-noise ratio of greater tlian 3:1 were selected for 
data base searching. Spectra were internally calibrated using autolytic 
trypsin peptides (m/z 842.51, 2211.10). 

Data Base Searching Algorithm— -The monoisotopic masses for each 
protonated peptide were: (a) entered into the program MS-Fit (available 
at pro&pector.ucsf.edu) for searches against the Swiss-Prot, NCBI, 
and GenPept databases, and (6) entered into Mascot (available at 
matrixscience.com), an algorithm testing statistical significance of pep- 
tide mass fingerprinting identifications. For MS-Fit searches, masses 
derived from trypsin, CHCA, keratin, and Coomassie Brilliant Blue 
G-250 were excluded. Search parameters included a maximum allowed 
peptide mass error of 0.1 Da (0.8 Da in the few instances in which linear 
mode was used), consideration of one incomplete cleavage per peptide, 
pi range of 3.0-10.0, and molecular mass range of 1-200 kDa. Accepted 
modifications included carbamidomethylation of cysteine residues 
(from iodoacctamide exposure following IEF) (28) and methionine oxi- 
dation, a common modification occurring during 8DS-PAGE (29). Pro- 
tein identifications were assigned when three criteria were met: 1 ) 
statistical significance (p < 0.05) of the match when tested by Mascot 
(matrixscience.com); 2) >20% sequence coverage by the tryptic pep- 
tides; and 3) concordance (±16%) with the molecular weight and pi of 
the parent two-dimensional PAGE protein spot. The following special 
exceptions were considered: (a) protein identifications not fulfilling 
criterion 2 were still assigned if criteria 1 and 8 were fulfilled and no 
other Homo sapiens proteins with peptide mass-matched p values < 
0.05 were identified by Mascot; (£>) if criterion 3 was not fulfilled (lower 
than expected molecular weight), a cleavage product of the identified 
protein was inferred, and the cumulative molecular weight of the tryptic 
peptides was compared with that of the two-dimcnsional-PAGE spot to 
ensure that it was not exceeded; (c) if criterion 3 was not fulfilled (isolated 
discordance between theoretical and observed pi), post-translational mod- 
ification of an roirecovered peptide was inferred; and (d) if two or more H. 
sapiens protein assignments with >4 mutually exclusive matching pep- 
tides were identified, a protein mixture in the two-dimensional PAGE 
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Bpot was inferred and further analysis hid ted (quantitative conclusions 
regarding the individual protein constituents could not lie drawn). 

RESULTS 

Genes Differentially Expressed in LPS-stimulated Neutro- 
phils— Human PMNs were left untreated or incubated in the 
presence of 100 ng/mi LPS for 4 h. As a control to confirm that 
the PMNs were quiescent at baseline and that LPS resulted in 
normal stimulation, mRNA was isolated, cDNA was prepared, 
and PCR for TNF-a was performed. Little TNF-a expression 
was seen in nonstimulated cells, whereas LPS treatment led to 
an increase in expression in each of the donors subsequently 
used for microarray analysis (data not Bhown). No maerophage- 
colony stimulating factor receptor transcript was detected by 
oligonucleotide microarray analysis, confirming there was no 
significant monocytic contamination. 

Human PMNs express a limited repertoire of mRNA tran- 
scripts at baseline but respond to LPS with differential expres- 
sion of genes in many families. Considering only those genes 
present by microarray analysis in all three donors, unstimu- 
lated PMNs expressed 13.0% (923 of 7070 genes) of the Af- 
fymetrix gene set. Gene classes represented at baseline include 
metabolic enzymes, structural proteins, receptors, signaling 
proteins, and transcription factors. By comparison, human 
monocytes expressed —40% and human fibroblasts —35% of the 
represented genes (data not shown). By the criterion of a >3- 
fold increase in expression in all three donors on Affyraetrix 
oligonucleotide array analysis, exposure of PMNs to LPS for 4 h 
resulted in the up-regulation of 100 genes (Table I). 

Genes from several different functional classes were induced 
in PMNs following LPS exposure. Of interest, a number of 
transcriptional regulators were induced, including transcrip- 
tion factors of the NF-kB family. The transcriptional NF-kB 
complex has previously been implicated in the regulation of the 
genes induced by LPS (11). The genes for several cytokines and 
chemokines were also found to be up-regulated. These include 
TNF-a, IL-1Q, IL-6, MCP-1, MIP3a n and MIP-1& (Table I). 
PCR was performed to confirm the results from the microarray 
analysis. PCR analysis on selected genes indicates that the 
time course for changes can be rapid or delayed but parallel the 
changes found in the array at the 4-h time point (data not 
shown). Other up-regulated genes included those for metabolic 
enzymes, immune response molecules, kinases, phosphatases, 
signaling molecules, adhesion and cytoskeletal components, 
interferon-stiinulated genes, and those with unknown or mis- 
cellaneous function (Table I). 

LPS stimulation of PMN also resulted in the down-regula- 
tion of 56 genes (Table II). Down-regulated genes were identi- 
fied as transcriptional regulators, protein and lipid kinases and 
phosphatases, structural molecules, and signaling molecules. 
Genes for metnbolic proteins were also evident, as were several 
uncharacterized genes. 

Two-dimensional PAGE and Image Analysis — In contrast to 
the limited number of transcripts found at baseline, PMNs 
were found to express a large number and variety of proteins in 
the non stimulated state (Fig. 1, A and C, and Tables DI-V). 
Reproducible protein expression patterns were found on the pH 
3.0-10.0 gels, and the majority of proteins fell in the pH 5.0- 
7.0 range (Fig. 1A). The basic region (pH > 7.0) consistently 
exhibited poor resolution, precluding meaningful image analy- 
sis and further workup (data not shown). Depending on the 
spot-finding parameters (minimum spot intensity, filter width) 
selected on the image analysis software, spot-by-spot manual 
editing was found to be necessary to avoid over- and underde- 
tected spots; moreover, further manual editing was performed 
to screen for unmatched and mismatched spots following 
matching of paired control and LPS-stimulated gels. After spot 



editing, -1200 well-reBolved spots were evident on each pH 
3.0-10.0 gel. In an attempt to improve resolution of the pi 
range bearing the greatest number of well-resolved spots, over- 
lapping narrow pH range gels (pH 5.0-6.0, 5.&-6.7, 6-11) were 
also run. Of interest, a similar number of well-resolved spots 
(-1200) were detected on the narrow pH range gels (Fig. 1, C 
and D). Assuming a detection limit for Coomassie of 15 ng (0.25 
pmol, or 1.5 X 10 11 molecules, for a 60-kDa protein) and a 
protein load per gel corresponding to 75 X 10 6 PMNs, we 
estimate a detection limit on our gels of 2000 molecules/cell for 
a 60-kDa protein. As investigators have suggested in other cell 
lines with the use of high resolution two-dimensional-PAGE 
methods (30), we estimate that > 10,000 proteins are expressed 
in the resting PMN. 

Human PMNs respond to LPS with the differential expres- 
sion of a large number of proteins. In the six individual pH 
3.0-10.0 experiments, the number of protein spots that in- 
creased in integrated intensity by at least 50% following LPS 
exposure was 185, 122, 104, 104, 96, and 131, respectively. The 
number of protein spots that decreased by at least 50% follow- 
ing LPS exposure was 72, 151, 102, 98, 128, and 97, respec- 
tively. Although gel-to-gel regional variability in resolution was 
expected to account for individual spots not being well visual- 
ized on particular gels, only those spots that were matched to 
all 12 original gels were analyzed further. Overall, the number 
of spots matched to all 12 original gels was 125. The numbers 
of spots that were both matched to all 12 origi nal gels and that 
increased by at least 50% in integrated intensity in the indi- 
vidual experiments following LPS exposure were 46, 13, 17, 27, 
22, and 20, respectively. The numbers of spots that were 
matched to all 12 gels and that decreased by at least 50% were 
6, 22, 17, 22, 34, and 28, respectively. The LPS-induced change 
in integrated intensity of the 126 spots that were matched to all 
12 original gels was subjected to statistical analysis with a 
two-tailed Student's t test, and those spots with statistically 
significant (p < 0.05) regulation among the sue experiments 
were identified by peptide mass fingerprinting (Table III), 

Identification of LPS-regulated Proteins—Several proteins 
were consistently up-regulated on the pH 3.0-10.0 gels (Table 
III), including regulators of inflammation (annexin HI) and 
signaling molecules (Rab-GDP dissociation inhibitor f3). Sev- 
eral actin fragments were seen to be consistently up-regulated 
in the sue experiments following LPS exposure (Table III). Of 
interest, the proteasome 0 chain was also consistently up- 
regulated. Down-regulated proteins included other signaling 
molecules, such as Rho GTPase activating protein 1. 

On the pH 5.0-6.0 and 5.5-6.7 gels, several proteins were 
found to show increases of greater than l.fcVfold following LPS 
exposure (Tables IV and V), including cytoskeletal proteins, 
such as moesin, non muscle myosin heavy chain, and a putative 
phosphorylated form of nonmuscle myosin heavy chain, and 
signaling molecules, such as protein phosphatase 1 and P0 4 - 
stathmtn. The putative phosphorylated form of nonmuscle my- 
osin heavy chain (spot #J 101) was positioned 0.03 pH unit more 
acidic than the unmodified protein (spot #1102) (Fig. ID) and 
was distinguished by a tryptic peptide (mtz 1366.74) not pres- 
ent in the unmodified protein, consistent with phosphorylation 
of serine 685. Serine 685 is predicted by NetPhos 2.0 Prediction 
Server (available at wTvwxbs.dtu.dk/services/NetPhos/(31)) to 
be a liigh probability phosphorylation residue and by Scan- 
Prosite (www.expasy.ch/tools/scnpsite.html) to be a substrate 
for protein kinase C. The tryptic phosphopeptide identified in 
P0 4 -stathmin, extending from residues 15 to 27 (1468.7 Da), is 
consistent with phosphorylation of either 6erine 16, a known 
substrate for Ca 2 7calmodulin (CaM)-dependent kinases (32), 
or serine 25, a known substrate for p385 and ERK (Fig. 2A) 
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Table I 

Fluman neutrophil genes induced after 4 hof LPS exposure 



. Description 



GenBunk 71 



Chuxige-fuld 



Transcriptional regulation 
Pleiomorphic adenoma gene -like 2 
NFKB2 
NFKB1E 
p65 
BCU 

Xbox binding protein 1 

Me.tal-n>gulatary transcription factor 1 

Ets 2 

c-Rel 

NFKB1 

Basic leucine zipper transcription factor, ATF-like 
1KB 

MAX dimerization protein 
DIF2 

Cytokines and receptors 
MCP-1 
MlP-lfi 

aHelix coiled-coil rod homolog 
/L-i/J 

GRQ3 (beta) 

TOT or 

MlP-3* 

ILIORA 

Ih-B 

GROa 

HM74 

Immune response 
Orosomucoid 

Complement component C3 

Pixrtea&e inhibitor 9 

Complement component 3a receptor 1 

Protease inhibitor 3 

SW7/antilaukoproteaBe 

ELANH2MaetaBO inhibitor 

CD58 

Complement component PFC 

Kinases 
CNKf FNKf PLK-like 
Cot 
Pim-2 
L1MK2 

Phosphatases 
PAC1/DUSP2 
DUSP5 
PHA1 

Signaling molecules 
77VFAIPJ/A20 
VRAFl 
RanBP2 
GNA15 
PTAFR 

Adhesion and cytoskeleton 
ICAM1 

CEACAMl (bilary glycoprotein) 
UMS1 

SNTJbcUxi bundling protein 

Galcctin-1 ILGALS1 

MEMD/ALCAM 

CD44 

TSG-6 

Metabolic 
GTP cyclohydrolasp. I 
iV2H VV2/ubiquinone reductase 
PSA£A6/lproteosonie iota) 
UDP-galactose transporter (SLC35A2) 
PLAU (urokinase) 
KlWU/Lrkynurt^nine hydrolase 
AMPD3 

P4HAl/xfro\y\ 4-hydroxyiasc 
y Glutamylcysleine synthetase 
ATP6D 
ATP6S1 



D83784 


16.8 


S76638 


12.3 


U91616 


11.5 


L19067 


8.4 


U05681 


7.7 


M31627 


7.5 


X78710 


7.4 


J04102 


7.4 


X75042 


6.2 


M58603 


6.6 


U15460 


4.7 


M69043 


3.8 


L06695 


3.6 


S81914 


3.1 



M69203 

M72885 

AF014958 

X04500 

M57731 

X02910 

U64197 

U00672 

Y00081 

X54489 

D10923 

X02544 
K02765 
U71364 
U28488 
L10343 
X04470 
M93056 
Y00636 
M83652 



78.7 
48.8 
20.8 
17.6 
17.3 
14.5 
8.1 
7.3 
6.3 
4 

3.8 



20.2 
12.8 
9.5 
6.1 
4.9 
4.7 
4.6 
3.8 
3.5 



U56998 


16.2 


D 14497 


11.9 


U77735 


9.5 


045906 


4.3 


LI 1329 


11.8 


U15932 


5.3 


U73477. 


3.4 


M59465 


10 


U19261 


6.2 


D42063 


5.6 


M63904 


5.2 


D10202 


3.9 


M24283 


22.4 


XI 6354 


6.3 


U09284 


6.1 


U03057 


5.9 


M57710 


4.7 


U30999 


4.2 


HG2981— HT3125 


3.9 


M31165 


3.7 


U19523 


13.5 


M22538 


8.6 


X59417 


8.4 


OS4454 


7.3 


X02419 


6.4 


U57721 


5.5 


D12775 


5 


M24486 


4.7 


L35546 


4.5 


J05682 


4.2 


D16469 


4 
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Tabu; I— continued 



Description 



GenBank™ 1 i 



Glycerol kinase 

FAChl 

AK3 

Interferon-inducible 

rsais 

Mxl 

iFm 

INDO 
GBPI 
PHKH 
TFIT4 
IFI54 
IFI58 
IFP35 

Other 

Gos2 

M1HCI1AP1 

KIAA010S 

KIAA0118 

SNAP23 

CASP5 

IOAA0U3 

KIAA0255 

Hepatoma-derived GF 

PTGS2 

CD48 

UNCI 29 homolog 

KIAA0I51 

Rablb 

Annexin VI) 

fOAAOUO 

Adrenomedullin 

AIM] 

KIAA0250 

PS~1 

Scavenger receptor expressed by endothelial cells 
VHL 



X68285 
L09229 
K60673 



M13755 
M33882 
M24594 
M34455 
M55542 
U50648 
U52513 
M146G0 
U34605 
U72882 



M728S5 

U37546 

D14661 

D42087 

LT55936 

U28015 

D30755 

D87444 

D16431 

D28235 

M37766 

U40998 

D634B5 

XM0356G0 

J04543 

D14811 

014874 

U83115 

D87437 

U)6175 

D63483 

L15409 



Chunge-fold 



3.6 
3.5 
3.3 



22.5 
19.4 
12.1 
5.2 
4.3 
3.7 
3.6 
3.5 
3.5 
3 



46.8 
7.2 
5.1 
5 
5 

4.8 
4.8 
4.7 
4.7 
4.6 
4.3 
4.2 
3.9 
3.8 
3.7 
3.7 
3.7 
3.6 
3.2 
3.2 
3.2 
3.1 



(33). Assuming that no other multiply phosphorylated etath- 
min species had escaped detection, analysis of the integrated 
intensities of the P0 4 -stathmin and stathmin spots indicates 
that the percentage of the P0 4 form of total cellular stathmin 
increased from 11% to 38% with LPS stimulation (Fig. 2B). 
This is similar to a previous report of an increase from <10% to 
35-40% of the Ser^-phosphorylated form in Jurkat cells stim- 
ulated with anti-CD3 (34). 

Effort of SB203580 on LPS-stimulated Gene Expression— 
Gene expression analysis of PMNs stimulated with LPS indi- 
cated that the majority of genes induced by LPS were unaf- 
fected by prior treatment of PMN with SB203580. Of the 100 
genes up-regulated by LPS, the up-regulation of 23 was inhib- 
ited by greater than 40% (Table VI). The majority of these 
genes affected by SB203580 were inhibited by less than 60%. 
whereas only six were inhibited by greater than 80%, all of 
which represent previously identified interferon-stinmlnted 
genes. Induction of cytokine genes by LPS, with the exception 
of IL -6, was generally unaffected by SB203580. 

Effect ofSB2035S0 on LPS- stimulated Protein Expression— 
Similar to the effect of SB203580 on LPS-stiinulated gene 
expression, little effect of SB203580 was seen on expression 
levels for the majority of LPS-regulated proteins (Table VII). 
Two exceptions are annexin III and a-enolase, for which LPS- 
stimulated expression was attenuated in the presence of the 
p38 MAPK inhibitor. 

Comparison of Microarray and Proteomics Resu!,ts— Of the 
LPS-regulated proteins identified by peptide mass fingerprint- 
ing for which probes were present on the oligonucleotide mi- 
croarray, poor concordance was found at the mRNA level (Table 
VIII). For 13 LPS- up-regulated proteins, 2 corresponding 



mRNA transcripts were up-regulated, 1 was down-regulated, 5 
were unchanged, and 5 were not detected by the Affymetrix 
chip. For 5 down-regulated proteins, 3 corresponding tran- 
scripts were down-regulated, 1 was unchanged, and 1 was not 
detected. Varying patterns of LPS regulation emerge for those 
candidates detected at both the transcript and protein level. 
Proteasome 0 chain was up-regulated at both the transcript 
and protein levels (Table VUT), with no notable effect of 
SB203580 on expression at either level. Similarly, CAP1, Rho- 
GAP1, and ficolin 1 were down-regulated at both the mRNA 
transcript and protein level (Table VIII), with no notable effect 
of SB203580. Annexin HI was down-regulated at the transcript 
level and up-regulated at the protein level, with an inhibitory 
effect of SB203580 seen only at the protein level (Tables VII 
and VIA). 

DJSCU&SION 

Interaction of bacterial LPS with the human PMN repre- 
sents a model system for studying the activation and output of 
the innate immune system during infection and inflammation. 
A recent publication (35) describes the gene expression changes 
of a cultured monocytic cell line after infection by the Gram- 
positive bacterium Listeria monocytogenes. The cell wall com- 
ponents of Gram-positive bacteria, like Gram- negative-derived 
LPS (i.e. from E. coli) t are known to signal through TLRs (36, 
37). Importantly, many of the expression changes found in 
LPS-stimulated PMNs in the present study were also described 
in the bacteria-exposed monocytic cells, indicating that many of 
the gene expression changes seen in bacterial infection are 
likely mediated by TLRs (38, 39) and that the LPS model 
system accurately reflects exposure of immune cells to infec- 
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Table II 

Human neutrophil genes repressed (>4-fold) after 4 h of LPS exposure 



Description 



GtnButik 11 



Kinases 
CAMK, II, gamma 
Diacylglycerol kinase, delta 
PRKCL2/PKK2 protein kinase C-like 2 
MAPKAPK3 

Protein kinase Ht31, eAMP-dependent 
C AUK 11 

Transporters 
SLC25A5/B olute carrier family 25, member 5 
SrX!19Al; folate transporter 
SLC2A3; facilitated glucose transporter 

Metabolic 
Carbonic anhydrase JV 
KNase A family, kH 
Glycogen phosphorylase; liver 
Inositol polyphosphate-5-phospkatase 
Inositol 1,3,4-trisphosphate S/6-kinase 
Transketolase 

Protein phosphatase 4, reg. eubunit I (clone 23840) 

Cytidine deaminase 

MGATI 

HMOXi 

MAN2A2 

Glycogenin (also represents U31525) 

Structural 
Fibrinogen-like protein (pT49 protein) 
H2AFZ 
Paxillin 
Ijamin H R 
Dynamin 2 
Actinia 1 
a-Tuhulin 

Tubulin, al, isoform 44 

Transcriptional regulators 
I/ymplmblastic leukemia-derived sequence 1 
MAX'interacting protein 1 
Nuclear factor crythroid 2 isoform f 
Transducer of ERHB2, 1 
NFATC4 
A7T-2iCRK-Bpa) 

Receptors 
Lympkotaxin receptor 
Folate receptor 3 (gamma) 

Signaling 
Pixa; cool-2 (KIAA0006) 
ARHB/KlwYi 
TNFSF10; TRAIL 

Ca tt+ binding 
ANX1I 
S100A4 
ANXI 

Other 
Proteolipid protein 2 

Protein phosphatase 1, a catalytic Bubunit 

TIMP2 

KIAA0199 

Lipin 2 (KIAA0249) 

LRMPUavfl) 

CUGBP2 

Clone 23933 

PECAMl 

Delta sleep-inducing peptide 

DiGeorge synd. critical region gene 2 (KIAA0163) 
SELPLG: CD162; selectin P ligand 



IT50360 
D63479 
U33052 
U09578 

HG2167-HT2237 
L07044 

J02683 
U17566 
M20681 



L10955 
U64998 
Ml 4636 
1757650 
U51336 
L12711 
TJ79267 
L27943 
M55621 
X06985 
1.28821 

HG4334-HT4604 

Z36531 
M37583 
U 14568 
L25931 
L36983 
M95178 
XOI703 

HG226^-HT2348 

M22633 
L07648 
S77763 
D38305 
L41067 
L05515 

L04270 
U08471 
U11875 

D25304 . 

M12174 

U37518 

L19605 
M80563 
X05908 



1,09604 

HG1614-HT1614 

M32304 

D83782 

D87436 

U 10485 

U69546 

1779273 

L34657 

Z5078X 

D79985 

U25956 



Change 

•fad 

-4 

-4.2 

-4.3 

-6.3 

-8 

-9.8 



-4.2 
-4.4 
5 



-4.4 
-4.5 
-4.6 
-4.6 
-4.7 
-4.8 
-4.9 
-5.4 
-5,4 
-5.4 
-5.8 
-5.9 



-4.2 
-4.7 
-4.9 
-5.9 
-6.2 
-6.7 

-10 

-15 



-4.4 

-4.5 

-6 

•6.9 

-7.8 

-9.6 



»4.4 

-5 

-5.3 



-4.5 
-4.5 
-6.6 



-4.3 
--4.8 
-4.8 



-4.9 
-5 
-5.1 
-5,2 
-5.6 
-5.8 
-6.9 
-7 
-8 
-8.7 
-9 
-32 



tion. Nevertheless, the reliance upon DNA microarrays alone 
affords insight only into the transcriptional response without 
corroboration at the protein level. In the present study, appli- 



cation of both DNA microarray and proteomics technology to 
our model system provides unique insight into both the cellular 
biology of the activated PMN and the responsiveness und reg- 
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Fig. 1. Two-dimensional PAGE of LPS- exposed human PMNs. 

A and fl, colloidal Coomasaie Blue-stained pH 3.0-10.0, two-dimen- 
sional PAGE gels (A, control; B t LPS-exposod) with up-regulated (solid 
arrows) and down-regulated {hatched arrows) proteine indicated. These 
results arc representative of six separate experiments. C and D, colloi- 
dal Coomassie Blue-stained pH 5.0-6.0, two-dimensional PAGE gels 
t'C, control; D, LPS-exposed) with up-regulated (solid arrows), new 
(solid arrow, open arrowhead), and down- regulated (hatched arrows) 
proteins indicated LPS -exposed PMNs from three blood donors were 
pooled. 

ulation of its transcriptional and translationai machinery. As 
will be discussed below, our study identifies, in particular, 
novel aspects of the LPS-stimulated PMN transcriptional reg- 
ulation, activity in the innate immune response, signaling, 
cytoskeletal reorganization, and priming for granule release. 

In the present study, the increase in NF-kB transcript abun- 
dance (Table I) detected by the microarrays corroborates the 
findings of other studies of PMNs and monocytes (40) and 
indicates a mechanism for the responsiveness and scope of the 
PMN transcriptional machinery following LPS exposure. NF- 
kB, recently described to be activated by LPS through the 
TLR/MyD88/interleukin-l receptor-associated kinase pathway 
(1, 4), ih the only transcriptional complex reported to be in- 
duced by LPS in the PMN. However, because the transcrip- 
tional NF-kB complex has been implicated in the regulation of 
only a portion of the genes induced by LPS in this study (data 
not shown), the importance of alternative transcriptional reg- 
ulators in the PMN is clear. Of interest, several other known 
and putative transcriptional regulators with less well defined 
functions were also up-regulated in the present study, includ- 
ing PLAGL2, a putative zinc-finger protein, XBP-1, MTF-l t 
Ets-2, B-ATF , and DIF-2. On the other hand, LPS-down-regu- 
Iated genes include ATF-2 (a known target of p38), NFATC4, 
TOB-U NF-E2, MXI-l, and LYL-1. Although the exact role of 
these gene products in regulating cell function is unknown, 



these data indicate that the range of transcriptional responses 
in the LPS-stimulated PMN is much broader than previously 
suggested and that the signaling capabilities of the PMN in the 
immune response are thereby likely extended in scope and 
specificity. 

As expected from the literature, the genes for several cyto- 
kines and chemokines, including JL-J/3, IL-6, and MIP-lp, were 
found to be up-regulated (Table I). On the other hand, the 
notable absence of up-regulated cytokines in the proteomics 
experiments reflects their removal in the post-LPS incubation 
wash performed prior to lysis for two-dimensional-PAGE. Up- 
regulation of these inflammatory mediators is well documented 
in PMNs exposed to LPS and in animal models of LPS-induced 
sepsis syndrome and acute respiratory distress syndrome, a 
PMN-mediated illness (41, 42). Several genes in this family 
were up-regulated that have not, to our knowledge, been de- 
scribed in LPS-stimulated cells, including MCPl, GR03, 
IL-10RA, and HM74, an orphan G protein-coupled receptor 
with homology to chemokine receptors. The down-regulation of 
TNFSF10, lymphotoxin b receptor, and TNFAIP1 were also 
observed. The modulation of genes involved in cytokine signal- 
ing, including the adapter molecules TRAFl (LPS and TNF 
receptor signaling) and TNFAIP1 (TNF receptor signaling) and 
several kinases and phosphatases, may indicate a change in 
cytokine responsiveness after LPS treatment. Relevant in this 
regard from the proteomics data are: 1) the up-regulation of 
protein phosphatase 1, which has been shown to regulate PMN 
NADPH oxidase activation and translocation (43, 44) and to 
regulate LPS-induced NF-kB activation (45); 2) the down-reg- 
ulation of Rho-GAPl, which has been shown to regulate 
NADPH oxidase activity in the PMN (46); and 3) the up- 
regulation of PO + -stathmin (Table IV), a phosphoprotein pos- 
tulated to function as a relayer and integrator of multiple 
signal transduction pathways (34). Several noncytokine, 
nonchemokine genes involved in the immune response were 
also up-regulated, including the complement pathway mem- 
bers C3, C3AR1, and PFC\ the protease inhibitors ELANH2 
(elastase inhibitor), SLPI, PI-3, and PI 9; and the acute phase 
protein orosomucoid. LPS regulation of C3AR1 and orosomu- 
coid expression have not previously been reported. In the pro- 
teomics experiments, the down-regulation of ficolin-1 (Table ni), 
a coliec tin-like cell surface protein reported to activate the com- 
plement system and to mediate adhesion and phagocytosis in 
monocytes but not previously reported in granidocytes (47), may 
represent negative modulation of the innate immune response. 
The finding that genes other than cytokines and chemokines are 
regulated by the PMN in response to LPS indicates that the PMN 
plays a more sophisticated role in host-defense and immunity 
than previously thought 

Treatment of the PMN with LPS lead to the induction of a set 
of genes associated with the anti-viral Type I interferons, 
IFNa/0. This induction occurs independently of the release of 
IFN or another unidentified soluble factor. 2 Furthermore, the 
set of genes expressed is smaller than that induced by IFNa/0, 
as described by Der et al. (12). This may be due to differences in 
the scope of the signaling systems activated by LPS and 
IFNa/0, or the time course of analysis of genes in the LPS- 
stimulated PMN. The implication that LPS treatment of PMN 
allows PMN to express anti-viral activity is currently being 
tested. Of interest was the finding that induction of interferon- 
stimulated genes was blocked by pretreatment of PMNs with 
SB203580. Work from our laboratory has indicated that signal 
transducers and activators of transcription activation does not 
occur in response to LPS in PMNs. 2 In addition, interferon- 



3 K. C. Malcolm and 0. 8. Worthcn, manuscript in preparation. 



31298 



LPS-activated Neutrophils: Microarrays and Proteomics 



Tabus III 

Analysis of pH 3.0-10.0 two-dimensional PAGE gels 
Moari change(-fold) in expression level among six PMN donors is reported. The change in expression for the proteins Hated was statistically 
significant ip < 0.05) as measured by a two-tailed Student's t test. 



Identification [spot no.) 


Swias-Prot no. 


Estimated 
Afi/pl 


Theoretical 


Peptides matched/ 
submitted 


Protein 
covered 


Mean 
change 










% 


% 


fold 


Up-regulated 














Pmteasome fj chain [646] 


P28070 


27/5,7 


29.2/5.72 


9/12 (75%) 


36% 


1.51 


Amiexin III [550] 


P12429 


31/5.7 


36.4/5.6 


14/18 (78%) 


42% 


1.37 


Actin fragment [544|° 


P02570 


32/5.5 


(41.7/5.29) 


13/15 (87%) 


(34%) 


1.74 


Actin fragment [5911* 


P02570 


30/5.4 


(41.7/5.29) 


14/18 (78%) 


(29%) 


1.60 


«-Enolase [360] 


P06733 


41/5.7 


47.2/7.01 


9/10 (90%) 


24% 


1.65 


Rab-CfDP dissociation inhibitor 0 [289] 


P50395 


50/6.1 


50.7/6.11 


10/11 (91%) 


25% 


1.24 


Glutathione S- transferase P [648] 


P09211 


23/5.5 


23.4/5.43 


6/8 (75%) 


41% 


1.54 


Pre-B-cell colony enhancing factor 11152] 


P43490 


53/7.0 


55.5/6.60 


12/16 (75%) 


25% 


1.29 


Dawn -regulated 














Adenylyl eyclase-associated protein 1 [256] 


Q01518 


55/7.3 


51.7/8.07 


16/22 (73%) 


34% 


0.53 


Rho-OAPl [283] 


Q07960 


50/5.8 


50.4/5.85 


7/9 (78%) 


22% 


0.67 


Ficolin 1 (511] 


000602 


33/6.5 


35/6.39 


10/12 (83%) 


26% 


0.74 



° The theoretical pi and M R of native actin are indicated. Protein coverage indicates coverage of native actin. 

Table IV 

Analysis of pH 5.0-6.0 two-dimensional PAGE gels 
Results are from pooled samples for control (n~3) and LPS -ex posed (n - 3) PMNs from human donors. Expression of the reported proteins was 
altered > 1.5-fold following LPS exposure in two repeat experiments. "New" designates proteins seen in tho LPS gel in two repeat experiments but 
not detectable in the corresponding control gels. 

Identification {spot iic.l 

Up-regulated 
Protein- tyrosine kinase 9- like [4681 
Protein phosphatase 1, catalytic subunit, fi isoform 
[378] 

P0 4 -stathmin [5771 
Nonmuscle myosin heavy chain [1102] 
Putative P0 4 -nonmuscle myosin heavy chain [110 If 
Leukocyte elastase inhibitor [318] 
Grancalan [1004] 

Dawn-regulated 
Adenosylhomocysteinase [324] 

PEST phosphatase interacting protein homotog [234)" 
" TrEMBL access ion number. 

h Accession number and theoretical pi and Af R for the unmodified protein are indicated. 

* ATCBI accession number. 
J See text for explanation. 

* Among three experiments, the ratio of P0 4 -stutbmin expression increase, following LPS exposure in the presence of SB203580 divided by that 
in the absence of SB203580, was 0.93. 

'Genpept accession number. 

a This search was performed using average masses measured by linen r mode MALDI-TOF MS. 

Table V 

Analysis of pH 5.5-6.7 two-dimensional PAGE gels 
Results are from pooled samples for control (n ~ 3) and LPS- exposed (n = 8) PMNs from human donors. Expression of the reported proteins was 
altered > 1.5-fold following LPS exposure in two repeat experiments. 



Swiss-Prot 
no. 


Estimated 
i»Vpl 


Theoretical 


Peptides matched/ 
submitted 


Protein 
covered 


Change 








% 


% 


fold 


Q9Y3F5" 


34/5.81 


39.5/6.37 


10/14 (71%) 


34% 


1.8 


P37140 


38/5.73 


37.2/5.84 


7/10 (70%) 


22% 


2.0 


P16949 4 


18/5.36 


17.3/5.76 


9/12(75%) 


42% 


. 2.1* 


189036' 


145/5.32 


145/5.23 


20/21 (95%) 


17% 


New 


189036 fcr 


145/5.29 


145/5.23 


14/16 (87%) 


13% 


New 


P30740 


42/5.71 


42.7/5.9 


9/13 (69%) 


22% 


2.4 


P28676 


24/5.36 


24.0/5.02 


7/10 (70%) 


31% 


New 


P23526 


46/5.82 


47.7/6.04 


7/9 (.78%) 


14% 


0.4 


4100162' 


48/5.30 


47.675.35 


11/13 (85%) 


30% 


0.5 



identification [spot no.) 


Swiss-fret 
no. 


Estimated 


Theoretical 
AVpI 


Peptide* matched/ 
submitted 


Protein 
covered 


Change 










% 


% 


-fold 


Up-rcgulatcd 














Tninsntdolase [475] 


P37837 


38/5.95 


S7.5/6.36 


13/17(76%} 


33% 


2.5 


Isocitrate dehydrogenase [431] 


075874 


4a'6.25 


46.7/6.35 


7/7(100%) 


13% 


2.3 


Moesin [201] 


P26038 


61/6.09 


67.8/6.07 


11/13 (85%) 


17% 


2.1 


<r-Enolp.se (459) 


P06733 


43/5.64 


47.2/7.01 


7/10 (70%) 


17% 


3.8 


Dawn-regulated 














Calponin H2 [2401 


Q99439 


34/6.65 


33.7/6.94 


10/11(90%) 


27% 


0.5 



regulatory factor 3, a known regulator of interferon- stimulated 
gene transcription, ia not a direct target of p38 kinase. 2 There- 
fore, gene expression analysis of LPS- stimulated PMNs has 
uncovered a previously uncharacterized signal transduction 
system that is sensitive to inhibition of p38 MAPK. 

Knowledge of the genes down-regulated by LPS permits the 



development of further hypotheses addressing PMN function in 
the face of infection. Strikingly, several down-regulated genes 
and gene products are structural in nature (e.g. paxillin, acti- 
nin, calponin 112) (Tables n and V). A known consequence to 
the PMN of LPS exposure is decreased motility (48). Uprregu- 
lation of genes for adhesion molecules (ICAM1, CD44 y AL- 
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ASGQAFELILSPR 



15 



27 




Tabib vt 

Effect ofSB203580 on LPSstimuIated gene expression 
Genes are reported for which the SB203580/control expression ratio 
is < 0.60. 



Gene name 



-fold change ratio 
<SB20358O&rotrol) 



ISG15 

HCH 

Mx-i 

IFI66 

PI-9 

Ete-2 

IL6 

Rel 

UMS1 

C3AR1 

WDO 

KIAA0105 

SNAP23 

SLPI 

ELNAI12 

HM-74 

PKR 

MAD 

IFIT4 

Glycerol kinase 
IFI54 
IFI58 
IPF35 



0.09 
0.38 
0 
0 

0.57 
0.59 
0.45 
0.50 
0.58 
0.49 
0.35 
0.41 
0.58 
0.58 
0.49 
0.57 
0 

0.21 
0.12 
0 
0 

0.39 
0.46 



Change in absence 
afSB20358a 



-fold 
22.5 
20.8 
19.4 
12.1 
9.5 
7.4 
6.3 
6.2 
6.1 
6.1 
5.2 
5.1 
fi.O 
4.7 
4.6 
3.6 
3.7 
3.6 
3.6 
3.6 
3.5 
3.5 
3.0 




Fig. 2. A, the predicted sequence of the tryptic phosphopeptide in 
P0 4 -stathmin (1468.72 Da). The peptide mass measured by MALDI- 
TOF MS and the predicted mass differed by 14 ppm. As indicated, two 
alternate phosphorylation sites are possible: serine 16 and serine 25. B, 
P0 4 -stathmin and etathmin were identified on the control and LPS- 
exposed pH 5.0-6.0 gels. Consistent with phosphorylation, the P0 4 - 
etathmin spot was distinguished by a peptide of mass 1468.72 Da {Le. 
80 Da greater than the peptide of 1388.72 Da seen in the stathmin spot). 
Assuming that no other multiply phoaphorylated stathmin species have 
escaped detection, analysis of the integrated intensities of the PO,- 
stathmin and stathmin spots indicates tliat the percentage of the P0 4 
form of total cellular stathmin has increased from 11% to 38% with LPS 
stimulation. The decrease in integrated intensity for stathmin was 
equal in amount to the increase in P0 4 -stathmin following LPS 
exposure. 

CAM, and TSG-6) t and down-regulation of genes for structural 
proteins, indicates a genetic basis for this observation. Down- 
regulation of two genes implicated in cytoskeletal regulation, 
Pix-a and RhaB, was also observed. The calcium-binding pro- 
tein S100A4, down-regulated in LPS-treated PMNs (Table II). 
has been implicated in cell motility and metastasis (49). De- 
creased motility may be beneficial in sustaining the inflamma- 
tory response at sites of infection. In addition, LPS treatment 
results in an inhibition of apoptosis (50). Therefore, the longer 
residence time of the PMN at sites of infection is consistent 
with the long term genetically coded changes seen in these 
gene-profiling experiments and indicates that the changes in 
gene expression are functionally relevant to host defense and 
immunity. 

By providing information on post-translational modification, 
the proteomics (lata may provide further insights into the cy- 



Table VII 

Effect ofSB203S80 on LPS -stimulated protein expression 



Protein mime 


-fold dianei; rutin 
<SB2U36B0/ccntrol> 


Change in 
ubeexice of 
SB2035B0 






-fold 


Up-regulated 






Proteasome 0 chain 


0.8 


I.5t 


Annex in III 


0.6 


1.37 


Actin fragment [544] 


0.8 


1.74 


Actin fragment (591) 


0.8 


1.60 


a-Enolase 


0.6 


1.65 


Rab-GDP dissociation inhibitor 0 


1.1 


1.24 


Glutathione S-trunsfcrase P 


1.2 


1.54 


Pre-B-cell colony enhancing factor 


1.2 


1.29 


Down-regulated 






Adenylyl cyclase-aesociated protein 1 


. 13 


0.53 


Rho-GAPl 


0.8 


0.67 


Fieolin 1 


1.0 


0.74 



toskeletal remodeling effects of LPS upon the PMN. We con- 
tend that the actin fragments identified (Table III) are unlikely 
to represent technical artifacts. Rather, their specificity (iden- 
tical molecular weight/pl among different experiments), statis- 
tically significant up-regulation by LPS, as well as the use of a 
Lysis buffer containing chaotropes and multiple protease inhib- 
itors argue instead that these fragments are physiologic con- 
sequences of LPS exposure in the human PMN. More specifi- 
cally, the up-regulation of these fragments following LPS 
exposure (Table III) suggests that LPS may activate an actin- 
cleaving enzyme, which, in turn, remodels the cytoskeleton. 
Intriguing in this vein, calpain has recently been reported to 
play an important role in cell migration and cytoskeletal orga- 
nization of fibroblasts (51). The possibilities that LPS may 
induce calpain activation and that calpain activation may reg- 
ulate cytoskeletal reorganization and motility are currently 
under investigation. An alternative possibility is that actin 
cleavage is a marker of neutrophil apoptosis (52). 

Other LPS-regulated proteins may play important roles in 
cytoskeletal reorganization. The up-regulation of protein-ty- 
rosine kinase 9-like (A6-related protein) may modulate LPS- 
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Tabus VIII 

LPS-regulated proteins for which a probe was present on the 
Affymetrix chip 

A comparison, of corresponding protein and mRNA transcript changes 
following LPS exposure is shown. 



Protein 


Proton 

change 


uiRNA change 






-fold 


Up-regulated 






Proteasome p chain 


1.5 


1.9 | 


Leukocyte eLaBtatue inhibitor 


2.4 


4.6 f 


Rab-GDI p 


1.24 


NC 


Grancalcin 


New 


NO 


Transaldolase 


2.5 


NC 


Moesin 


2.1 


NC 


Nonmuscle myosin heavy chain 


New 


NC 


Glutathione S-transferase P 


1.54 


Absent 


Pre-B cell enhancing factor 


1.29 


Absent 


Lsocitrate dehydrogenase 


2.3 


Absent 


PO>-stathmin 


2.1 


Absent (stathmin) 


Protein phosphatase 1, B catalytic sub unit 2 


Absent 


Annexin III 


• 3.1 


3.1 1 


Down-regulated 






Adenvlyl cvclase-associated protein 1 


1.9 


2.1t 


Rho-GAP i 


1.5 


2.7 4 


Ficolin 1 


1.4 


1.7 i 


Adenosylhom ocys teinase 


2.5 


Absent 


Calponin H2 


2 


NC 


" NC ( no measureable change. 



induced actin polymerization, because it bears a high degree of 
homology to tovinfilin (A6), an actin monomer-binding protein 
that localizes to sites of rapid filament assembly in cells and is 
believed to regulate actin filament turnover (53), tn turn, LPS- 
induced down-regulation of Rho-GTPase activating protein 1 
(Table IE) may regulate twinfilin (and protein-tyrosine kinase 
9-like) activity, because twinfilin has been shown to colocalize 
with Racl and Cdc42 and to be regulated by active Racl in NIH 
3T3 cells (53). Activation of Rho proteins may be facilitated by 
LPS up-regulation of moesin (Table V), because moesin report- 
edly induces the dissociation of Rho from GDI (54). Racl may, 
in turn, promote activation of the actin filament-nucleating 
Arp2/3 complex tlirough interactions with WASP (Wiskott-Al- 
drich syndrome protein) family proteins (55) and, interestingly, 
is postulated to regulate the dynamics of both the actin and 
microtubule cytoskeletons via phosphorylation of stathmin (Ta- 
ble IV) (56). Calponin H2 is an actin-binding protein not pre- 
viously reported in PMNs that is postulated to play a role in 
cytoskeletal organization (57). Its down-regulation by LPS (Ta- 
ble V) likely modulates LPS-induced cytoskeletal reorganiza- 
tion. The up-regulation of nonmuscle myosin heavy chain and a 
putative phosphorylated form of myosin heavy chain (putative 
protein kinase C substrate by prediction rules) in the LPS- 
exposed PMN (Table IV) is of uncertain significance; myosin 
has been implicated in multiple functions in the PMN, includ- 
ing locomotion, fluid pinocytosis, and phagocytosis (58). Of 
interest, however, S100A4 (down-regulated, Table II) lias been 
reported to regulate cytoskeletal dynamics by inhibiting pro- 
tein kinase C-mediated phosphorylation of nonmuscle myosin 
heavy chain (59). 

LPS induction of stathmin phosphorylation (Table TV and 
Fig. 2) may represent another mechanism by which the cy- 
toskeleton is remodeled. Stathmin is a phosphoprotein report- 
edly involved in both signal transduction and in regulation of 
the microtubulin filament network; furthermore, phosphoryla- 
tion of stathmin has been reported to modulate its tubulin- 
binding avidity (60). Inferences can be made about both the 
phosphorylation site on P0 4 -stathmin and the responsible ki- 
nase induced by LPS. Four phosphorylation sites in stathmin 
have been well described: Ser lti , Ser 25 , Ser 38 , and Ser 63 (32, 33); 



Ser 16 has been reported as a substrate for Ca 2+ /calmodulin 
(CaM)-dependent kinases (32), and Ser 25 as primarily a sub- 
strate for p38 and ERK (33), with p34 nd,<2 also active but bear- 
ing a 5-fold preference for Ser 30 (34). As stated above, the 
phosphopeptidc identified in PO^stathmin, extending from 
residues 15 to 27 (1468.7 Da), is consistent with phosphoryla- 
tion of either Ser ls or Ser 25 (Fig. 2). Although both p386 and 
p38a MAPK isoforms are expressed in the human PMN, LPS 
has been shown to selectively activate the p38a isoform in 
human PMNs (9). The p36a isoform, however, has been shown 
to be relatively inactive nt Ser 25 ; in fact, p386 is -100-fold more 
active at Ser 25 , and selective p38a inhibitors do not inhibit the 
stress-activated phosphorylation of stathmin in 293 cells (33). 
Further support for the lack of involvement of p38 signaling in 
phosphorylation of stathmin in our system is the apparent lack 
of effect of SB203580 (a selective p38a and p380 inhibitor) on 
LPS-induced expression of P0 4 -stathmin (Table IV). Because 
p34 cdc2 is relatively inactive at Ser 25 (34), we conclude that the 
phosphorylation site is likely to be Ser 16 , a reported substrate 
of CaM-dependent kinase. Although CaM .kinases have previ- 
ously been implicated in gene activation in LPS-exposed my- 
elomonocytic HD11 cells (61), stathmin signaling has not, to 
our knowledge, been previously reported in either PMNs or 
lipopolysaccharide signal transduction. 

Cytoskeletal reorganization, a well-described regulator of 
granule release (62), may underlie LPS-induced priming for 
PMN granule release, but several LPS-regulated proteins may 
provide more specific clueB. LPS exposure led to increased 
levels of grancalcin, a calcium-binding protein previously de- 
tected in PMNs and shown to translocate to granules und 
plasma membrane in the presence of physiologic concentra- 
tions of calcium (63). Similarly, annexin HI, a calcium-binding 
protein highly expressed in PMN granule membranes and im- 
plicated in calcium-mediated secretion (64) and in granule fu- 
sion (65), was also found to be up-regulated. Exocytosis of 
granule contents may also be facilitated by LPS up- regulation 
of Rab-GDP dissociation inhibitor (Table III), which has been 
proposed to recycle Rab after vesicle fusion by extracting it 
from the membrane and loading it onto newly formed transport 
intermediates (66). 

Parallel use of DNA microarrays and proteomics affords a 
powerful strategy for comparison of corresponding mRNA tran- 
scripts nnd proteins, thereby affording new insight into the 
mechanisms by which the cell regulates its signaling responses 
to the external environment. Of interest, a poor correlation was 
found between corresponding transcripts and proteins (Table 
VIE), as reported in other systems (17, 18). The finding in some 
cases of unchanged transcript abundance in the face of regu- 
lated protein levels indicates post-transcriptional modulation 
following LPS exposure. The finding of undetected transcripts 
in the face of regulated Levels of the corresponding proteins 
may indicate previous transcri ption of these genes in an earlier 
state of the myeloid maturation of the PMN, producing stable 
protein species that have undergone post-translational alter- 
ation following LPS exposure. The use of SB203580, a p38 
inhibitor, adds further insights into the mechanisms of LPS 
regulation. At the level of mRNA expression, SB203580 inhib- 
ited 23% of LPS-stimulated genes by >AQ% and 11% of genes 
by ^60%; therefore, p38 plays a specific role in gene regulation 
in the PMN. In particular, proteasome 0 chain was up-regu- 
tated at both the mRNA transcript and protein level (Table 
VIII), with no notable effect of SB203580 on expression at 
either level, consistent with a non-p38-mediated pathway of 
primary transcriptional up-regulation induced by LPS. Simi- 
larly, CAP1, Rho-GAPl, and ficolin 1 were down-regulated at 
both the mRNA transcript and protein level (Table VTII), with 
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no notable effect of SB203580, consistent with a non-p38-me- 
diated pathway of primary transcriptional down-regulation. 
Interestingly, anriexin III was down-regulated at the transcript 
level and up-regulated at the protein level, with an inhibitory 
effect of SB203580 seen only at the protein level (Table VII), 
consistent with a p38-niediated post-transcriptional up-regula- 
tion induced by LPS. 

Limitations of the present study should be noted. Gene ex- 
pression analysis by cDNA microarrays does not distinguish 
between transcriptional regulation and mRNA stabilization; 
similarly, two-dimen sional PAGE proteomics by itself does not 
distinguish among transcriptional, translational, or post-trans- 
lational regulation of protein abundance. Transcript detection 
by microarray technology is limited to the probes included; 
protein identification by two-dimensional PAGE proteomics is 
limited to well -resolved regions of the gel, may perform less 
well with hydrophobic and high molecular weight proteins, and 
tends to select for more abundant protein species (30). Harvest- 
ing of the LPS-incubated PMNs at 4 h may have prevented 
detection of earlier, transient changes and may have thereby 
introduced artifactual transcript-protein discordance. Further- 
more, the post-LPS incubation, pre-two-dimensional PAGE cell 
washes would be expected to remove secreted proteins from 
further analysis, with uncertain effects on detected protein 
abundance depending on such factors as the degree of rfc novo 
synthesis and extent of degranulation/exocytosis. Because pro- 
tein binding of Coomassie Blue has a limited dynamic range 
and is typically not linear throughout the range of detection, 
image analysis of Coomassie Blue-stained protein spots should 
be considered semi-quantitative. For some protein spots, the 
apparent magnitude of regulation by LPS may have been 
blunted by the spot approaching staining saturation in the 
control gel. By limiting our analysis to those protein spots 
common to all twelve pH 3.0-10.0 two-dimensional gels, we 
likely excluded, some LPS-regulated proteins that happened to 
be either poorly resolved on a subset of the gels or unmatched 
by the image analysis software. By further limiting the analy- 
sis to those matched spots on the pH 3.0-10.0 gels for which a 
two-tailed t test demonstrated p < 0.05, the list of regulated 
proteins was likely also limited by statistical power. In addition 
to those regulated proteins listed in Table III, three others were 
up-regulated and three down-regulated with p < 0.09 (data not 
shown). 

Limiting our reported results to those changes that met 
statistical significance among the donors carries further impor- 
tant implications. We have encountered a two order of magni- 
tude range of response in unselected donor LPS-induced PMN 
functions, such as TNF-a and superoxide anion release (data 
not shown). The sources of this physiologic heterogeneity re- 
main uncertain but may possibly include such factors as nat- 
ural mutations of the LPS receptor component, TLR4 (67). By 
selecting for LPS effects common to all donors, we may not have 
characterized the range of genomic and proteowic heterogene- 
ity present in the population and thereby may have focused on 
only a narrow portion of a broader biological response to LPS. 
We contend that this reductionist approach is valid because it 
would be expected to enrich for biologically integral responses 
of the PMN to LPS. Nevertheless, correlation of genomic and 
proteomic profiles with functional phenotypes of the PMN may 
bear important diagnostic and therapeutic implications and 
will be pursued in future studies. 

Widespread regulation of numerous noncytokine/chemokine 
genes and proteins in the LPS-stimulated human PMN is a 
novel finding. These data indicate that, despite a narrow scope 
of gene expression in the nonstimulated state, the terminally 
differentiated, short lived PMN likely plays a role in the innate 



immune response that is far more sophisticated and dynamic 
than the simple release of preformed inflammatory mediators. 
Although gene expression appears to be an important mecha- 
nism by which PMNs respond acutely to infection, mRNA tran- 
script/protein concordance is limited, and post-transcriptional 
(and post-translational) modifications also play an important 
role. The alteration of multiple transcriptional regulators, G- 
protein regulators, P0 4 -stathmin, and protein phosphatase 1 
indicates that one of the responses to LPS exposure is to modify 
subsequent signaling events by bacterial components or by 
other cytokines and chemokines. Finally, the finding that p38 
MAPK mediates LPS regulation of a limited subset of tran- 
scripts and proteins underlines the continuing need to define 
signal transduction cascades in the neutrophil. 
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is that risk relates to the environment and 
human health. On the other hand, recent 
studies have repeatedly shown diat public 
hesitance also includes a number of ethical 
issues (e.g. f market dominance of a few large 
companies and GM crops threatening 
natural or divine orders, refs 
1 ,2). Our worry is that the 
US government is 
neglecting widespread 
concerns of the European 
public diat include more 
than environmental risk 
and human health. 

Research carried out by 
our group in Denmark 1 
indicates that, although 
many people are confident 
that the public authorities 
are able to manage the risks 
here and now, people are less confident about 
their ability to handle long-term effects 
because of the scientific uncertainty. 
Attempts to conceal these or other limits to 
scientific knowledge do not prevent 
controversies from arising; rather, the 
opposite happens because trust in business, 
scientific experts and public authorities is 
undermined (witness the handling of the 
BSE controversy in die United Kingdom). 

In the long run, a policy of openness about 
the different dimensions of uncertainty 
would be more likely to increase trust in 
scientific risk assessment. Of course, this 
will not guarantee public acceptance of GM 
food, but experience in Europe shows that 
transparency and dialog are prerequisites for 
decreasing concerns about new technology. 




The argument that the EU's resistance 
to GM food has had negative consequences 
for developing countries, denying them 
access to a technology that could alleviate 
food provision, is regarded sympathetically 
by many among the European public. 

Indeed, here most people 
abandon the simple 
dichotomy between 
'unacceptable' GM food and 
the much more acceptable 
medical uses. This is because 
GM foods 

are seen as a means to help 
people in distress. Many 
counter such humanitarian 
uses, however, by the 
observation that, in general, 
GM crops are developed not 
to benefit people in the 
developing world, but to make money. 
Needless to say, according to those who 
point this out, making money is not in 
itself an acceptable objective. Thus, the fear 
is that the benefits will never accrue to 
those who are at present suffering. 

Kristian Borch, Jesper lessen 
& Rikke B Jargensen 

Centre for Hioethks and Risk Assessment, 
Systems Analysis Department, 
PO Box 49, DK-4000 Reskilde, Denmark 
e-mail: kristian.borch@risoe.dk 

1. Lassen, J. ef si. Bioprccess Bhsyst. Cng. 24, 
263-271 (2002). 

2. Wagnw, W. et si. in Qiatecttnoicgy 1996-2000. The 
Years of Controversy [Gasket, G. & EJaver, M.W., eds.) 
80-55 (Science Museum, London, 2001} 



Mining the literature and large 
datasets 



(or perhaps was exacerbated by) a UK 
government seen to be welcoming of GM 
foods and crops. Another negative was that 
it was major transnational corporations — 
another questionable community in the 
eyes of much of the public here — that 
were seeking to push their new products 
onto the public without previous debate 
and without there being any perceptible 
benefit. And finally, the potentially negative 
impact of GM crops on organic farmers — 
who are seen by some as crucially 
important for the sustainable future of 
food production — and the relatively small 
scale of agricultural production in the 
United Kingdom (and Europe) have also 
been important issues. 

The question to be answered, therefore, 
is not how to force the EU to accept GM 
foods and crops against its own public 
opinion, but how to change public opinion 
in the EU. The UK government is currendy 
conducting several exercises that it hopes 
will provide the facts to support a relaxation 
of die moratorium on growing GM crops. 
These include a major review of the costs 
and benefits of GM crops (just finished), 
a scientific review of the issues ( also now 
finished), a series of crop trials (results in 
September) and a public debate on GM 
crops, 'GM nation* (just finished). 

Whether these will change attitudes is 
moot: the costs-and-benefits review has 
concluded that the economic value of die 
few currently available GM crops that could 
be grown in the UK is likely to be limited 
because of negative consumer attitudes to 
GM foods. 

iVf II Barnes 

J The Spinney, 

Watford WDI74QF, UK 

e-mail: \1Barnes466<$nolxom 

To the editor: 

Several articles in the July and August Issues 
of Nature Biotechnology (21, 735-738, 2003; 
21, 852-854, 2003) discuss whether the US 
strategy of forcing the European Union 
(EU; Brussels, Belgium) to accept GM foods 
by referring to World Trade Organisation 
(WTO; Geneva, Switzerland) rules will bear 
fruit We dp not believe so — rather the 
opposite. 

A central claim in the arguments of 
both President Bush and US commerce 
representative Robert B. Zoellickis that the 
risk of GM foods is negligible. The veracity 
of that statement, however, depends on what 
is defined as risk- A common understanding 



To the editor: 

tn the accelerating quest for disease 
biomarkers, the use of high-throughput 
technologies, such as DNA microarrays 
and proteomics experiments, has produced 
vast datasets identifying thousands of 
genes whose expression patterns differ in 
diseased versus normal samples. Although 
many of these differences may reach 
statistical significance, they are not always 
biologically meaningful. For example, 
reports of mRNA or protein changes of 
as little as two-fold arc not uncommon, 
and although some changes of this 
magnitude turn out to be important, most 



are attributable to disease-independent 
differences between the samples. Evidence 
gleaned from other studies linking genes to 
the disease is helpful, but with such large 
datasets, a manual literature review is often 
not practical. Thus, the power of these 
emerging technologies— the ability to 
quickly generate large sets of data — has 
challenged current means of evaluating 
and validating these data. One study from 
1999, for example, reveals that a researcher 
would have to scan 130 different journals 
and read 27 papers per day to follow a 
single disease, such as breast cancer 1 . 
To address this need, my group at 
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Harvard recently developed a freely 
accessible automated literature- 
mining tool, termed MedGene, that 
comprehensively summarizes the 
relationships among over 50,000 named 
human genes (and their synonyms) and 
over 4,000 human diseases from over 
12 million records in Medline 
(http://hipseq.med.harvard.edu/MedGcne). 
Several key features of this resource are 
worth noting. First, MedGene is not 
limited to any specific relationship type, 
but rather encompasses all reported 
gene-dtsea.se links, including the 
genetic, biochemical, pharmacological, 
epidemiological and physiological. Second, 
the database assigns a mathematical score 
summarizing the strength of the 
association between the disease and the 
gene, which allows semiquantitative 
analysis and organizes the genes in rank 
order. Finally, the relationships are 
identified automatically by advanced text 
searching and filtering algorithms that 
result in low 

rates of false -positive and false- negative 
linkages 2 , fn one query, MedGene 
identified nearly 2,400 breast 



cancer-related genes, whereas the same 
search in lour commonly used databases 
yielded a combined total of 286 genes, 
260 of which were iucludcd in the 
MedGene list 1 " 5 . 

A summary of all gene- disease 
relationships offers the unique opportunity 
to both evaluate and validate the outcome 
of high -throughput experiments. For 
example, we used MedGene to analyze a 
DNA inicroarray experiment in which over 
2,000 genes demonstrated statistically ^ 
significant differences in expression 
between normal breast tissue and breast 
cancer. It was able to identify the subset of 
these genes previously described as breast 
cancer- related genes in the literature. To 
determine whether gene expression level 
correlated with the strength of the 
association between gene and breast 
cancer, we plotted gene expression levels 
against the breast cancer literature 
relationship scores assigned by MedGene. 
Interestingly, 

there is no correlation when considering 
expression differences as high as fivefold; 
however, a significant correlation is 
observed (r = 0.41; P - 0,05) among genes 



showing a difference of tenfold or more. 
Thus, lor tills experiment, expression level 
differences as high as fivefold cannot be 
attributed to the disease without 
corroborating evidence. It will be 
interesting to learn if similar results hold 
for other diseases and other experiments. 

As the search for disease biomarkers and 
drug targets comes to rely increasingly 
upon genomic-scale technologies, demand 
will grow for automated resources, such as 
MedGene, that help process the resulting 
data volume. 

Joshua lABaer 

Institute of Protcomics, 
Harvard Medical School, 
250 Longwood Ave., BCMP, 
Boston, Massachusetts 021 15 t USA 
e-tttail: josh&?hms.harvard.edu 

1. Oaasiri, R.A., Giasser, S.R., Stetten, O.L. 4 
Wheeler, D.A. Oncogene 18. 7958-7955 (1999). 

2. Hu, V. etal. J. Proteome Res. Z, 405-412 (2003). 

3. Stetfen, D.L., Levine, A.E., Yarus, S., Baasin, R.A. 
& Wheeier, D.A. 8ioinformatics 16, 539-649 
(2000). . 

4. Bairoch, A. & Apweiier, R. Nucleic Acids Res. 28, 
45-48(2000). 

5. Rebhan, M., Chatita-Caspi, V., Prilusky, J. & lancet, 
D. Trends Genet. 13. 163(1997). 



n 

3 

a. 

a> 

2 

CO 



CO 

s 

CM 

© 



r 



NATURF BIOTECHNOLOGY VOLUME 21 NUMBER 9 SEPTEMBER 200.) 



977 



Molecular and Cellular Biology, Mar. 1999, p. 1720-1730 
0270-7306799/$04.00+0 

Copyright © 1999, American Society for Microbiology. All Rights Reserved. 



Vol. 19, No. 3 



Correlation between Protein and mRNA Abundance in Yeast 
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We have determined the relationship between mRNA and protein expression levels for selected genes 
expressed in the yeast Saccharomyces cerevisiae growing at mid-log phase. The proteins contained in total yeast 
cell iysate were separated by high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots 
were excised and identified by capillary liquid chromatography-tandem mass spectrometry (LC-MS/MS) 
Protein spots were quantified by metabolic labeling and scintillation counting. Corresponding mRNA levels 
were calculated from serial analysis of gene expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang 

m^°?« J ' ^f 1S «? in ; M ' A ' BaSnii ' D ' E ' Bassett > J r -> p - H^ter, B- Vogelstein, and K. W. Kinzler, Cell 
*S:Z43-251, 1^7). We found that the correlation between mRNA and protein levels was insufficient to predict 
protein expression levels from quantitative mRNA data. Indeed, for some genes, while the mRNA levels were 
of the same value the protein levels varied by more than 20-fold. Conversely, invariant steady-state levels of 
certain proteins were observed with respective mRNA transcript levels that varied by as much as 30-fold 
Another interesting observation is that codon bias is not a predictor of either protein or mRNA levels. Our 
results clearly delineate the technical boundaries of current approaches for quantitative analysis of protein 
expression and reveal that simple deduction from mRNA transcript analysis is insufficient. 



The description of the state of a biological system by the 
quantitative measurement of the system constituents is an es- 
sentia! but largely unexplored area of biology. With recent 
technical advances including the development of differential 
display-PCR (21), of cDNA miexoarray and DNA chip tech- 
nology (20, 27), and of serial analysis of gene expression 
(SAGE) (34, 35), it is now feasible" to establish globai and 
quantitative mRNA expression profiles of cells and tissues in 
species for which the sequence of all the genes is known. 
However, there is emerging evidence which suggests that 
mRNA expression patterns are necessary but are by them- 
selves insufficient for Lhe quantitative description of biological 
systems. This evidence includes discoveries of pos transcrip- 
tional mechanisms controlling the protein translation rate. (15), 
the half-lives of specific proteins or mRNA* (33), and the 
intracellular location and molecular association of the protein 
products of expressed genes (32). 

Proteome analysis, defined as the analysis of the protein 
complement expressed by a genome (26), has been suggested 
as an approach to the quantitative description of the state of a 
biological system by the quantitative analysis of protein expres- 
sion profiles (36). Proteome analysis is conceptually attractive 
because of its potential to determine properties of biological 
systems that are not apparent by DNA or mRNA sequence- 
analysis alone. Such properties include the quantity of protein 
expression, the subcellular location, the state of modification, 
and the association with ligands, as well as the rate of change 
with lime of such properties. In contrast to the genomes of a 
number of microorganisms (for a review, see reference 1 1) and 
the transcriptome of Saccharomyces cerevisiae (35), which have 
been entirely determined, no proteome map has been com- 
pleted to date. 

The most common implementation of proteome analysis is 
the combination of two-dimensional gel electrophoresis (2DE) 
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(isoelectric focusing-sodium dodecyl sulfate [SDS]-polyacryl- 
amide gel electrophoresis) for the separation and quantitation 
of proteins with analytical methods for their identification. 
2DE permits Lhe separation, visualization, and quantitation of 
thousands of proteins reproducibfy on a single gel (18, 24). By 
itself, 2DE is strictly a descriptive technique. The combination 
of 2DE with protein analytical techniques has added the pos- 
sibility of establishing the identities of separated proteins (1, 2) 
and thus, in combination with quantitative mRNA analysis, of 
correlating quantitative protein and mRNA expression mea- 
surements of selected genes. 

The recent introduction of mass spectrome trie protein anal- 
ysis techniques has dramatically enhanced the throughput and 
sensitivity of protein identification to a level which now permits 
the large-scale analysis of proteins separated by 2DE. The 
techniques have reached a level of sensitivity that permits the 
identification of essentially any protein that is detectable in the 
gels by conventional protein staining (9, 29). Current protein 
analytical technology is based on the mass spectrometric gen- 
eration of peptide fragment patterns that are idiotypic for the 
sequence of a protein. Protein identity is established by corre- 
lating such fragment patterns with sequence databases (10, 22, 
37). Sophisticated computer software (8) has automated the 
entire process such that proteins are routinely identified with 
no human interpretation of peptide fragment patterns. 

In this study, we have analyzed the mRNA and protein levels 
of a group of genes expressed in exponentially growing cells of 
the yeast S. cerevisiae. Protein expression levels were quantified 
by metabolic labeling of the yeast proteins to a steady state, 
followed by 2DE and liquid scintillation counting of the se- 
lected, separated protein species. Separated proteins were 
identified by in-gel tryptic digestion of spots with subsequent 
analysis by. microspray liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) and sequence database searching. 
The corresponding mRNA transcript levels were calculated 
from SAGE frequency tables (35). 

This study, for the first time, explores a quantitative com- 
parison of mRNA transcript and protein expression levels for 
a relatively large number of genes expressed in the same met- 
abolic state. The resultant correlation is insufficient for predic- 
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t A in t ^ a P; oteom f. ana ^ s . b y 2DE and mass spectrometry. In part 1, proteins are separated by 2DE, stained spots are excised and subjected 

to in-gel digestion with trypsin, and the resulting peptides are separated by on-line capillary high-performance liquid chroma ography in part 11 a peptide^ shown 
elutjng from the column in part The peptide is ionized by electrospray ionization and'enters the mass spectrometer. The mass of the ionke^ 

TJJ^f} ^ ^T* 0 " These peptide fragments are detected as a tandem mass (MS/MS) spectrum in the third quadripole mass filter wh«e two L 

^^i^ inWafd fr ° m thC N C lCnniDi ° fthe JD ,U > ^ M5J/MS spectrum from he 

£^hIS^£ OT 5 " ffl P ared «o predicted tandem mass spectra computer generated from a sequence database. Provided that the peptide sequence exists in 
the da abase, the peptide and, by association, the protein from which the peptide was derived can be identified. Unambiguous protein identificaUou is attained linTZte 
analysis because multiple peptides are identified as being derived from the same protein P iacnimcal « J n is attained in a single 



tion of protein levels from mRNA transcript levels. We have 
also compared the relative amounts of protein and mRNA 
with the respective codon bias values for the corresponding 
genes. This comparison indicates that codon bias by itself is 
insufficient to accurately predict either the mRNA or the pro- 
tein expression levels of a gene. In addition, the results dem- 
onstrate that only highly expressed proteins are detectable by 
2DE separation of total cell lysates and that therefore the 
construction of complete proteome maps with current technol- 
ogy will be very challenging, irrespective of the tvpe of organ- 
ism. 

MATERIALS AND METHODS 

Yens! strain and growth conditions. The source of protein and message tran- 
scripts for all experiments was YPH499 (MAT a ura3-52 Iyx2-8QJ adc2-101 
Ieu2-M his3M0O trpt-^63) (30). Logarithmically growing ceils were obtained by 
growing yeast cells to early log phase (3 X 10 6 cclis/rol) in YPD rich medium 
(YPD supplemented with 6 mM uracil, 4.8 mM adenine, and 24 mM tryptophan) 
at 30*C (35). Metabolic labeling of protein was accomplished in YPD medium 



exactly as described elsewhere (4) with the exception that 1 ml of cells was 
labeled with 3 mCi to offset methionine present in YPD medium. Protein was 
harvested as described by Garrels and coworkers (12). Harvested protein was 
lyophilized, resuspended in isoelectric focusing gel rehydration solution, and 
stored at -80°C 

2DE. Soluble proteins were run in the first dimensiou by using a commercial 
flatbed electrophoresis system (MuJtiphor J J; Pharmacia Biotech). Immobilized 
polyacrylaniide gel (IPG) dry strips with nonlinear pH 3.0 to 10.0 gradients 
(Amersham-Pharmacia Biotech) were used for the first-dimension separation. 
Forty micrograms of protein from whole-cell lysates was mixed with IPG strip 
rehydration buffer (8 M urea, 2% Nonidet P-40, 10 mM dithiothreitol), and 250 
to 380 u.1 of solution was added to individual lanes ot an IPG strip rehydration 
tray (Amersham-Pharmacia Biotech). The strips were allowed to rehydrate at 
room temperature for 1 h. The samples were run at 300 V-10 mA-5 W for 2 h, 
then ramped to 3,500 V-10 mA-5 W over a period of 3 b, and then kept at 3,500 
V-10 mA-5 W for 15 to 19 h. At the end of the first-dimension run (60 to 70 kV • 
h), the IPG strips were reequilibrated for 8 min in 2% (wt/vol) dithiothreitol in 
2% (wt/vol) SDS-6 M urea-30% (wt/vol) grycerol-0.05 M Tris HC1 (pH 6.8) and 
for 4 min in 2.5% iodoacet amide in 2% (wt/vol) SDS-6 M urea-30% (wt/vol) 
giycero)-0.05 M Tris HQ (pH 6.8). Following reeuuilibratioo, the strips were 
transferred and apposed to 10% polyacrylamide second-dimension gels. Poly- 
acrylamide gels were poured in a casting stand with 10% acrylamidc-2.67% 
piperazine diacryiamidc-0.375 M Tris base-HCI (pH M)-iU % (wtVol) SDS-0.05% 
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FIG. 2, 2D silver-stained gel of the proteins in yeast total cell lysate. Proteins were separated in the first dimension (horizontal) by isoelectric focusing and then in 
the second dimension (vertical) by molecular weight sieving. Protein spots (156) were chosen to include the entire range of molecular weights, isoelectric focusing points 
and staining intensities. Spots were excised, and the corresponding protein was identified by mass spectrometry and database searching. The spots are labeled oa the 
gel and correspond to the data presented in Table 1. Molecular weights tire given in thousands. 



(wt/vol) ammonium persulfate-0.05% TEMED (iY^/^V'-tetramethylethyl- 
enediamine) in MiUi-Q water. The apparatus used to run second-dimension gels 
was a noncommercial apparatus from Oxford Glycosciences, Inc. Once the IPG 
strips were apposed to the second-dimension gels, they were immediately run at 
50 uiA (constam)-500 V^S5 W for 20 win, followed by 200 uiA (consta'nt)-500 
V-85 W until the buffer front line was 10 to 15 mm from the bottom of the gel. 
Gels were removed ami silver stained according to the procedure of Shevchenko 
et ai. (29). 

Protein identification. Gels were exposed to X-ray film overnight, and then the 
silver staining and film were used to excise 156 spots of varying intensities, 
molecular weights, and isoelectric focusing points. In order to increase the 
detect ton limit by mass spectrometry, spots were cut out and pooled from up to 
four identical cold, silver-stained gels. In-gcl tryptic digests of pooled spots were 
performed as described previously (29). Tryptic peptides were analyzed by nii- 
crocapillary 1X-MS with automated switching to MS/MS mode for peptide 
fragmentation. Spectra were searched against the composite OWL protein se- 
quence database (version 30.2; 250,514 protein sequences) (24a) by using the 
computer program Sequcst (8), which matches theoretical and acquired tandem 
mass spectra. A ptotetu match was determined by comparing the number of 
peptides identified and their respective cross-correlation scores. All protein 
idea tificat ions were verified by comparison with theoretical molecular weights 
and isoelectric points. 



mRNA quantitation. Velculescu and coworkers have previously generated 
frequency tables foi yeast aiRNA transcripts from the same strain grown under 
the same stated conditions as described herein (35). The SAGE technology is 
based oa two main principles. First, a short sequence tag (15 bp) that contains 
sufficient information uniquely to identify a transcript is generated. A single tag 
is usually generated from each mRNA transcript in the cell which corresponds to 
15 bp at the 3'-most cutting site for Mailt Second, many transcript tags can be 
concatenated into a single molecule and then. sequenced, revealing the identity of 
multiple tags simultaneously. Over 20,000 transcripts were sequenced from yeast 
strain YFH499 growing at mid-log phase on glucose. Assuming the previously 
derived estimate of 15,000 mRNA molecules pei cell (16), this would represent 
a 1.3-fold coverage even for mRNA molecules present at a single copy per cell 
and would provide a 72% probability of detecting such transcripts, Computer 
software which took for input the gene detected, examined the nucleotide se- 
quence, and performed the calculation as described by Velculescu and coworkers 
(35) was written. In practice, we found that for 21 of 128 (16%) genes examined 
viable mRNA levels from SAGE data could not be calculated. This was because 
(i) no CATG site was found in the open reading frame (ORF), (ii) a CATG site 
was found but the corresponding 10-bp putative SAGE lag was not found ia the 
frequency tables, or (ui) identical putative SAGE tags were present for multiple 
genes (e.g., TDH2_YEAST and TDH3_YEAST). 
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TABLE 1. Expressed genes identified from 2D gel in Fig. 2 TABLE 1— Continued 



Mnl wt 




Spot no. 


TnJ gene 


Protein 
abundance 

cell) 


uiRNA 
abundance 
(copics/ccil) 


Codon 
bias 


Mnl wt 


Pi 


Spot Tit). 


YPD gene 
namtz a 


Proiein 
abundance 
(10 5 copies/ 
cell) 


niRNA 
abundance 
(copics/cc)J) 


Codon 
bias 


17,259 


6.75 


133 


CPRI 


15.2 


61.7 


0.769 


39,477 


5.58 


86 


FBA1 


17.8 


183.6 


0.935 


18,702 


4.80 


83 


EGD2 


20.1 


5.2 


0.724 


39,477 


5.58 


87 


FBA1 


427.2 


183.6 


0.935 


18,726 


4.44 


147 


YKL056C 


612 


88.4 


0.831 


39,540 


6.50 


150 


HOM2 


60.3 


4.5 


0.592 


18,978 


5.95 


135 


YER067W 


3.7 


6.7 


0.118 


39,561 


6.12 


156 


PSA1 


96.4 


27.5 


0.718 


19,108 


5.04 


130 


YLRioyw 


94.4 


9.7 


0.680 


41,158 


6.01 


49 


YNL134C 


14.9 


1.5 


0.316 


19,681 


9.08 


136 


ATP7 


U.O 


NA** 


0.246 


41,623 


7.18 


58 


BAT2 


19.0 


8.9 


0.250 


20,505 


6.07 


111 


GUK1 


16.5 


3.7 


0.422 


41,728 


7.29 


110 


ERG10 


24.1 


4.5 


0.543 


21,444 


5.25 


148 


SARI 


5.4 


10.4 


0.455 


41,900 


5.42 


74 


TOM40 


213 


2.2 


0.375 


21,583 


4.98 


95 


TSA1 


110.6 


40.1 


0.S45 


42,402 
42,883 


6.29 


45 


CYS3 


6.7 


8.9 


0.621 


22,602 


4.30 


80 


EFB1 


66.1 


23.8 


0.875 


5.63 


67 


DYS1 


15.8 


5.2 


0.526 


23,079 


6.29 


112 


SOD2 


12.6 


Z2 


0.351 


43,409 


6.31 


107 


SER1 


10.5 


1.5 


0.292 


23,743 


5.44 


137 


HSP26 


NA d 


0.7 


0.434 


43,421 


5.59 


91 


ERG6 


2.2 


14.1 


0.408 


24,033 


5.97 


96 


ADK1 


17.4 


16.4 


0.656 


44,174 


7.32 


56 


YBR025C 


13.1 


6.0 


0.684 


24,058 


4.43 


143 


YKL117W 


29.2 


10.4 


0.339 


44,682 


4.99 


72 


TIF1 


2.9 


39.4 


0.834 


24.353 


6.30 


140 


TFS1 


8.1 


0.7 


0.146 


44,707 


7.77 


108 


PGK1 


13.7 


165.7 


0.897 


24,662 


5.85 


99 


URA5 


25.4 


6.0 


0.359 


44,707 


7.77 


109 


PGK1 


315.2 


165.7 


0.897 


24,808 


6.33 


97 


GSP1 


26.3 


5.2 


0.735 


46,080 


6.72 


30 


CAR 2 


15.4 


NA" 


0.495 


24.908 


8.73 


122 


RPS5 


18.6 


NA C 


0.899 


46,383 


8.52 


53 


TDPI 


7.7 


0.7 


0.436 


25,081 


4.65 


81 


MRP8 


9.3 


NA C 


0.241 


46,553 


5.98 


47 


IDP2 


32.4 


NA e * 


0.197 


25,960 


6.06 


116 


RPE1 


5.8 


0.7 


0.372 


46,679 


6.39 


50 


ENOl 


35.4 


0.7 


0.930 


26,378 


9,55 


127 


RPS3 


96.8 


NA e 


0.863 


46,679 


6.39 


51 


ENOl 


6.6 


0.7 


U.7JU 


26,467 


5.18 


100 


VMA4 


10.5 


3.7 


0.427 


46,679 


6.39 


52 


ENOl 


2.2 


0.7 


u.^ou 


26,661 


5.84 


98 


TPU 


NA d 


NA' : 


0.900 


46,773 


5.82 


63 


EN02 


15.5 


289.1 


u.yov 


27,356 


5.56 


93 


PRE8 


6.9 


0.7 


0.129 


46,773 


5.82 


64 


EN02 


635.5 




U.7UU 


27,334 


6.13 


1.15 


YI-IR049W 


18.4 


12 


0.520 


46,773 


5.82 


65 


ENH2 




9RQ 1 


u.you 


27,472 


5.33 


92 


YNL0I0W 


31.6 


3.7 


0.421 


46J73 


5.82 


66 


EN02 


31.0 


9RQ 1 




27,480 


8.95 


123 


GPM1 


10.0 


169.4 


0.902 


47,402 


6.09 


126 


COR1 


15 


11 7 


n An 


27,480 


8.95 


124 


GPMl 


231.4 


169.4 


0.902 


47,666 


8,98 


54 


AAT2 


XI. t 


A ft 
O.U 


fl HQ 


27,480 


8.95 


125 


GPM1 


7.5 


169.4 


0.902 


48,364 


5.25 


73 


WTM1 
w i (vi i 


/4-J 


1 J.4 


n ice 


27,809 


5.97 


139 


HOR2 


5.7 


0.7 


0381 


48,530 


6.20 


DI 


ivm ji/ 


JO A 


zy.u 


ft Cl£ 


27,874 


4.46 


78 


YST1 


13.6 


52.8 


0.805 


48,904 


5.18 


69 


T V*?Q 

i— 1 


if, 1 


j. / 


U.40J 


28,595 


4.51 


41 


PXJP2 


4.4 


0.7 


0.147 


48,987 


4.90 


153 


SUP45 




11 n 
1 1.7 


fl '2T7 


29,156 


6.59 


114 


YMR226C 


14.5 


2.2 


0.283 


49J27 


5.47 


70 


PR02 


J.J.D 


5 9 




29,244 


8.40 


120 


DPMI 


5.0 


11.2 


0.362 


49,912 


9.27 


62 


TEF2 


JJO.J 


78*? n 




29,443 


5.91 


48 


PRE4 


3.4 


3.7 


0.162 


50*444 


5.67 


35 


YDR190P 


4.8 


12 




30,012 


6.39 


138 


PRB1 


21.2 


1.5 


0.449 


50,837 


6.11 


32 


YEL047C 


3.8 


1.5 


fl 3R7 


30,073 


4.63 


77 


BMH1 


14.7 


28.2 


0.454 


50,891 


4.59 


151 


TUB2 


11.2 


7 d 


fl ACIA 
U.HUh 


30,296 


7.94 


121 


OMP2 


67.4 


41.6 


0.499 


51,547 


6.80 


27 


LPD1 


18.9 


12 


U.JJ J. 


30,435 


6.34 


89 


GPP1 


70.2 


11.2 


0.703 


52,216 


7.25 


29 


SHM2 


19.7 


7.4 


0.722 


31,332 


5.57 


88 


ILV6 


13.9 


3.0 


0.402 


52,859 


5.54 


37 


YFR044C 


30.2 


6.7 


0.442 


32,159 


5.46 


113 


IPP1 


63.1 


3.7 


0.752 


53,798 


5.19 


71 


HXK2 


26.5 


7.4 


0.756 


32,263 


6.00 


149 


HIS1 


214 


4.5 


0.232 


53,803 


6.05 


145 


GYT6 


4.4 


0.7 


0.147 


33,311 


5.35 


84 


SPE3 


15.1 


6.7 


0.468 


54,403 


5.29 


39 


ALD6 


37.7 


12 


0.664 


34,465 


5.60 


129 


ADE1 


8.7 


5.2 


0.305 


54,403 


5.29 


40 


ALD6 


6.6 


12 


0.664 


34,762 


5.32 


85 


SEC 14 


10.9 


6.0 


0.373 


54,502 


6.20 


31 


ADE13 


6.3 


1.5 


0,417 


34,797 


5.85 


42 


URA1 


49.5 


8.9 


0.137 


54,543 


7.75 


25 


PYK1 


225.3 


101.8 




34,799 


6.04 


90 


BEL1 


103.2 


81.0 


0.875 


54,543 


7.75 


26 


PYK1 


39.8 


101 8 




35,556 


5.97 


43 


YDL124W 


6.4 


4.5 


0.206 


55*221 


6.66 


146 


i * * 1 * f 1 VT 


16.3 


J.U 


n 7 a a 


35,619 


8.4! 


59 


TDH1 


69.8 


32.T 


0.940 


55,295 


4.35 


134 


PD11 


66.2 


14 1 


n ^bq 

U_Ja5' 


35,650 


5.49 


68 


CAR! 


5.2 


3.0 


0.339 


55,364 


5.98 


24 


GIJC1 


• 22^6 


fx n 

O.U 




35,712 


6.72 


117 


TDH2 


49.6 


473.0 C 


0,982 


55,481 


7.97 


118 


ATPI 


21.6 


9 *> 




35,712 


6.72 


154 


TDH2 


863.5 


473.0* 


0.982 


55,886 


6.47 


28 


r*v^4 


79 9 


NT A c 


fl AAA 
U.444 


35,712 


6.72 


155 


TDH2 


79.4 


473.0 1 '' 


0.982 


56 167 


5.83 


D D 




\A 1 

14. j 


1 f\ 
J.U 


0.324 


36,272 


4.85 


128 


APA1 


8.7 


0.7 


0.425 


56,167 


5^83 


34 


AROS 


9.1 


3.0 


0.324 


36,358 


5.05 


75 


YJR105W 


17.6 


17.1 


0.522 


56,584 


6.36 


20 


CYB2 


18.9 


NA C 


0259 


36,358 


5.05 


76 


YJR105W 


27.5 


- 17.1 


0.522 


57,366 


5.53 


60 


FRS2 


13 


0.7 


0.451 


36,596 


6.37 


79 


ADH2 


58.9 


260.0 C 


0.711 


57,383 


5.98 


144 


ZWF1 


5.6 


0.7 


0.215 


36,714 


6.30 


102 


ADH1 


746.1 


260.0 


0.913 


57,464 


5.49 


36 


THR4 


21.4 


3.7 


0.508 


36,714 


6.30 


103 


ADH1 


17.6 


260.0 


0.913 


57,512 


5.50 


7 


SRV2 


6.5 


NA C 


0.260 


36,714 


6.30 


104 


ADH1 


61.4 


260.0 


0.913 


57,727 


4.92 


152 


VMA2 


33.7 


8.9 


0.546 


36,714 


6.30 


105 


ADH1 


517 


260.0 


0.913 


58,573 


6.47 


17 


ACH1 


4.4 


1.5 


0.327 


37,033 
37,796 


6.23 


44 


TALI 


44.8 


3.7 


0.701 


58,573 


6.47 


18 


ACHI 


5.4 


1.5 


0.327 


7.36 


57 


IDH2 


29.4 


6.7 


0.330 


61,353 


5.87 


21 


PDC1 


6.5 


200.7 


0.962 


37,886 


6.49 


106 


ILV5 


76.0 


4.5 


0.892 


61,353 


5.87 


22 


PDC1 


303.2 


200.7 


0.962 


38.700 


7.83 


55 


BAT1 


30.9 


11.2 


0.469 


61,353 


5.87 


23 


PDCI 


16.3 


200.7 


0.962 


38,702 


6.24 


46 


OCR2 


NA' ; 


12 


0.326 


61,649 


5.54 


38 


CCT8 


12 


1.5 
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Mol 


P' 


Spot no. 


YPD gene 
name 


Protein 
abundance 
tuples/ 

cell) 


uiRNA 
abundance 
(copicft/cell) 


Codon 


61,902 


6.21 


101 


PDC5 


4.3 


NA tf 


0.828 


62,266 


6.19 


16 


TCL1 


20.1 


NA ,: 


0.327 


62,862 


8.02 


19 


1LV3 


5.3 


4.5 


0.548 


63,082 


6.40 


119 


PGM2 


2.2 


3.0 


0.402 


64,335 


5.77 


5 


PAB1 


30.4 


1.5 


0.616 


66,120 


5.42 


8 


sni 


6.7 


0.7 


0.313 


66,120 


5.42 


9 


STT1 


6.4 


0.7 


0.313 


66,450 


5.29 


141 


SSB2 


7.U 


NA £ 


0.880 


66,450 


5.29 


142 


SSB2 


2.3 


NA' : 


O.SSO 


66,456 


5.23 


10 


SSB1 


64.5 


79.5 


0.907 


66,456 


5.23 


11 


SSB1 


59.0 


79.5 


0.907 


66,456 


5.23 


12 


SSB1 


13.7 


79.5 


0.907 


68,397 


5.82 


82 


LEU4 


3.1 


3.0 


0.407 


69,313 


4.90 


13 


SSA2 


24.3 


18.6 


0.892 


69,313 


4.90 


14 


SSA2 


77.1 


18.6 


0.892 


74,378 


8.46 


15 


YKL029C 


2.8 


3.7 


0.353 


75,396 


5.82 


' 6 


GRS1 


5.5 


7.4 


0.500 


85,720 


6.25 


1 


MBT6 


2.0 


NA* 


0.772 


85,720 


6.25 


2 


MET6 


10.9 


NA C 


0.772 


85,720 


6.25 


3 


MET6 


1.4 


NA' 


0.772 


93,276 


6.11 


131 


EFT1 


17.9 


41.6 


0.890 


93,276 


6.11 


132 


EFn 


5.7 


41.6 . 


0.890 


102,064' 


6.6 r 


94 


ADE3 


4.8 


5.2 


0.423 


J 07,482* 


5.33 e 


4 


MCM3 


2.7 


NA C 


0.240 



* YPD gene names arc available from Use YPD website (39). 

6 NA, calculation amid noi be performed or was not available. 
c mRNA data inconclusive nr NA. 

d No methionines in predicted ORF; therefore, protein concentration was not 
determined. 

* Measured molecular weight or pi did not match theoretical molecular weight 
cr pT. 



Protein quantitation. [ 35 S]m at hionine- labeled gels were exposed to X-ray film 
overnight, and then the silver stain and film were used to excise 156 spots of 
varying intensities, molecular weights, and pis. The excised spots were placed in 
0.6-tnl microcentrifuge lubes, and scintillation cocktail (100 \d) was added. The 
samples were vonexed and counted. In addition, two parallel geis were electro- 
blotted to polyvinylideue difluoricic membranes. The membranes were exposed 
to X-ray film, and frnir intense single spots were excised from each membrane 
and subjected to amino acid analysis. For these four spots, a mean of 209 ± 4 
cpm/pmol of protein/ methionine wus found. This cumber was used to quantitale 
all remaining spots in conjunction with the number of methionines present in the 
protein. 

To ensure that proteins were labeled to equilibrium, parallel 2D gels were 
prepared ami run on yeast mctabolically labeled for l ( 2, 6, or 18 h. The 
corresponding 156 spots were excised from each gel, and radioactivity was mea- 
sured by liquid scintillation counting for each spot. Calculated protein levels were 
highly reproducible for all time points measured after 1 h. 

Calculation of codon bias and predicted half-life. Codon bias values were 
extracted from the YPD spreadsheet (17). Protein balf-ljves were calculated 
based on the N-end rule (33). When the N-terminal processing was not luiown 
experimentally, it was predicted based on the affinity of methionine aminopep- 
tidase (31). 

RESULTS 

Characteristics of proteome approach. Nearly every facet of 
proteome analysis hinges on the unambiguous identification of 
large numbers of expressed proteins in cells. Several tech- 
niques have been described previously for the identification of 
proteins separated by 2DE, including N-terminal and internal 
sequencing (1, 2), amino acid analysis (38), and more recently 
mass spectrometry (25). We utilized techniques based on mass 
spectrometry because they afford the highest levels of sensitiv- 
ity and provide unambiguous identification. The specific pro- 
cedure used is schematically illustrated in Fig. 1 and is based 
on three principles. First, proteins are removed from the gel by 



proteolytic in-gel digestion, and the resulting peptides are sep- 
arated by on-line capillary high-peiformance liquid chromatog- 
raphy. Second, the etuting peptides are ionized and detected, and 
the specific peptide ions are selected and fragmented by the 
mass spectrometer. To achieve this, the mass spectrometer 
switches between the MS mode (for peptide mass identifica- 
tion) and the MS/MS mode (for peptide characterization and 
sequencing). Selected peptides are fragmented by a process 
called collision-induced dissociation (CID) to generate a tan- 
dem mass spectrum (MS/MS spectrum) that contains the pep- 
tide sequence information. Third, individual CID mass spectra 
are then compared by computer algorithms to predicted spec- 
tra from a sequence database. This results in the identification 
of the peptide and, by association, the protein(s) in the spot. 
Unambiguous protein identification is attained in a single anal- 
ysis by the detection of multiple peptides derived from the 
same protein. 

Protein identification. Yeast total cell protein lysale (40 \ig), 
metabolically labeled with [^methionine, was electro- 
phoretically separated by isoelectric focusing in the first dimen- 
sion and by SDS-10% polyacrylamide gel electrophoresis in 
the second dimension. Proteins were visualized by silver stain- 
ing and by autoradiography. Of the more than 1,000 proteins 
visible by silver staining, 156 spots were excised from the gel 
and subjected to in-gel tryptic digestion, and the resulting 
peptides were analyzed and identified by microspray LC- 
MS/MS techniques as described above. The proteins in this 
study were all identified automatically by computer software 
with no human interpretation of mass spectra. They arc indi- 
cated in Fig. 2 and detailed in Tabic 1. 

The (ID spectra shown in Fig. 3 indicate that the quality of 
the identification data generated was suitable for unambiguous 
protein identification. The spectra represent the amino acid 
sequences of tryptic peptides NSGDIVNLGSIAGR (Fig. 3A) 
and FAVGAFTDSLR (Fig. 3B). Both peptides were derived 
from protein S57593 (hypothetical protein YMR226C), which 
migrated to spot 114 (molecular weight, 29.156; pi, 6.59) in the 
2D gel in Fig. 2. Five other peptides from the same analysis 
were also computer matched to the same protein sequence. 

Protein and mRNA quantitation. For the 156 genes investi- 
gated, the protein expression le vels ranged from 2,200 (PGM2) 
to 863,000 (TDH2/TDH3) copies/ceU. The levels of mRNA for 
each of the genes identified were calculated from SAGE fre- 
quency tables (35). These tables contain the mRNA levels for 
4,665 genes in yeast strain YPH499 grown to mid-log phase in 
YPD medium on glucose as a carbon source. In some in- 
stances, the mRNA levels could not be calculated for reasons 
stated in Materials and Methods. For the proteins analyzed in 
this study, mean transcript levels varied from 0.7 to 473 copies/ 
cell. 

Selection of the sample population for mRNA-protein ex- 
pression level correlation. The protein spots selected for iden- 
tification were selected from spots visible by silver staining in 
the 2D geJ. An attempt was made not to include spots where 
overlap with other spots was readily apparent. The number of 
proteins identified was 156 (Table 1). Some proteins migrated 
to more than one spot (presumably due to differential protein 
processing or modifications), and protein levels from these 
spots were calculated by integrating the intensities of the dif- 
ferent spots. The 156 protein spots analyzed represented the 
products of 128 different genes. Genes were excluded from the 
correlation analysis only if part of the data set was missing; i.e. f 
genes were excluded if (i) no mRNA expression data were 
available for the protein or putative SAGE tags were ambig- 
uous, (ii) the amino acid sequence did not contain methionine, 
(iii) more than a single protein was conclusively identified as 
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i protein, S57593 (yeast hypothetical protein YMR226C). Five other peptides from the same analysis were 



migrating to the same gel spot, or (iv) the theoretical and 
observed pis and molecular weights could not be reconciled. 
After these criteria were applied, the number of genes used in 
the correlation analysis was 106. 



Codon bias and predicted half-lives. Codon bias is thought 
to be an indicator of protein expression, with highly expressed 
proteins having large codon bias values. The codon bias distri- 
bution for the entire set of more than 6,000 predicted yeast 
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gene ORFs is presented in Fig. 4A The interval with the 
largest frequency of genes is between the codon bias values of 
0.0 and 0.1. This segment contains more than 2,500 genes. The 
distribution of the codon bias values of the 128 different genes 
found in this study (all protein spots from Fig. 2) is shown in 
Fig. 4B, and protein half-lives (predicted from applying the 
N-end rule [33] to the experimentally determined or predicted 
protein N termini) are shown in Fig. 4C No genes were iden- 
tified with codon bias values less than 0.1 even though thou- 
sands of genes exist in this category. In addition, nearly all of 
the proteins identified had long predicted half-lives (greater 
than 30 h). 

Correlation of mRNA and protein expression levels. The 
correlation between mRNA and protein levels of the genes 
selected as described above is shown in Fig. 5. For the entire 
group (106 genes) for which a complete data set was gener- 
ated, there was a general trend of increased protein levels 
resulting from increased mRNA levels. The Pearson product 
moment correlation coefficient for the whole data set (106 
genes) was 0.935. This number is highly biased by a small 
number of genes with very large protein and message levels. A 
more representative subset of the data is shown in the inset of 
Fig. 5. It shows genes for which the message level was below 10 
copies/cell and includes 69% (73 of 106 genes) of the data used 
in the study. The Pearson product moment correlation coeffi- 
cient for this data set was only 0.356. We also found that levels 
of protein expression coded for by mRNA with comparable 
abundance varied by as much as 30-fold and that the mRNA 
levels coding for proteins with comparable expression levels 
varied by as much as 20-fold. 

The distortion of the correlation value induced by the un- 
even distribution of the data points along the x axis Is further 
demonstrated by the analysis in Fig. 6. The 106 samples in- 
cluded in the study were ranked by protein abundance, and the 
Pearson product moment correlation coefficient was repeat- 
edly calculated after including progressively more, and higher- 
abundance, proteins in each calculation. The correlation values 
remained relatively stable in the range of 0.1 to 0.4 if the 
lowest-expressed 40 to 95 proteins used in this study were 
included. However, the correlation value steadily climbed by 
the inclusion of each of the 11 very highly expressed proteins. 

Correlation of protein and mRNA expression levels with 
codon bias. Codon bias is the propeasity for a gene to utilize 
the same codon to encode an amino acid even though other 
codons would insert the identical amino acid in the growing 
polypeptide sequence. It is further thought that highly ex- 
pressed proteins have large codon biases (3). To assess the 
value of codon bias for predicting mRNA and protein levels in 
exponentially growing yeast cells, we plotted the two experi- 
mental sets of data versus the codon bias (Fig. 7). The distri- 
bution patterns for both mRNA and protein levels with respect 
to codon bias were lughly similar. There was high variability in 
Ihe data within the codon bias range of 0.8 to 1.0. Although a 
large codon bias generally resulted in higher protein and mes- 
sage expression levels, codon bias did not appear to be predic- 
tive of either protein levels or mRNA levels in the cell. 

DISCUSSION 

The desired end point for the description of a biological 
system is not the analysis of mRNA transcript levels alone but 
also the accurate measurement of protein expression levels and 
their respective activities. Quantitative analysis of global 
mRNA levels currently is a preferred method for the analysis 
of the state of cells and tissues (11). Several methods which 
either provide absolute mRNA abundance (34, 35) or relative 
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mRNA levels in comparative analyses (20, 27) have been de- 
scribed elsewhere. The techniques are fast and exquisitely sen- 
sitive and can provide mRNA abundance for potentially any 
expressed gene. Measured mRNA levels are often implicitly or 
explicitly extrapolated to indicate the levels of activity of the 
corresponding protein in the cell. Quantitative analysis of pro- 
tein expression levels (proteome analysis) Ls much more time- 
consuming because proteins are analyzed sequentially one by 
one and is not general because analyses are limited to the 
relatively highly expressed proteins. Proteome analysis does, 
however, provide types of data that are of critical importance 
for the description of the state of a biological system and that 
are not readily apparent from the sequence and the level of 
expression of the mRNA transcript. This study attempts to 
examine the relationship between mRNA and protein expres- 
sion levels for a large number of expressed genes in ceils 
representing the same state. 

Limits in the sensitivity of current protein analysis technol- 
ogy precluded a completely random sampling of yeast proteins. 
We therefore based the study on those proteins visible by silver 
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staining on a 2D gel. Of the more than 1,000 visible spots, 156 
were chosen to include the entire range of molecular weights, 
isoelectric focusing points, and staining intensities displayed on 
the 2D protein pattern. The genes identified in this study 
shared a number of properties. First, all of the proteins in this 
study had a codon bias of greater than 0.1 and 93% were 
greater than 0.2 (Fig. 4B). Second, with few exceptions, the 
proteins in this study had long predicted half-lives according to 
the N-end rule (Fig. 4C). Tliird, low-abundance proteins with 
regulatory functions such as transcription factors or protein 
kinases were not identified. 

Because the population of proteins used in this study ap- 
pears to be fairly homogeneous with respect to predicted half- 
life and codon bias, it might be expected that the correlation of 
the mRNA and protein expression levels would be stronger for 
this population than for a random sample of yeast proteins. We 
tested this assumption by evaluating the correlation value if 
different subsets of the available data were included in the 
calculation. The 106 proteins were ranked from lowest to high- 
est protein expression level, and the trend in the correlation 
value was evaluated by progressively including more of the 
higher-abundance proteins in the calculation (Fig. 6). The cor- 
relation value when only the lower-abundance 40 to 93 pro- 
teins were examined was consistently between 0.1 and 0.4. If 
the 11 most abundant proteins were included, the correlation 
steadily increased to 0.94. We therefore expect that the corre- 
lation for all yeast proteins or for a random selection would be 
less than 0.4. The observed level of correlation between 
mRNA and protein expression levels suggests the importance 



of posttranslational mechanisms controlling gene expression. 
Such mechanisms include translational control (15) and con- 
trol of protein half-life (33). Since these mechanisms are also 
active in higher eukaryotic cells, we speculate that there is no 
predictive correlation between steady-state levels of mRNA 
and those of protein in mammalian cells. 

Like other large-scale analyses, the present study has several 
potential sources of error related to the methods used to de- 
termine mRNA and protein expression levels. The mRNA 
levels were calculated from frequency tables of SAGE data. 
This method is highly quantitative because it is based on actual 
sequencing of unique tags from each gene, and the number of 
times that a tag is represented is proportional to the number of 
mRNA molecules for a specific gene. This method has some 
limitations including the following: (i) the magnitude of the 
error in the measurement of mRNA levels is inversely propor- 
tional to the mRNA levels, (ii) SAGE tags from highly similar 
genes may not be distinguished and therefore are summed, (iii) 
some SAGE tags are from sequences in the 3' untranslated 
region of the transcript, (iv) incomplete cleavage at the SAGE 
tag site by the restriction enzyme can result in two tags repre- 
senting one mRNA, and (v) some transcripts actually do not 
generate a SAGE tag (34, 35). 

For the SAGE method, the error associated with a value 
increases with a decreasing number of transcripts per cell. The 
conclusions drawn from this study are dependent on the qual- 
ity of the mRNA levels from previously published data (35). 
Since more than 65% of the mRNA levels included in this 
study were calculated to 10 copies/cell or less (40% were less 
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than 4 copies/cell), the error associated with these values may 
be quite large. The niRNA levels were calculated from more 
than 20,000 transcripts. Assuming that the estimate of 1 5,000 
mRNA molecules per cell is correct (16), this would mean that 
mRNA transcripts present at only a single copy per celt would 
be detected 12% of the time (35). The mRNA levels for each 
gene were carefully scrutinized, and only mRNA levels for 
which a high degree of confidence existed were included in the 
correlation value. 

Protein abundance was determined by metabolic radiolabel- 
ing with [ 35 S]methionine. The calculation required knowledge 
of three variables: the number of methionines in the mature 
protein, the radioactivity contained in the protein, and the 
specific activity of the radiolabel normalized per methionine. 
The number of methionines per protein was determined from 
the amino acid sequence of the proteins identified by tandem 
mass spectrometry. For some proteins, it was not known 
whether the methionine of the nascent polypeptide was pro- 
cessed away. The N termini of those proteins were predicted 
based on the specificity of methionine aminopeptidase (31). If 
the N-terminal processing did not conform to the predicted 
specificity of processing enzymes, the calculation of the num- 
ber of methionines would be affected. This discrepancy would 
affect most the quantitation of a protein with a very low num- 
ber of methionines. The average number of calculated methi- 
onines per protein in this study was 7.2. We therefore expect 
the potential for erroneous protein quantitation due to un- 
usual N-terminal processing to be small. 



Hie amount of radioactivity contained in a single spot might 
be the sum of the radioactivity of comigrating proteins. Be- 
cause protein identification was based on tandem mass spec- 
trometric techniques, comigrating proteins could be identified. 
However, comigrating proteins were rarely detected in this 
study, most likely because relatively small amounts of total 
protein (40 (xg) were initially loaded onto the gels, which re- 
sulted in highly focused spots containing generally 1 to 25 ng of 
protein. Because of the relatively small amount loaded, the 
concentrations of any potentially comigrating protein would 
likely be below the limit of detection of the mass spectrometry 
technique used in this study (1 to 5 ng) and below the limit of 
visualization by silver staining (1 to 5 ng). In the overwhelming 
majority of the samples analyzed, numerous peptides from a 
single protein were detected. It is assumed that any comigrat- 
ing proteins were at levels too low to be detected and that their 
influence in the calculation would be small. 

The specific activity of the radiolabel was determined by 
relating the precise amount of protein present in selected spots 
of a parallel gel, as determined by quantitative amino acid 
composition analysis, to the number of methionines present in 
the sequence of those proteins and the radioactivity deter- 
mined by liquid scintillation counting. It is possible that the 
resulting number might be influenced by unavoidable losses 
inherent in the amino acid analysis procedure applied. Because 
four different proteins were utilized in the calculation and the 
experiment was done in duplicate, the specific activity calcu- 
lated is thought to be highly accurate. Indeed, the specific 
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activities calculated for each of the four proteins varied by less 
than 10%. Any inconsistencies in the calculation of the specific 
activity would result in differences in the absolute levels calcu- 
lated but not in the relative numbers and would therefore not 
influence the correlation value determined. 

The protein quantitative method used eliminates a number 
of potential errors inherent in previous methods for the quan- 
titation of proteins separated by 2DE, such as preferential 
protein staining and bias caused by inequalities in the number 
of radiolabeled residues per protein. Any 2D gel-based method 
of quantitation is complicated by the fact that in some cases the 
translation products of the same mRNA migrated to different 
spots. One major reason is posttranslatkmai modification or 
processing of the protein. Also, artifactual proteolysis during 
cell lysis and sample preparation can lead to multiple resolved 
forms of the protein. In such cases, the protein levels of spots 
coded for by the same mRNA were pooled. In addition, the 
existence of other spots coded for by the same mRNA that 
were not analyzed by mass spectrometry or that were below the 
limit of detection for silver staining cannot be ruled out. How- 
ever, since this study is based on a class of highly expressed 
proteins, the presence of undetected minor spots below silver 
staining sensitivity corresponding to a protein analyzed in the 
study would generally cause a relatively small error in protein 
quantitation. 

Codon bias is a measure of the propensity of an organism to 
selectively utilize certain codons which result in the incorpo- 
ration of the same amino acid residue in a growing polypeptide 
chain. There are 61 possible codons that code for 20 amino 
acids. The larger the codon bias value, the smaller the number 
of codons that are used to encode the protein (19). It is 



thought that codon bias is a measure of protein abundance 
because highly expressed proteins generally have large codon 
bias values (3, 13). 

Nearly all of the most highly expressed proteins had codon 
bias values of greater than 0.8. However, we detected a number 
of genes with high codon bias and relative low protein abun- 
dance (Fig. 7). For example, the expressed gene with both the 
second largest protein and mRNA levels in the study was 
EN02_YEAST (775,000 and 289.1 copies/cell, respectively). 
ENOIJYEAST was also present in the gel at much lower 
protein and mRNA levels (44,200 and 0.7 copies/cell, respec- 
tively). The codon bias values for EN02 and ENOl are similar 
(0.96 and 0.93, respectively), but the expression of the two 
genes is differentially regulated. Specifically, EN01_YEAST is 
glucose repressed (6) and was therefore present in low abun- 
dance under the conditions used. Other genes with large codon 
bias values that were not of (ugh protein abundance in the gel 
include EFT1, TIF1, HXK2, GSP1, EGD2, SHM2, and TALI. 
We conclude that merely determining the codon bias of a gene 
is not sufficient to predict its protein expression level. 

Interestingly, codon bias appears to be an excellent indicator 
of the boundaries of current 2D gel proteome analysis tech- 
nology. There are thousands of genes with expressed mRNA 
and likely expressed protein with codon bias values less than 
0.1 (Fig. 4A). In this study, we detected none of them, and only 
a very small percentage of the genes detected in this study had 
codon bias values between 0.1 and 0.2 (Fig. 4B). Indeed, in 
every examined yeast proteome study (5, 7, 13, 28) where the 
combined total number of identified proteins is 300 to 400, this 
same observation is true. It is expected that for the more 
complex cells of higher eukaryotic organisms the detection of 
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low-abundance proteins would be even more challenging than 
for yeast. This indicates that highly abundant, long-lived pro- 
teins are overwhelmingly detected in proteome studies. Lf pro- 
teome analysis is to provide truly meaningful information 
about cellular processes, it must be able to penetrate to the 
level of regulatory proteins, including transcription factors and 
protein kinases. A promising approach is the use of narrow- 
range focusing gels with immobilized pH gradients (IPG) (23). 
This would allow for the loading of significantly more protein 
per pH unit covered and also provide increased resolution of 
proteins with similar electrophoretic mobilities. A standard pH 
gradient in an isoelectric focusing gel covers a 7-pH-unit range 
(pH 3 to 10) over 18 cm. A harrow-range focusing gel might 
expand the range to 0.5 pH units over 18 cm or more. This 
could potentially increase by more than 10-fold the number of 
proteins that can be detected. Clearly, current proteome tech- 
nology is incapable of analyzing low-abundance regulatory pro- 
teins without employing an enrichment method for relatively 
low- abundance proteins. In conclusion, this study examined 
the relationship between yeast protein and message levels and 
revealed that transcript levels provide little predictive value 
with respect to the extent of protein expression. 
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The relationship between gene expression measured at 
the mRNA level and the corresponding protein level is not 
well characterized in human cancer. In this study, we 
compared mRNA and protein expression for a cohort of 
genes In the same lung adenocarcinomas. The abun- 
dance of 165 protein spots representing 98 individual 
genes was analyzed in 76 lung adenocarcinomas and nine 
non-neoplastic lung tissues using two-dimensional poiy- 
acrylamlde gel electrophoresis. Specific polypeptides 
were identified using matrix-assisted laser desorption/ 
ionization mass spectrometry. For the same 65 samples, 
mRNA levels were determined using oligonucleotide mi- 
croarrays, allowing a comparative analysis of mRNA and 
protein expression among the 165 protein spots. Twenty- 
eight of the 165 protein spots (17%) or 21 of 98 genes 
(21.4%) had a statistically significant correlation between 
protein and mRNA expression (r > 0.2445; p < 0.05)? 
however, among all 165 proteins the correlation coeffi- 
cient values (r) ranged from -0.467 to 0.442. Correlation 
coefficient values were not related to protein abundance. 
Further, no significant correlation between mRNA and 
protein expression was found (r - -0.025) if the average 
levels of mRNA or protein among all samples were applied 
across the 165 protein spots (98 genes). The mRNA/ 
protein correlation coefficient also varied among pro- 
teins with multiple fsoforms, indicating potentially sep- 
arate Isoform-specific mechanisms for the regulation of 
protein abundance. Among the 21 genes with a signifi- 
cant correlation between mRNA and protein, five genes 
differed significantly between stage I and stage III lung 
adenocarcinomas. Using a quantitative analysis of mRNA 
and protein expression within the same lung adenocarci- 
nomas, we showed that only a subset of the proteins 
exhibited a significant correlation with mRNA abundance. 
Molecular & Cellular Proteomlcs 1:304-313, 2O0Z 

Lung cancer is the leading cause of cancer death for both 
men and women in the United States. Adenocarcinomas of 
the lung comprise -40% of all new cases of non-small cell 
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lung cancer and are now the most common histologic type. 
Functional genomics, broadly defined as the comprehensive 
analysis of genes and their products, have become a recent 
focus of the life sciences (1). Application of these approaches to 
lung adenocarcinomas has the potential to aid in the identifica- 
tion of high risk patients with resectable early stage lung cancer 
that may benefit from adjuvant therapy, as well as to identify 
new therapeutic targets. In human lung cancer, however, little is 
currently understood regarding the relationship between gene 
expression as determined by measuring mRNA levels and the 
corresponding abundance of the protein products. 

A number of powerful techniques for analysis of gene ex- 
pression have been used including differential display (2), 
serial analysis of gene expression (3), DNA mlcroarrays (4), 
and proteomics via two-dimensional polyacryiamide gel elec- 
trophoresis and mass spectrometry (5). Bioinformatics tools 
have also been developed to help determine quantitative 
mRNA/protein expression profiles of all types of cells and 
tissues (6) and now can be applied to benign and malignant 
tumors. DNA microarrays (cONA and oligonucleotide) permit 
the parallel assessment of thousands of genes and have been 
utilized in gene expression monitoring (7), polymorphism anal- 
ysis (8), and DNA sequencing (9). Recent studies have fo- 
cused on classification or identification of subgroups of lung 
tumors using DNA microarrays (10, 11). The use of mRNA 
expression patterns by themselves, however, is insufficient for 
understanding the expression of protein products, as addi- 
tional post-transcriptional mechanisms, including protein 
translation, post-translational modification, and degradation, 
may influence the level of a protein present in a given cell or 
tissue. Proteomic analyses, a complementary technology to 
DNA microarrays for monitoring gene expression, involves 
protein separation and quantitative assessment of protein 
spots using 2D 1 -PAGE and protein identification using mass 
spectrometry. By combining proteomic and transcriptional 
analyses of the same samples, however, it may be possible to 
understand the complex mechanisms influencing protein ex- 
pression in human cancer. 

In this study, we determined mRNA and protein levels for 
165 proteins (98 genes) in 76 lung adenocarcinomas and nine 



1 The abbreviations used are: 2D, two-dimensional; MALDI-MS, 
matrix-assisted laser desorption/ioni2at(on mass spectrometry. 
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Protein and mRNA Correlation in Lung Adenocarcinomas 



Table I 

Correlation coefficients of protein and mRNA where only one spot was present on 20 gels 
r*. correlation coefficient value > 0.2445; p < 0.05. Values in boldface are significant at p < 0.05. 
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Table I — continued 



Spot Unigene Gene name 



Protein name 



^lo u S 'nll? TCP1 "°- 1237 T-complex protein I, « subunit 

1769 Hs.9614 NPM1 -0.1738 B23/numatrin 

0089 Hs.74335 HSPCfi -0.2049 Hsp90 

2511 Hs.153179 FABP5 -0.2109 E-FABP/FABP5 

1739 Hs.16488 CALR -0.2344 Calreticulin 32 

l£l u S ^!c 61 GS ™ 4 "°* 2438 Glutathione S-transferase M4 (GST m4) 

2533 Hs.77060 PSMB6 -0.2512 Macropain subunit A 



non-neoplastic lung tissues. Protein levels were determined 
using quantitative 2D-PAGE analysis, and the separated pro- 
tein polypeptides were identified using matrix-assisted laser 
desorption/ionization mass spectrometry (MALD!- MS). The 
corresponding mRNA levels for the identified proteins within 
the same samples were determined using oligonucleotide 
microarrays. Correlation analyses showed that protein abun- 
dance is likely a reflection of the transcription for a subset of 
proteins, but translation and post-translational modifications 
also appear to influence the expression levels of many indi- 
vidual proteins in lung adenocarcinomas. 

EXPERIMENTAL PROCEDURES 
Tissues- Fifty-seven stage I and 19 stage III lung adenocarcino- 
mas, as well as nine non-neoplastic lung tissue samples, were used 
for protein and mRNA analyses. Patient consent was obtained, and 
the project was approved by the Institutional Review Board. All tis- 
sues were obtained after resection at the University of Michigan 
Health System between May 1991 and July 1998. Tissues were all 
snap-frozen in liquid nitrogen and then stored at -80 *C. The patients 
included 46 females and 30 males ranging in age from 40.9 to 84.6 
(average 63.8) years. Most patients (66/76) demonstrated a positive 
smoking history. Sixty-one tumor samples were classified as bron- 
chial-derived, 14 were classified as bronchoalveolar, and one had 
both features. Eighteen tumor samples were classified as well differ- 
entiated, 38 were classified as moderate, and 19 were classified as 
poorly differentiated adenocarcinomas. Hematoxylin-stained cryostat 
sections (5 ^m). prepared from the same tumor pieces to be utilized 
for protein and mRNA isolation, were evaluated by a pathologist and 
compared with hematoxylin- and eosin-stained sections made from 
paraffin blocks of the same tumors. Specimens were excluded from 
analysis if they showed unclear or mixed histology (e.g. adenosqua- 
mous), tumor cellularity less than 70%, potential metastatic origin as 
indicated by previous tumor history, extensive lymphocytic infiltration, 
or fibrosis or.rf the patient had received prior chemotherapy or 
radiotherapy. 

Oligonucleotide Array Hybridization -The HuGeneFL oligonucleo- 
tide arrays (Affymetrix, Santa Clara, CA) containing 6800 genes were 
used in this study. Total RNA was isolated from all samples using 
Trizol reagent (Invitrogen). The resulting RNA was then subjected to 
further purification using RNeasy spin columns (Qiagen). Preparation 
of cRNA, hybridization, and scanning of the HuGeneFL arrays were 
performed according to the manufacturer's protocol (Affymetrix, 
Santa Clara, CA). Data analysis was performed using GeneChip 4.0 
software. The gene expression profile of each tumor was normalized 
to the median gene expression profile for the entire sample. Details of 
data trimming and normalization are described elsewhere (11). 

2D-PAGB and Quantitative Protein Analysis- Tissue for both pro- 
tein and mRNA isolation came from contiguous areas of each sample. 
Protein separation using 2D-PAGE, silver staining, and digitization 



were performed as described previously (12, 13). Our 2D-PAGE sys- 
tem allows us to run 20 gels at one time (one batch). Spot detection 
and quantification were accomplished utilizing Bio Image Visage Sys- 
tem software (Bioimage Corp., Ann Arbor, Ml). The integrated inten- 
sity of each spot was calculated as the measured optical density 
units x mm 2 . Of the total possible 2000 spots detectable on each gel, 
820 spots on the gel of each sample were matched using a Gel-ed 
match program with the same spots on a chosen "master" gel. In 
each sample, 250 ubiquitously expressed reference spots were used 
to adjust for variations between gels, such as that created by subtle 
differences In protein loading or gel staining. Slight differences be- 
cause of batch were corrected after spot-size quantification. 

Mass Spectrometry and 2D Western Blotting- Preparative 2D gels 
were run using extracts from A549 lung adenocarcinoma cells (ob- 
tained from ATCC) and using the identical experimental conditions as 
the analytical 2D gels, except 30% more protein was loaded. The 
resolved protein gels were silver-stained using successive incuba- 
tions in 0.02% sodium thiosulfate for 2 min, 0.1 % silver nitrate for 40 
min, and 0.014% formaldehyde plus 2% sodium carbonate for 10 
min. For protein identification, protein polypeptides underwent trypsin 
digestion followed by MALDI-MS using a MALDI-TOF Voyager-DE 
mass spectrometer (Perseptive Biosystems, Framingham, MA). The 
masses were compared with known trypsin digest databases using 
the MS-FfT database (University of California, San Francisco; 
prospector.ucsf.edu/ucsfhtml3.2/msfit.htm). Some of the polypep- 
tides included in the analysis had been Identified prior to this study on 
the basis of sequencing (14). The identified protein spots used in this 
paper are shown In Fig. 1A. The method for 2D-PAGE Western blot 
verification was as described previously (1 5). The 2D Western blots of 
GRP58 and Op18 are shown in Fig. 1 , C and £; the others, such as 
GRP78, GRP75, HSP70, HSC70, KRT8, KRT18, KRT19, Vimentin, 
ApoJ, 14-3-3, Annexin I, Annexin II, PGP9.5, DJ-1, GST-pi, and 
PGAM, are described elsewhere. 2 

Statistical Analysis- Missing values were replaced with the mean 
value of the protein spot. The transform x -> log (1 + x) was applied 
to normalize all protein expression values. The relationship between 
protein and mRNA expression levels within the same samples was 
examined using the Spearman correlation coefficient analysis (1 6). To 
identify potentially significant correlations between gene and protein 
expression, we used an analytical strategy similar to SAM (signifi- 
cance analysis of microarrays) (17), which uses a permutation tech- 
nique to determine the significance of changes In gene expression 
between different biological states. To obtain permuted correlation 
coefficients between gene and protein expression, genes were ex- 
changed first in such a way that permutated correlation coefficient 
were calculated based on pseudo pairs of genes and proteins. The 
distribution of permutated correlation coefficients became stable after 
60 permutations. This procedure was then repeated 60 times to 
obtain 60 sets of permutated correlation coefficients. For each of the 
60 permutations, the correlations of genes and proteins were ranked 



2 Chen ei at. , submitted for publication. 
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Table II 

Correlation coefficients of protein and mRNA where multiple isofbrms were present on 2D gels 
r*. correlation coefficient value > 0,2445; p < 0.05. Values in boldface are significant at p < 0.05. 
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-0.0137 


1810 


Hs.75990 


HP 


-0.4672 


1459 


Hs.75990 


HP 


0.0802 


1458 


Hs.75990 


HP 


-0.0305 


0619 


Hs.75990 


HP 


0.0461 


0615 


Hs.75990 


HP 


-0.0034 


1250 


Hs.41707 


HSPB3 


-0.1024 


0549 


Hs.79037 


HSPD1 


0.1074 


0338 


Hs.79037 


HSPD1 


0.2265 


0333 


Hs.79037 


HSPD1 


0.1383 


0331 


Hs.79037 


HSPD1 


0.1603 


2381 


Hs.65114 


KRT18 


0.2016 


0535 


Hs.65114 


KRT18 


0.1106 



Protein name 



OP18(Stathmln) 
Tropomyosins 1-5 

Protease disulfide isomerase (GRP58) 
Glyceraldehyde-3-phosphate dehydrogenase 
Hsp27 

Triose phosphate isomerase (TP I) 

Cytokeratin 18 

OP18(Stathmin) 

OP18 (Stathmin) 

Annexln variant I 

Cytokeratin B 

Vimentin 

Vimentin 

Aldose reductase 
14-3-3 t) 
Annexln I 
Cytokeratin 18 
Hsp27 

Phospholipase C (GRP58) 
Aldose reductase 
Aldose reductase 
Aldose reductase 
Aldehyde dehydrogenase 
Aldehyde dehydrogenase 
Aldehyde dehydrogenase 
Annexin variant I 
Annexin I 
Annexin I 

Lipocotin (annexin II) 
Lipocotin (annexin II) 
Lipocotin 

Apolipoprotein A1 (ApoA1) 

Apolipoprotein A1 (ApoA1) 

Apolipoprotein A1 (ApoA1) 

ATP synthase 0 subunlt precursor 

ATP synthase p subunit precursor 

ATP synthase p subunit precursor 

Apolipoprotein J (ApoJ) 

Apolipoprotein J (ApoJ) 

elF-5A 

elF-5A 

L-FABP 

L-FABP 

Glycera!dehyde-3-phosphate dehydrogenase 

Glyoxalase-I 

Glyoxalase-1 

Huntingtin-associated protein 1 (nsuroan 1) 

ar-Haptoglobin 

a-Haptoglobin 

a-Haptoglobin 

B-haptogtobin 

B-haptoglobin 

Hsp27 

Hsp60 

Hsp60 

Hsp60 

Hsp60 

Cytokeratin 18 
Cytokeratin 18 
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Correlation 
r\ correlation coefficient value 



Table II— continued 

coefficients of protein and mRNA where multiple isoforms were present on 2D gels 
> 0.2445; p < 0,05. V alues In boldface are significant at p < 0.05. 



Spot 


Unigene 


Gene name 


r* 


0529 


rib. do t m ■ 


KRT18 


0.1279 


0528 


ns.oo 1 14 


KRT18 


0.0414 


0527 


ns.oo 1 14 


KRT18 


0.0436 


0514 


MS.DOl 14 


KRT18 


0.0733 


u**o 1 


HS. ^42463 


KRT8 


-0.0111 




Hs. 242463 


KRT8 


0.0347 


HA A A 


Hs.242463 


KRT8 


-0.1311 




MS, 242463 


KRTB 


0.0942 




HS.Di915 


LAP 18 


0.0495 


n^9i 


Hs. 75655 


P4HB 


--0.0546 




Hs.75655 


P4HB 


-6.0041 


in 


Hs. 75323 


PHB 


0.0441 


Uo of 


Hs.75323 


PHB 


0.1402 


nnc 
vodo 


Hs. 297681 


SERP1NA1 


-0.0227 


0322 


Hs.297681 


SERPINA1 


-0.0277 


0241 


Hs.297681 


SERPINA1 


-0.0148 


1280 


Hs.301254 


SFTPA1 


-0.148B 


127B 


Hs.301254 


SFTPA1 


-0.2040 


0866 


Hs. 73980 


TNNT1 


0.1162 


0778 


Hs.73980 


TNNT1 


0.0740 


1213 


Hs.83846 


TPI1 


0.0024 


1210 


Hs.B3848 


TPI1 


0.0490 


1207 


Hs.83848 


TPI1 


-0.1615 


1204 


Hs.83848 


TPI1 


0.0209 


1202 


Hs.83848 


TPI1 


0.0721 




Hs.83848 


TPI1 


0.2265 


1052 


Hs.77899 


TPM1 


-0.1040 


1039 


Hs.77899 


TPM1 


-0.2999 


1035 


Hs.77899 


TPM1 


-0.3821 


0783 


Hs.77899 


TPM1 


0.0757 


1574 


Hs. 194366 


TTR 


-0.0065 


0809 


Hs.194366 


TTR 


0.0399 


2202 


Hs.76118 


UCHL1 


-0.0220 


1246 


Hs.76118 


UCHL1 


-0.1261 


1242 


Hs.76118 


UCHL1 


0.1473 


0606 


Hs,297753 


VIM 


0.0951 


0594 


Hs.297753 


VIM 


-0.2664 


0508 


Hs.297753 


VIM 


0.1008 


0419 


Hs.297753 


VIM 


0.0032 


1279 


Hs.75544 


YWHAH 


0.0059 



Protein name 



Cytokeratin 18 
Cytokeratin 1B 
Cytokeratin 18 
Cytokeratin 18 
Cytokeratin 8 
Cytokeratin 8 
Cytokeratin 8 
Cytokeratin 8 
0P18 (Stathmin) 
PDI (proly-4-OH-B) 
PD1 (proly-4-OH-B) 
Prohibits 
Prohibits 
a-1-Antitripsin 
ct-1 -Antitrlpsin 
ff-1-Antitripsin 

Pulmonary surfactant-associated protein 
Pulmonary surfactant-associated protein 
Troponin T 
Troponin T 

Triose phosphate isomerase (TP!) 
Triose phosphate isomerase (TPI) 
Trioss phosphate isomerase (TPI) 
Triose phosphate isomerase (TPI) 
Triose phosphate Isomerase (TPI) 
Trios9 phosphate isomerase (TPI) 
Tropomysln clean-product 
Cytoskeletal tropomyosin 
Tropomyosin 
Tropomyosins 1-5 
Transthyretin 
Transthyretin multimere 

Ubiquitin carboKyl-terminal hydrolase isozyme L1 
Ubiquitin carboxyl-terminai hydrolase isozyme L1 
Ubiquitin ca/boxyl-terrninal hydrolase isozyme L1 
Vimentin 

Vimentin-dsrived protein (vid4) 
Vimentin-derived protein (vid2) 
Vimentin-derived protein (vidl) 
14-3-3 ti 



such that p p tft denotes the rth largest correlation coefficient for pth 
pennutation. Hence, the expected correlation coefficient p b % was the 
average over the 60 permutations, Pb ® - 2™. , p p ^60. A scatter plot of 
observed correlations (p(f)) versus the expected correlations is shown in 
Fig. 2D. For this study, we chose threshold A - 0.1 15 so that correlation 
would be considered significant if absolute value of difference between 
p(i) and was greater than the threshold. Twenty-nine (including one 
with observed correlation coefficient -0.4672) of 1 65 pairs of gene and 
protein expression were called significant in such criteria, and the 
permuted data generated an average of 5.1 falsely significant pairs of 
gene and protein expression. This provided an estimated false dis- 
covery rate (the percentage of pairs of gene and protein expression 
identified by chance) for our data set. 

RESULTS 

(Do/relation of Individual Proteins and mRNA Expression 
within Each Tumor— We have examined quantitatively 165 



protein spots on 2D gels representing 98 genes and com- 
pared protein levels with mRNA levels for a cohort of 85 lung 
adenocarcinomas and normal lung samples. Of the 165 pro- 
tein spots, 69 proteins were represented by only one known 
spot on 2D gels for an Individual gene, whereas 96 protein 
spots showed multiple protein products from 29 different 
genes. 2D Western blotting verified the proteins Identified by 
mass spectrometry when specific antibodies were available. 
Spearman correlation coefficients of the proteins and their 
associated mRNA for each protein spot were generated using 
all 76 lung adenocarcinomas and nine non-neoplastic lung 
tissues (see Tables I and II, and see Rgs. 1 and 2). The 
correlation coefficients (r) ranged from -0.467 to 0.442 (Rg. 
2D). A total of 28 protein spots (21 genes) were found to have 
a statistically significant correlation between expression of 
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F.G. 1 A } d.grtal .mage of ft silver-stained 2D-PAGE separation of a stage I lung adenocarcinoma showing protein spots separated bv 
molecular ma» (MM* and .soelectric point (P/). Twenty-eight protein spots whose expression levels are conJ^ymn^^bS^n 
indicated by the black arrows. B, the outlined areas of A showing protein GRP58. C. 2D Western blot of GRP58 from * ^49 Za 
adenocarcinoma cell hne. D, the outlined areas of A showing the protein isoforms of Op18. * 2D Western blot of ^18 ^ ^4^cells 9 



their protein and mRNA (r > 0.2445; p < 0.05). This accounts 
for 17% (28/165) of the 165 protein spots. Among the 69 
genes for which only a single protein spot was known (Table 
I), nine genes (9/69, 13%) were observed to show a statisti- 
cally significant relationship between protein and mRNA 
abundance (r > 0.2445; p < 0.05). The proteins whose ex- 
pression levels were correlated with their mRNA abundance 
included those involved in signal transduction, carbohydrate 
metabolism, apoptosis, protein post-translational modifica- 
tion, structural proteins, and heat shock proteins (Table III). 

Individual Isoforms of the Same Protein Have Different 
Protein/mRNA Correlation Coefficients-OI the 165 protein 
spots, 96 represent protein products of 29 genes with at least 
two isoforms. Among these 96 protein spots, 19 (19/96 pro- 
tein spots, 20%) showed a statistically significant correlation 
between their protein and mRNA expression if > 0.2445; p < 
0.05) (T able II) and represented 12 genes (12/29, 41 %). Individ- 
ual isoforms of the same protein demonstrated different 
protein/mRNA correlation coefficients. For example, 2D -PAGE/ 
Western analysis revealed four isoforms of OP18 differing In 
regards to isoelectric point but similar in molecular weight. 
Three of the four isoforms (spots 1 492, 1 493, and 1 494) showed 
a statistically significant correlation between their protein and 
mRNA abundance (r = 0.3234, 0.3154, and 0.4003, respective- 
ly). The forth isoform (spot 1488) showed no correlation be- 



tween protein and mRNA expression (r = 0.0495). Similarly, just 
one of five quantified isoforms of cytokeratin 8 (spot 439) dem- 
onstrated a statistically significant correlation between protein 
and mRNA abundance (r = 0.3049; p < 0.05) (Table II): 

In addition to differences In the relationship between mRNA 
levels and protein expression among separate isoforms, some 
genes with very comparable mRNA levels showed a 24-fold 
difference in their protein expression. Genes with comparable 
protein expression levels also showed up to a 28-fold vari- 
ance in their mRNA levels. 

Lack of Correlation for mRNA and Protein Expression when 
Using Average Tumor Values across All 165 Protein Spots (98 
Genes) -The relationship between mRNA and protein expres- 
sion was also examined by using the average expression 
values for all samples. To analyze this relationship using this 
approach, the average value for each protein or mRNA was 
generated using all 85 lung tissue samples. The range of 
normalized average protein values ranged from -0.0646 to 
0.0979 (raw value 0.0036 to 4.1947), and the range for mRNA 
was from 0 to 15260.5 for all 165 individual protein spots. The 
Spearman correlation coefficient for the whole data set (165 
protein spots/98 genes) was -0.025 (Fig. 3/1). Even for the 28 
protein spots (Fig. 2D) that were found to have a statistically 
significant correlation between their mRNA and protein, use of 
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A: 0P18(*ppt#149): 



1C00 

mRNA vsiu* 
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»0O 



-04 



0 : Distribution of 18V cornslatio* coefficients 



— I— 




Rg. 2. A-C, plots showing the correlation between mRNA and protein for the three selected genes Op18, Annexin IV and GAPD for all 76 
lung adenocarcmomas and nine non-neoplastic lung samples (p < 0.05). O, distribution of all 165 Spearman correh L^oeSts W and 

T^zrrr* A 7 re r deta : ,ed description of the method is provided under w« p***™.^^ 

of he 1 65 protehs demonstrate a slgmficant correlation between mRNA and protein levels as demonstrated by the valuesThown beyond he 
outer range of threshold A ~ 0.1 15. Normalized protein values were used, thus negative values for some p roieins netted \ 

stage III (r» = 19) lung adenocarcinomas (Table Hi). The num- 
ber of non-neoplastic lung samples (n = 9) was insufficient for 
a separate correlation analysis of this group. Many of the 
protein spots represent one of several known protein isoforms 
for a given gene. The majority of genes (16721) did not differ in 
the proteirVmRNA correlation between stage I and stage III 
tumors indicating a similar regulatory relationship between the 
mRNA and protein spot. GRP-58, PSMC, SOD1, TPI1, and 
VIM, however, were found to demonstrate significant differ- 
ences in the correlation coefficients between stage I and 
stage ill lung adenocarcinomas. For GRP-58, PSMC, and VIM 
the change in the correlation coefficient was because of a 
relative increase in protein expression in stage ill tumors. For 
SOD and TPI the change resulted from a relative decrease in 
expression of this specific protein in stage III tumors. 

DISCUSSION 

Relatively little Is known about the regulatory mechanisms 
controlling the complex patterns of protein abundance and 
posttranslational modification in tumors. Most reports con- 
cerning the regulation of protein translation have focused on 



the average value resulted in a correlation coefficient value of 
-0.035, which was not significant (Fig. 3S). 

Lack of a Relationship between Protein/mRNA Correlation 
Coefficients and Average Protein Abundance— To determine 
whether an absolute protein level might influence the corre- 
lation with mRNA, the mean value of each protein (relative 
abundance) and the Spearman protein/mRNA correlation co- 
efficients among all 85 samples were examined. No relation- 
ship between the protein abundance and the correlation co- 
efficients was observed (r = 0.039; p > 0.05). A detailed 
analysis of separate subsets of proteins with differing levels of 
abundance (less than -0.0014, larger than -0.0014, or larger 
than 0.0077) also showed a lack of correlation between mRNA 
and protein expression among the 83 (50%), 82 (50%), and 41 
(25%) of 1 65 total protein spots, respectively (r = 0.016, 0.08, 
and 0.172, respectively). 

Stage-related Changes in the Protein/mRNA Correlation 
Coefficients-To determine whether the 21 genes (28 protein 
spots) showing a significant correlation between the protein 
and mRNA expression among ail samples demonstrate 
changes in this relationship during tumor progression, the 
correlations were examined separately for stage I (n = 57) and 
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Table til 

Stage-dependent analysis of protein-mRNA correlation coefficients 
r, correlation coefficient. Values in boldface indicate a significant difference between stage I and stage III. 



Spot 


Gene name 


r (Stage I) 


r (Stage III) 


1874 


AKR1B1 


0.269 


0.106 


2524 


ANXA1 


0.184 


0.572 


0994 


ANXA4 


0.660 


0.362 


0963 


ANXA5 


0.241 


0.390 


1314 


DJ-1 


0.363 


0.354 


1405 


FTL 


0.126 


0.358 


0855 


GAPD 


0.243 


0.581 


0350 


GRP58 


0.327 


-0.087 


0264 


HNRPK 


0.360 


0.243 


1192 


HSPB3 


0.457 


0.633 


0523 


KRT18 


0.115 


0.371 


0439 


KRT8 


0.323 


0.436 


1492 


LAP 18 


0.483 


0.663 


1638 


LGALS1 


0.200 


0.528 


1252 


PSMC 


0.253 


0.060 


1104 


SFN 


0.465 


0.475 


1454 


SOD1 


0.352 


0.079 


1203 


TPI1 


0.378 


0.009 


0957 


TPM1 


0.475 


0.225 


0593 


VIM 


-0.054 


0.556 


0935 


YWHAH 


0.283 


0.210 



one or several protein products (18). Celis ef a/. (19) found a 
good correlation between transcript and protein levels among 
40 well resolved, abundant proteins using a proteomic and 
microarray study of bladder cancer. By comparing the mRNA 
and protein expression levels within the same tumor samples, 
we found that 1 7% (28/165) of the protein spots (21/98 genes) 
show a statistically significant correlation between mRNA and 
protein. These proteins appear to represent a diverse group of 
gene products and include those involved in signal transduc- 
tion, carbohydrate metabolism, protein modification, cell struc- 
ture, heat shock, and apoptosis. These results suggest that 
expression of this subset of 1 65 proteins is likely to be regulated 
at the transcriptional level in these tissues. The majority of the 
protein isoforms, however, did not correlate with mRNA levels, 
and thus their expression is regulated by other mechanisms. We 
also observed a subset of proteins that demonstrated a nega- 
tive correlation with the mRNA expression values; for example 
a-haptoglobtn demonstrated a strong negative correlation with 
its mRNA expression values. This may reflect negative feedback 
on the mRNA or the protein or the presence of other regulatory 
influences that are not understood currently. 

Post-translational modification or processing will result in 
individual protein products of the same gene migrating to 
different locations on 2D-PAGE gels (20). Because the identity 
of all possible isoforms for each protein examined has not 
been characterized completely, this may influence the corre- 
lation analyses performed in this study. This is partly because 
of limitations of the 2D-PAGE and mass spectrometry tech- 
nologies (21, 22). Potential inconsistencies between mRNA 
and protein correlations that have been reported may also be 
because of differences, even in the same gene, in the mech- 



Function 

Carbohydrate metabolism; electron transporter 
Phospholipase inhibitor; signal transduction 
Phospholipase inhibitor 

Phospholipase Inhibitor; calcium binding; phospholipid binding 
Signal transduction 
Iron storage protein 

Carbohydrate metabolism (glycolysis regulation) 
Signal transduction; protein disulfide isomerase 
RNA-binding protein (RNA processing/modification) 
Heat shock protein 
Structural protein 
Structural protein 

Signal transduction; eel! growth and maintenance 
Apoptosis; cell adhesion; cell size control 
Protein degradation 

Signal transduction (protein kinase C inhibitor) 

Oxidorec'uctase 

Carbohydrate metabolism 

Structural protein (muscle); control of heart 

Structural protein 

Signal transduction 



anisms of protein translation among different cells or as 
measured in different laboratories (23). 

In this study, we examined 165 protein spots identified in 
lung adenocarcinomas. Ninety-six protein spots, representing 
the products of 29 genes, contained at least two protein 
isoforms. Nineteen of 96 protein spots, representing 12 
genes, were shown to have a statistically significant correla- 
tion between their protein and mRNA expression, suggesting 
that the levels of these proteins reflects the transcription of the 
corresponding genes. Differences in proteirvmRNA correlations 
were found among the individual isoforms of a given protein. For 
example, of the four OP1 8 isoforms, three showed a statistically 
significant correlation between the protein and mRNA expres- 
sion levels. The lack of relationship for the one isoform, how- 
ever, indicates that individual protein isoforms of the same gene 
product can be regulated differentially. This is not unexpected 
and likely reflects other post-translational mechanisms that can 
influence isoform abundance in tissues and cancer. 

In addition to the analyses of the correlation of mRNA/ 
protein within the same tumor samples, we also tested the 
global relationship between mRNA and the corresponding 
protein abundance across all 165 protein spots in the lung 
samples. A protein and mRNA average value for each gene 
was generated using all 85 lung tissues samples. We ob- 
served a very wide range of normalized average protein and 
mRNA values. The correlation coefficient generated using this 
average value data set was -0.025, and even for the 28 
protein spots that showed a statistically significant correlation 
between individual mRNA and proteins, the correlation value 
was only -0.035. This suggests that it is not possible to 
predict overall protein expression levels based on average 
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Fig. 3, The overall correlation of 
mRNA and protein levels across all 
165 protein spots (A) and across 28 
protein spots that contained individ- 
ual r values larger than 0.244 (B) are 
shown. Each protein or mRNA mean 
value was calculated based on all 76 
lung adenocarcinomas and nine non- 
neoplastic lung samples using quantita- 
tive 2D-PAGE and Affymetrix oligonu- 
cleotide microarrays. The Spearman 
correlation coefficients for the two data 
sets i/\ and B) were -0.025 and -0.035, 
respectively, indicating a lack of correla- 
tion if mean values for mRNA and protein 
for all samples is used. 



A: Correlation of protein arid mRNA 
across all 165 protein spots 
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B: Correlation of protein and mRNA 
across 28 proteins with r> D.244 
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mRNA abundance in lung cancer samples. This conclusion is 
also supported by previous results from Anderson and Seil- 
hamer (24), who examined 19 genes in human liver cells, and 
by Gygi ef a/. (25), who examined 106 genes in yeast. Both 
studies found a lack of correlation between mRNA and protein 
expression when average or overall levels were used. 

A good correlation was reported when the 11 most abun- 
dant proteins were examined in yeast (25), suggesting that the 
level of protein abundance may be a factor that may influence 
the correlation between mRNA and protein. In the present 
study, a fairly wide range of mean protein values among 165 
protein spots in lung adenocarcinomas was observed, and 
the correlation coefficients also varied from -0.467 to 0.442. 



A comparison between the mean value of each protein and 
the correlation coefficient generated using all 85 tissue sam- 
ples did not reveal a strong relationship between the overall 
protein abundance and the correlation coefficients (r = 0.039; 
p > 0.05). Detailed analysis of different subsets of protein abun- 
dance also failed to show a correlation between mRNA and 
protein expression. Thus in contrast to yeast, a relationship 
between mRNA/protein correlation coefficient and protein 
abundance in human lung adenocarcinomas was not observed. 

The results of this study indicate that the level of protein 
abundance in lung adenocarcinomas is associated with the 
corresponding levels of mRNA in 17% (28 proteins) of the 
total 165 protein spots examined. This was substantially 
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higher than the amount predicted to result by chance alone 
(which was 5.1) and suggests that a transcriptional mecha- 
nism likely underlies the abundance of these proteins in lung 
adenocarcinomas. We also demonstrate that the expression 
of individual isoforms of the same protein may or may not 
correlate with the mRNA, indicating that separate and likely 
post-translational mechanisms account for the regulation of 
isoform abundance. These mechanisms may also account for 
the differences in the correlation coefficients observed between 
stage I and stage III tumors, indicating that specific protein 
isoforms show regulatory changes during tumor progression. 
Further studies in lung adenocarcinomas will examine the rela- 
tionship between the expression of individual protein isoforms 
and specific clinical-pathological features of these tumors, such 
as the presence of angiolymphatic invasion, and nodal or pleu- 
ral surface involvement. The potential to identify specific protein 
isoforms associated with biological behavior in lung adenocar- 
cinomas would be of considerable interest and will add to our 
understanding of the regulation of gene products by transcrip- 
tional, translational, and post-translational mechanisms. 
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primary purpose of this research is to identify proteins that are abundantly 
expressed on certain human tumor tissue(s) and that are either (i) not 
expressed, or (ii) expressed at detectably lower levels, on normal tissue(s)! 

In the course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor tissue 
at significantly higher levels than in normal human tissue. To date, we 
have successfully generated antibodies that bind to 31 of the tumor antigen 
proteins expressed from these differentially expressed gene transcripts and 
have used these antibodies to quantitatively determine the level of 
production of these tumor antigen proteins in both human tumor tissue and 
normal tissue. We have then quantitatively compared the levels of mRNA 
and protein in both the tumor and normal tissues analyzed. The results of 
these analyses are attached herewith as Exhibit B^ In Exhibit B, fc V means 
that the mRNA or protein was detectably overexpressed in the tumor tissue 
relative to normal tissue and means that no detectable overexpression 
was observed in the tumor tissue relative to normal tissue. 

As shown in Exhibit B, of the 31 genes identified as being detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the mRNA level , 28 of them (i.e., greater than 90%) are also detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the protein level . As such, in the cases where we have been able to 
quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed thai in the vast 
majority of cases, there is a very strong correlation between increases in 
mRNA expression and increases in the level of protein encoded by that 
mRNA. 



6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4-5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion thai for human genes, an increased level of mRN A in a tumor 
(issue relative to a normal tissue more often than not correlates to a similar 
increase in abundance of the encoded protein in the tumor tissue relative ro 
The normal tissue. In tact, it remains a generally accepted working 
assumption in molecular biology that increased mRNA levels are more 
often than not predictive of elevated levels of the encoded protein. In fact, 
an entire industry focusing on the research and development of therapeutic 
antibodies to treat a variety of human diseases, such as cancer, operates on 
this working assumption. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and thai all statements made on information or belief are believed to be 
true, and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validhy of the 
application or any patent issued thereon. 
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EXHIBIT A 

DECLARATION OF PAUL POLAK1S, Ph.D. 
I, Paul Polakis, Ph.D., declare and say as follows: 

1 . I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of die research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 

expressed from Uiese differentially express^^enc"transcripts~and-have-used-these 

antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in -both .the tumor and normal 
cells analyzed. 

5. From the inRNA and protein expression analyses described in paragraph 4 
above, wc have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in flic level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased Jevels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the val idity of the application or any patent issued 
thereon. 
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ABSTRACT 

Genetic changes underlie tumor progression and may lead to cancer- 
specific expression of critical genes. Over U00 publications have de- 
scribed the use of comparative genomic hybridization (CGH) to analyze 
the pattern of copy number alterations In cancer, but very few of the genes 
afTected are known. Here, we performed high-resolution CGH analysis on 
cDNA microarrays In breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to quantitate the impact of 
genomic changes on gene expression. We identified and mapped the 
boundaries of 24 Independent am pi Icons, ranging In size from 0.2 to 12 
Mb. Throughout the genome, both high- and low-level copy number 
changes bad a substantial impact on gene expression, with 44% of the 
highly amplified genes showing ovcrexpression and 103% of the highly 
overexpreased genes being amplified. Statistical analysis wltb random 
permutation tests identified 270 genes whose expression levels across 14 
samples were systematically attributable to gene amplification. These 
included most previously described amplified genes In breast cancer and 
many novel targets for genomic alterations, including the HOXB7 gene, 
the presence of which in a novel ampllcon at 17q21 J was validated In 
10.2V* of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on cDNA microarrays revealed hundreds of 

(novel genes whose overexprecslon Is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression 
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer, identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, over 
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Fig. 1 . Impact of gene copy number on global gene expression levels. A. percentage of 
over- and underexprcucd genes (Y axis) according to copy number ratios (X axis). 
Threshold valuca used for over- and umferexpreasion were >2.184 (global upper 7% of 
the cDNA ratios) and <0,4826 (global lower 7% of the expression ratios). B. percentage 
of amplified and deleted genes according to expression ratios. Threshold values for 
amplification and deletion were >1 J and <0.7. 



20 recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 3 (9, 10). However, these amplicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH microarrays to: (a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and {b) identify and characterize those genes whose mRNA expres- 



3 The abbreviations used are: CGH, comparative genomic hybridisation; FISH, fluo- 
rescence in situ hybridization; RT-PCR, reverse o^nscription-PCR. 
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Fig. 2. Gcnomc-wkte copy number and expression analysis in the MCF-7 bruit cancer cell line. A, chromosomal CGH analyiis of MCF-7. The copy number ratio profile (6fc* 
line) across the entire genome from 1 p telomere to Xq telomere is »hown along with £ 1 SD {orange Urns). The Woe* horizontal tine indicates a ratio of 1 .0; red Hm, a ratio of 0.8; 
and green line, a ratio of 1.2. &~C genome-wide copy number analysis in MCF-7 by CGH on cDNA microamy. The copy number ratios were plotted aa a function of the position 
of the cDNA clones along the human genome. In 8, individual data points are connected with a line, and a moving median of 10 adjacent clones is shown. Red hotixoniai Um, toe 
copy number ratio of 1 .0. In C, individual data points are labeled by color coding according to cDN A expression ratios. The bright red dots indicate the upper 2%, and dark red dots. 
the next 5% of the expression ratios in MCF-7 cells (overexpressed genes); bright gretn dots indicate the lowest 2%, and dark green dots, the next 5% of the expression ratios 
(underexprcsacd genes); the rest of the observations are shown with black crosses. The chromosome numbers are shown at the bottom of the figure, and chromosome boundaries am 
indicated with a dashed line. 



sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND METHODS 

Breast Cancer Cell Lines. Fourteen breast cancer cell lines (BT-20, BT- 
474, HCC1428, 1*85781, MCF7. MDA-361. MDA-436. MDA-453, MPA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manas s as, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNA Micro arrays. The 
preparation and printing of the 13,824 cDNA clones on glass slides were 
performed as described (1 1-13). Of these clones, 244 represented uncharac- 
terized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
15). Briefly, 20 *ig of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h with Atul and Rsal (Lire Technol- 
ogies, Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six 
txg of digested cell line DNAs were labeled with Cy3-dUTP (Amersham 
Pharmacia) and normal DNA with CyS-dUTP (Amersham Pharmacia) using 
the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization ( 14, 15) and 
posthybridizatioD washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagene, 
La Jolla, CA) was used in all experiments. Forty n% of reference RNA were 
labeled with Cy3-dUTP and 3.5 ng of test mRNA with Cy5-dUTP, ami the 
labeled cDNAs were hybridized on microarrays as described ( 13, 1 5). For both 
microarray analyses, a laser confocal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DE ARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. Fur 
the copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, which were spotted four times onto the 
array. Low quality measurements (i.e., copy number data with mean reference 
intensity < 100 fluorescent units, and expression data with both test and 
reference intensity <100 fluorescent units and/or with spot size <50 units) 



were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define cutpotnts for increased/ 
decreased copy number. Genes with CGH ratio > 1 .43 (representing the upper 
5% of the CGH ratios across ail experiments) were considered to be amplified, 
and genes with ratio <0.73 (representing the lower 5%) were considered to be 
deleted. 

Statistical Analysis of CGH and cDNA Microarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, > 1.43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to-noise 
statistics (1). We calculated a weight. w r for each gene as follows: 

m, , - rn^o 
. W% = 'si + <7q 

where m gXl <r gl and m^, tr^ denote the means and SDs for the expression 
levels for amplified and nonamplified cell lines, respectively. To assess the 
statistical significance of each weight, we performed 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than the original weight was denoted by a. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and AmpUcon Mapping. Each 
cDNA clone on the microarray was assigned to a Unigene cluster using the 
Unigene Build 141. 6 A database of genomic sequence alignment information 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldenPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Am pi icons were defined as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 



* Internet address: httpy/rcKwch. nhgri.nih.gov/microam 
7 Internet address: www.genomc.ucsc.edu. 
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Table I Summary of tndeptndent ampticons in 14 brtast cancer cttt tints by 
CGH microarray 



r 

Location 


aian (moj 


Ptv4 fMM 


size IMDJ 


Ipl3 


132.79 


132.94 


0.2 


Iq21 


173.92 


177.25 


3.3 


Iq22 


179.28 


179.57 


0.3 


3pl4 


71.94 


74.66 


2.7 


7pl2.I-7pllJ 


55.62 


60.95 


5.3 


7q31 


125.73 


130.96 


5.2 


7q32 


140.01 


140.68 


0.7 


8c£UI-*q2U3 


86.45 


92.46 


6.0 


8q2t.3 


98.45 


103.05 


* 4.6 


8q23.3-«q24.l4 


129.88 


142.15 


12 J 


8q24.22 


151.21 


152.16 


1.0 


9pl3 


38.65 


39.25 


0.6 


I3q22-q3l 


77.15 


81.38 


4.2 


16q22 


86.70 


87.62 


0.9 


I7qll 


29.30 


30.85 


1.6 


I7ql2-q2l.2 


39.79 


42.80 


3.0 


l7q2IJ2~q2l.33 


52.47 


55.80 


33 


l7<*22-q23.3 


63.81 


69.70 


5.9 


|7q23.3-q24J 


69.93 


• 74,99 


5.1 


I9ql3 


40.63 


41.40 


0.8 


20qll.22 


34.S9 


35.85 


1.3 


20ql3.12 


44.00 


45.62 


1.6 


20qi3.IZ-ql3.l3 


46.45 


49.43 


3.0 


20ql3.2-ql3.32 


51.32 


59.12 


7.8 



CGH were validated, with tq21, l7qlZ— q2L2, 17q22-q23, 20qI3.l, 
and 20ql3.2 regions being most cornnrionly amplified. Furthermore, 
the boundaries of these amplicons wcire precisely delineated. In ad- 
dition, novel amplicons were identified at 9pl3 (38.65-39.25 Mb), 
and 17q21.3 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target Genes. 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression «iata on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lp!3, 17q22-q23, and 20ql3 were highly overex- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates EGFR as the most highly overexpressed and amplified 
gene at 7pl l-p!2 (Fig. 3-4). In BT-4 74, the two known amplicons 
at I7ql2 and 17q22-q23 contained numerous highly overex- 
pressed genes (Fig, IB). In addition, several genes, including the 
homeobox genes HOXB2 and HOXB 7, were highly amplified in a 
previously undescribed independent amplicon at 17q21.3. HOXB7 
was systematically amplified (as validated by FISH, Fig. 3£. inset) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-*474, UACC812, and ZR-75-30 cells. Furthermore, this novel 



extended to include neighboring oonamplifted clones (ratio* <1.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RPl 1-361K8 was la- 
beled with SpcctnimOrange (Vysta, Downers Grove, IL), and Spectrum* 
Orange-labeled probe for EGFR was obtained from Vysts. SpectnimCreen- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formal in- fixed, paraffin -embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
( 1 8). The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the N1H. Specimens containing a 2* fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere signals, in at least 10% of the tumor cells were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test. 

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, Wl) with 10 ng of mRNA 
as a template. HOXB7 primers were 5 '-G AGCAOAGGGACTCGGACTT-3 ' 
and 5' -GCOTCAGGTAGCG ATTGT AG-3' . 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH mtcroarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (i.e.. belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. 1A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. \B\ Low-level copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 1 1 ,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of ON A (Table 
1), Several amplification sites detected previously by chromosomal 
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Fig. 3. Annotation of gene expression data on CGH mferoamy profile*. 4, genes in the 
7p! I -pi 2 imp I icon in the MDA-468 cell line are highly expressed (red dots) and include 
the EGFR oncogene. B. several genes in the I7ql2, I7q21.3, and I7q23 amplicons to the 
BT-474 breast cancer cell line sre highly ovcrexpTcssed (rwf) and include the HOXB 7 
gene. The data labels and color coding are as indicated for Fig. 2C. Insctj show 
chromosomal CQH profiles Tor the corresponding chromosomes and validation of the 
increased copy number by interphase FISH using EGFR (red) and chromosome 7 
centromere probe (gretn) to MDA-468 {A) and ffOXB7-tpecific probe {red) and chro- 
mosome 17 centromere (grmn) to BT-474 cells (B). 
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Pig. 4. List of 50 gene* with a statiitically 
significant correlation (a value <0.05) between 
gene copy number and gene expression. Name, 
chromosomal location, and the a value for each 
gene are indicated. The gene* have been ordered 
according to their position in the genome. The color 
mops on the ngfc/ illustrate the copy number and 
expression ratio patterns in toe 14 cell lines. The 
key to the color code it shown at the bottom of the 
graph. Gray squares, missing values. The complete 
list of 270 genes is shown in supplemental Fig. B. 




amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P = 0.001). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Amplicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across all 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data, 8 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) are implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



* Internet address; hup^www.geneontology.orgf. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in >1000 publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantia] with the most dramatic effects seen in the case of high- 



9 Internet address: httpy/www.ncbi.nJm.nih.gov/entrcz. 
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level copy number increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
low-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH raicroarray analysis identified 24 independent breast 
cancer amp] icons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
involved the bomeobox gene region at 1 7q2 1 .3 and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transaction induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing -2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2, AfYC, 
EGFRy ribosomal protein s6 kinase, and AJB3, but also numerous 
novel genes such as NRAS-related gene (lpl3), syndecan-2 (8q22), 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, ceil prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (£>) a high-resolution map of 24 independent 
amplicons in breast cancer; and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
17q2L3 implicated amplification and overexpression of the 
HOXB7 gene in breast cancer, including a clinical association 



between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development. 
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Genomic DNA copy number alterations are key genetic events in 
the development and progression of human cancers. Here we 
report a genome-wide microarray comparative genomic hybrid- 
ization (array CGH) analysis of DNA copy number variation in 
a series of primary human breast tumors. We have profiled DNA 
copy number alteration across 6,691 mapped human genes. In 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer cell tines. While the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
resolution (gene-by-gene) mapping of amplicon boundaries and 
the quantitative analysis of amplicon shape provide significant 
improvement in the localization of candidate oncogenes. Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation In gene copy number contributes to 
variation in gene expression In tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number influences gene ex- 
pression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-foW change in DNA copy number is associated with a corre- 
sponding 1.5-fold change in mRNA levels, and that overall, at least 
12% of all the variation In gene expression among the breast 
tumors Is directly attributable to underlying variation in gene copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (1), have led to the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors (2-4). While 
some of these regions contain known or candidate oncogenes 
[eg. FGFR1 (8pll), MYC (8q24), CCND1 (Uql3), ERBB2 
(17ql2), and ZNF217 (20ql3)j and tumor suppressor genes 
[RBI (13ql4) and TP53 (I7pl3)], the relevant gene(s) within 
other regions (e.g., gain of Iq, 8q22, and 17q22-24, and loss of 
Bp) remain to be identified. A high-resolution genome-wide 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy number alteration 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel in 
the same samples (8), using the same DNA microarrays. we had 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression. From 



this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods 

Tumors and Cell Lines. Primary breast tumors were predominanUy 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive. The 
fraction of tumor cells within specimens averaged at least 50%. 
Details of individual tumors have been published (8, 9), and 
are summarized in Table 1, which is published as supporting 
information on the PNAS web site, www.pnasjorg. Breast cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by elhanol precipitation. 

DNA Labeling and Microarray rrybridbatlons. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
in Pollack et aL (7), with slight modifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly. 'Test** DNA 
(from tumors and cell lines) was f luorescentry labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (i.e., UniGene clusters). The 
"reference" (labeled with Cy3) for each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePix scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbl.gov). Fluorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to 0. Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles arc 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions for arrayed human cDNAs were assigned by 
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identifying the starting position of the best and longest match of 
any DNA sequence represented in the corresponding UniGene 
cluster (10) against the "Golden Path" genome assembly 
(http://genomc.ucsc.edu/; Oct 7, 2000 Freeze). For UniGene 
clusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for ail elements representing the same UniGene 
cluster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (i.c., reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
cDNA microarrays containing 6,691 different mapped human 
genes (Fig. la; also see Materials and Methods for details of 
microarray hybridizations). To take full advantage of the im- 
proved spatial resolution of array CGH, we ordered (fluores- 
cence ratios for) the 6,691 cDNAs according to the "Golden 
Path" (http://genome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g., 
candidate oncogenes within amplicons), but also provide precise 
genetic landmarks for chromosomal regions of amplification and 



deletion. Parallel analysis of DNA from cell lines containing 
different numbers of X chromosomes (Fig. 16), as we did before 
(7), demonstrated the sensitivity of our method to detect single- 
copy loss (45, XO), and 13- (47.XXX), 2* (48^CXXX) t or 
23-fold (49.XXXXX) gains (also see Fig. 5, which is published 
as supporting information on the PNAS web site). Fluorescence 
ratios were linearly proportional to copy number ratios, which 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig. la), detected in the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss were readQy iden- 
tifiable. For example, gains within lq, 8q, 17q, and 20q were 
observed in a high proportion of breast cancer cell lines/ tumors 
(90%/69%, 100%/47%, 100%/60%, and 90%/44%, respective- 
ly), as were losses within lp, 3p. 8p, and 13q (80%/24%, 
80%/22%, 80%/22%, and 70%/18%, respectivery), consistent 
with published cytogenetic studies (refe. 2-4; a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the WAS web site). The total 
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«o. 2. DNA copy number » Iteration across chromosome 8 by array CGH. (a) ONA copy number profiles are illustrated for cell lines containing different number! 
of X chromosomes* for breast cancer cell lines, and for breast tumors. Breast cancer cell lines and tumors are separately ordered by hierarchical clustering to 
highlight recurrent copy number changes. The 241 genes present on the mlcroerrays and mapping to chromosome 8 are ordered by position along the 
chromosome. Fluorescence ratios (test/reference) are depicted by a tog 2 pseudocolor scale (indicated). Selected genes are Indicated with color-coded text (red 
increased; green, decreased; black, no change; gray, not well measured) to reflect correspondingly altered mKMA levels (observed In the majority of the subset 
of samples displaying the DNA copy number change). The map positions for genes of interest that are not represented on the mkroarray ere indicated In the 
row above those genes represented on the array, (b) Graphical display of DNA copy number profile for breast cancer cell line SKBM. Fluorescence ratios 
(tumor/normal) are plotted on a log 2 Kale for chromosome 8 genes, ordered along the chromosome. 



number of genomic alterations (gains and losses) was found to 
be significantly higher in breast tumors that were high grade (P = 
0.008), consistent with published CGH data (3), estrogen recep- 
tor negative (P » 0.04), and harboring TP53 mutations (P = 
0.0006) (see Table 4, which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array CGH analysts is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of amplkon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 7b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supporting 
information on the PNAS web site). 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
microanrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 
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Rg. 3. Concordance b«tw**fi DMA copy number end gens expression across chromosome 17. DNA copy number alteration (Upper) and mRMA levels (Lower) 
are Illustrated for breast cancer ceU lines and tumors. Breast cancer cell lines and tumors are separately ordered by hierarchical clustering {Upper), and the 
identical sample order Is maintained {Lower). The 354 genes present on the mkroarrayt and mapping to chromosome 17, and for which both DMA copy number 
and mRNA levels were determined, arc ordered by position along the chromosome; selected genes are Indicated in color-coded text (see Rg. 2 legend). 
Fluorescence ratios (test/reference) are depicted by separate togi pseudocolor scales (indicated). 



of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). Hie overall patterns of gene amplification and 
elevated gene expression are quite concordant; he., a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2) t and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6 t 095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4a). For both the 



breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student's 
/ tests comparing adjacent classes: cell lines, 4 x 10" 49 , 1 x 10~ 49 , 
5 x 10-\ 1 x 10-*; tumors, 1 x 10^« 1 x lQ-* ,4 f 5 x 1Q- 41 , 
1 x 10" 4 ). A linear regression of the average tog(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1.4- and 1.5-fold changes in mRNA 
level for the breast cancer cell lines and tumors, respectively (Fig. 
4a, regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 46). 
The distribution of correlations forms a normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig. 46) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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Fig. 4. Genome-wWe influence of DNA copy number alteration! on mRNA levels, (a) For breast cancer celt line* (gray) end tumor samples (black), both 
mean-centered mRNA fluorescence ratio (looj state) quertffes (box plots indicate 25th. 50th. and 75th percentile) and averages (diamonds; V-vakie error bars 
Indicate standard errors of the mean) are plotted for each of five classes of genes, representing DNA deletion (tumor/normal ratio < 0~fl)» no change (&6-1 
low- (1.2-2), medium- (2-4), and high-level (>4) amplification. P values for pair-wise Student's t tests, comparing averages betw e en adjacent dasses (moving 
lefttoright).are4 x \Q— t 1 x 1<r« 5 x 10-*, 1 x I0" J (cell «nes). and 1 x 10-* 1 x 10-*« 5x 10~«\ 1 x 1 0" 4 (tumors). (0) Distribution of correlatlom between 
DNA copy number and mRNA levels, for 6,095 different human genes across 37 breast tumor samples. (0 Wotof observed versus expected correlation coefficients. 
The expected values were obtained by randomization of the sample labels in the DNA copy number data set The line of unfty Is indicated, (d) Percent variance 
in gene expression (among tumors) directly explained by variation In gene copy number. Percent variance explained (black line) and fraction of data retained 
(gray line) are plotted for different fluorescence intensity/background (a rough surrogate for signal/noise) cutoff values. Fraction of data retained b relative 
to the 1.2 Intensity/background cutoff. Details of the line at regression model used to estimate the fraction of variation in gene expression attributable to 
underlying DNA copy number alteration can be found In the supporting information (see Estimating the fraction of Variation In Gene Expression Attributtble 
to Underlying DNA Copy Number Alteration). ■ 



tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
7% of all of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. 4V/). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data roost reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. 44). This still undoubtedly 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a scries of human breast tumors demonstrates the 
usefulness of defining amplicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be gencralizable 
(but would nevertheless still be remarkable if only applicable to. 
this set of -6,100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with our findings, Phillips er oL (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzcr er aL (15) recently reported that in metastatic 
colon rumors only -4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer et al. (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unamplified chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lowcr-resotution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape [to identify the "driving" gene(s) 
within amplicons (16)], on a large number of samples, m addition 
to functional studies. Fourth, this finding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneupkridy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and by extension* the heterogeneity in clinical 
behavior) among patients* tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis (17, 18), beyond the amplification of specific 
oncogenes and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochioroetric relationships in cell metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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ABSTRACT The consistent cytogenetic translocation of 
chronic myelogenous leukemia (the Philadelphia chromosome, 
Ph 1 ) has been observed in cells of multiple hematopoietic 
lineages. This translocation creates a chimeric gene composed 
of breakpolnt-cluster-reglon (bcr) sequences from chromosome 
22 fused to a portion of the abl oncogene on chromosome 9. the 
resulting gene product (P210 c ** bl ) resembles the transforming 
protein of the Abelson murine leukemia virus in its structure 
and tyrosine kinase activity. P210 c *' bl is expressed in Ph 1 . 
positive cell lines of myeloid lineage and in clinical specimens 
with myeloid predominance. We show here that Epstein-Bar r 
virus-transformed B-lymphocyte lines that retain Ph 1 can 
express V2W^ X 9 The level of expression in these B-cell lines is 
generally lower and more variable than that observed for 
myeloid lines. Protein expression is not related to amplification 
of the abl gene but to variation in the level of bcr-abl mRNA 
produced from a single Ph 1 template. 



Chronic myelogenous Leukemia (CML) is a disease of the 
pluripotent stem cell (1). In greater than 95% of patients, the 
leukemic cells contain the cytogenetic marker known as the 
Philadelphia chromosome, or Ph 1 (2). This reciprocal 
translocation event between the long arms of chromosomes 
9 and 22 has been used as a disease-specific marker for 
diagnosis and evaluation of therapy. Multiple hematopoietic 
lineages, including myeloid and B-lymphoid, contain Ph 1 in 
early or chronic phase , as well as in the more acute accel- 
erated and blast crisis phases of the disease. 

One molecular consequence of Ph 1 is the translocation of 
the chromosomal arm containing the c-abl gene on chromo- 
some 9 into the middle of the breakpoint-cluster region (bcr) 
gene on chromosome 22 (3-6). Although the precise 
translocation breakpoints are variable, an RN A- splicing 
mechanism generates a very similar 8-kilobase (kb) mRNA in 
each , case (5-9). The hybrid bcr-abl message encodes a 
structurally altered form of the abl oncogene product, called 
P210 c " abl (10-13), with an amino-terminal segment derived 
from a portion of the exons of bcr on chromosome 22 and a 
carboxyl-terminal segment derived from a major portion of 
the exons of the c-abl gene on chromosome 9. The chimeric 
structure of bcr-abl and the resulting P210 c ' abl is similar to the 
structure of the Abelson murine leukemia virus gag-abl 
genome and resulting P160 v ' abl transforming gene product. 
Both proteins have very similar tyrosine kinase activities (10, 
11, 14) which can be distinguished by their relative stability 
to denaturing detergents and by their ATP requirements from 
the recently described tyrosine kinase activity of the c-abl 
gene product (15). 
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In concert with structural modification of the amino- 
terminal portion of the abl gene , increased level of expression 
has been implicate^ in activation of c-abl oncogenic poten- 
tial. Myeloid and erythroid cell lines and clinical samples 
derived from acute-phase CML patients contain about 10- 
fold higher leyels of the 8-kb bcr-abl mRNA and P210 c - abl than 
the c-abl mRNA forms (6 and 7 kb) and P145 cabI gene product 
(5, 8, 9, 11). The higher level of expression of the chimeric 
bcr-abl message in acute-phase cells is not likely to be solely 
due to the presence of the bcr promoter sequences at the 5' 
end of the gene, since the normal 4.5-kb and 6.7-kb bcr- 
encoded mRNA species are expressed at an even lower level 
than the normal c-abl messages (5,6).. 

We have analyzed a series . of Epstein-Barr vims-immor- 
talized B-lymphoid cell lines derived from CML patients (16). 
With such in vitro clonal cell lines, we can evaluate whether 
the presence of Ph 1 always results in synthesis of the chimeric 
bcr-abl message and protein, and whether the quantitative 
expression varies for cells of B-lymphoid lineage as com- 
pared to previously examined myeloid cell lines. Our results 
show that cell lines that retain Ph 1 do express bcr-abl message 
and protein, but that the level is generally lower and more 
variable* than previously seen for myeloid cell lines. The 
demonstration that the Ph 1 chromosomal template can vary 
in its level of expression of P210 c aW suggests that secondary 
mechanisms, beyond the translocation itself, contribute to 
the regulation of the bcr-abl gene in different cell types or 
subclones that derive from the affected stem cell. 

MATERIALS AND METHODS 

Cells and Cell Labelings. Epstein-Barr virus-transformed 
B-lymphoid cell lines were established from peripheral blood 
samples of chronic- and acute-phase CML patients as report* 
ed (16). the cell lines are designated according to patient 
number, karyotype, and lineage. For example, SK- 
CML7Bt(9,22)-33 refers to CML patient 7, B-lymphoid cell 
line, 9;22 translocation (Ph 1 ), cell line 33; and SK-CML7BN- 
2 refers to B-ceil line 2 with a normal karyotype derived from 
the same patient. Repeat karyotype analysis was performed 
to verify the retention of Ph 1 just prior to analysis for abl 
protein and RNA. Cells were maintained in RPMI 1640 
medium with 20% fetal bovine serum. We have not observed 
any consistent pattern of in vitro growth rate that correlates 
to the stage of disease at the time of transformation with 
Epstein-Barr virus. Cells (1.5 x 10 7 ) were washed twice with 
Dulbecco's modified Eagle's medium lacking phosphate and 
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supplemented with 5% dialyzed fetal bovine serum. Cells 
were then resuspended in 2 ml of the minimal medium. 
Labeling was started with the addition of [ 32 P]orthophos- 
phate (1 mCi/ml; ICN; 1 Ci = 37 GBq) and continued at 37°C 
for 3-4 hr. 

Immunoprecipitation and Immunoblotting. Immunoprecip- 
itations were carried out as described (10). Cells (1.5 x IQ 7 ) 
were washed with phosphate-buffered saline and extracted 
with 3-5 ml of phosphate lysis buffer (1% Triton X-100/0.1 
NaDodSO 4 /0.5% deoxycholate/10 mM Na 2 HP0 4 , pH 7.5/ 
100 mM NaCl) with 5 mM EDTA and 5 mM phenylmethyl- 
sulfonyl fluoride. Extracts were clarified by centrifugation 
and precipitated with normal or rabbit anti-abl sera (anti- 
pEX-2 or anti-pEX-5) (17). The precipitated proteins were 
electrophoresed in a NaDodS0 4 /8% polyacrylamide gel. 
32 P-labeled proteins were detected by autoradiography. 
Alternatively* abl proteins were detected by immunoblotting. 
Extracts from unlabeled cells were clarified, and proteins 
were concentrated by immunoprecipitation with rabbit anti- 
sera against aW-encoded proteins [anti-pEX-2 and anti-pEX- 
5 combined (17)] and then fractionated in %% acrylamide gels. 
The proteins were transferred from the gel to nitrocellulose 
filters, using protease-facilitated transfer (18). The abl- 
encoded proteins were detected using murine monoclonal 
antibodies as a probe and peroxidase-conjugated goat anti- 
mouse second stage antibody (Bio-Rad) for development. 
Rabbit antisera and mouse monoclonal antibodies to abl 
proteins were prepared using bacterially expressed regions of 
the v-abl protein as immunogens (17, 19). Anti-pEX-2 anti- 
bodies react with the internal tyrosine kinase domain and 
anti-pEX-5 antibodies react with the carboxyl-terminal seg- 
ment of the abl proteins. 

RNA Analysis. RNA was extracted from 10 s cells by the 
NaDodSO^/urea/phenol method (20). Polyadenylylated 
RNA was purified by oligo(dT) affinity chromatography. 
Samples were electrophoresed in a 1% agarose/formalde- 
hyde gel and transferred to nitrocellulose, abl RNA species 
were detected by hybridization with a nick-translated v-abl 
fragment probe (21); 

DNA Analysts. DNA was prepared from 5 x 10 7 cells of 
each cell line and processed for Southern blots with a v-aW 
probe as described (21). 

RESULTS 

Variable Levels of P210 cabl Are Detected in Ph'-Posltive Cell 
Lines. Ph^positive and Ph^negative, Epstein-Barr virus- 
transformed B-lymphocyte cell lines derived from the same 
patient were examined for P210 c ' abl synthesis by immuno- 
precipitation of [ 3i P]orthophosphate-labeled cell extracts 
with anti-abl sera (Fig. 1). The normal c-abl protein P145 c ' ttbl 
was detected at a similar level in multiple Prepositive and 
Ph L -negative cell lines. P210 cabl was only detected in the 
Prepositive cell lines because the bcr-abl chimeric gene 
which encodes P210 c ab! resides on the Ph 1 (4, 5, 11, 13). The 
level of P210 cabl was about 4- to 5-fold higher than the level 
of P145 c * abl in the SK-CML7Bt-33 cell line (Fig. 1A, +). The 
Ph l -positive erythroid-progenitor cell line K562 (C) showed 
a level of P210 cabl about 10-fold higher than P145 c " abl . 
However, the level of P210 c abl was about one-fifth that of 
P145 c abI in the. Prepositive SK-CML16BM cell line (Fig. IB, 
+). Comparison of different autoradiographic exposures 
roughly indicated that the level of P210 c ab ' varies over a 
20-fold range between these Ph 1 -positive B-cell lines. Anal- 
ysis of four additional Prepositive B-cell lines demonstrated 
that the level of P210 c " abl fell into two general classes; some 
cell lines had a level of P210 c abl similar to SK-CML7Bt-33 
and others had the low level similar to SK-CML16Bt-l (Table 
1). This differs from previous studies with Ph l -positive 
myeloid cell lines and patient samples derived from acute- 
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Fio. 1. Detection of variable levels of P210 c * bI in Ph l -positive 
B-cell lines. Production of P145 c lbl and P210 c - ftbl in Epstein-Barr 
virus-transformed B-cell lines derived from a blast-crisis (A) and a 
chronic-phase (B) CML patient was examined by metabolic labeling 
with [ 32 P]orthophosphate and immunoprecipitation. Ph ^negati ve 
(-) and Ph^positive (+) cell lines derived from each patient were 
analyzed. The Ph l -negative cell line in A , - is SK-CML7BN-2 and in 
B- is SK-CML16BN-1. The Ph l -positive cell line in A t + is 
SK-CML7BI-33 and in is SK-CML16Bt-l. The K562 cell line, a 
Ph'-positive erythroid progenitor cell line spontaneously derived 
from a blast-crisis patient (33), is represented in C. Cells (1.5 x 10 7 ) 
were metabolically labeled with 2 mCi of [ 3J P]orthophosphate for 3-4 
hr and then were extracted and clarified by centrifugation. Samples 
were immunoprecipitated with control normal serum (lanes 1), 
anti-pEX-2 (lanes 2), or anti-pEX-5 flanes 3) and analyzed by 
NaDodS0 4 /8% PAGE followed by autoradiography with an inten- 
sifying screen (3 days for A and C, 10 days for B). 

phase CML patients, in which P21Q c * bI was detected at a 
10-fold higher level than P145 c abI (refs. 10 and 11; Table 1). 
There was no large difference in level of chimeric mRNA and 
P210 c ' abl expressed in four myeloid/erythroid-lineage Ph 1 - 
positive cell lines (K562, EM2, EM3, CML22, and BV173; 
refs. 9 and 11), despite a 4- to 5-fold amplification of 
<zW-related sequences in the K562 cell line. 

Detection of different levels of P210 c abl ip Fig. 1 could be 
due to decreased phosphorylation of P210 c_?bl t a lower level 
of P210 c * abl synthesis, or altered stability of the protein. To 
help distinguish among these possibilities, the steady-state 
level of P210 c aW in the cell lines was assayed by immuno- 
blotting. The results show that SK-CML7Btr33 (Fig. 2A, +) 
had a higher level of P210 c ' abI than P145, similar to the results 
with metabolic labeling (Fig. 1). We did not detect P210 C ftbl 
by immunoblotting with 2 x 10 7 cells of line SK-CML8Bt-3 
(Fig. 2B, +). Reconstruction experiments using dilutions of 
cell extracts showed that we could detect about 5-10% the 
level of P210 c " abl expressed in the K562 cell line (data not 
shown). We infer that the steady-state level of P210 c abl in 
SK-CML8Bt-3 is lower than, the level in SK-CML7Bt-33 by 
a factor of at least 10. The level of P210 c_abl detected in these 
assays correlated with the amount of P210 c " abl tyrosine kinase 
activity that could be detected in vitrg (data not shown). 

Different Levels of P210 c ~* b ? Are Reflected In the Amount of 
Stable bcr-abl mRNA. To identify the basis for detection of 
variable levels of P210 c * abl f we examined the production of 
the abl RNA. RNA blot hybridization analysis using a v-abl 
probe (Fig. 3) showed that the normal 6- and 7-kb c-abl 
mRNAs were present at a similar level in Ph l -positive and 
-negative cell lines derived from different patients. However, 
the 8-kb mRNA that encodes P210 c " abl was detected at a 
10-fold higher level in SK-CML7Bt-33 (Fig. 3A, +) than in 
SK-CML16BM (B, +), which correlated with the relative 
level of P210 c abl detected in each cell line; Analysis of 
additional cell lines demonstrated that the level of 8-kb RNA 
directly correlated with the level of P210 c abl (Table 1). The 
variation in level of 8-kb RNA detected in these cell lines was 
not due to loss or gain of Ph 1 , because cytogenetic analysis 
confirmed the presence of Ph 1 in these cell lines (ref. 16 and 
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Table 1. Relative levels of bcr-abl expression in Epstein-Barr 
virus-immortalized B-cell lines and myeloid CM L lines 

8-kb 
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Cell line* 


CML phase 1 " 


Ph 1 * 


P210S 




o K-LML7 dN-2 


BC 






— 


Cgf PUT fiDXT 1A 


Chronic 










Chronic 








SK-CML16BN-1 


Chronic 








SK-CML35BN-1 


Chronic 








SK-CML7B5-33 


BC 


+ 


+ + + 


+++ 


SK-CML21BM 


Acc 


+ 


+ + + 


++ + 


SK-CML21Bt-6 


Acc 


+ 


+++ 


+++ 


SK-CMUBt-3 


Chronic 


+ 


+ 




SK-CML16BM 


Chronic 


+ 


+ 


+ 


SK-CML35Bt-2 


Chronic 


+ 


+ 


+ 


K562 


BC 


■ + . 


*+ + + + + 


+ + + + + 


BV173 


BC 


+ 


+ + + + + 


+ + + + + 


EM2 


BC 


+ 


+ + + + + 


+ + + + + 



♦Cell lines derived from CML patients by transformation with 
Epstein-Barr virus as described (16). Names of cell lines indicate 
patient number and Ph 1 status: SK-CML7Bt indicates a cell line 
derived from patient 7 that carries the 9;22 Ph 1 translocation; N 
indicates a normal karyotype. Myeloid-eiythroid cell lines (K562, 
EM2, and BV173) are described in previous publications (9, 11, 22, 

^Status of patient at the time cell line was derived, BC f blast crisis; 
Acc, accelerated phase. 

♦Presence (+) or absence (-) of Ph 1 as demonstrated by karyotypic 
or Southern blot analysis. 

8P210 c - aW detected as described in legend to Fig. 1. B-cell lines 
derived from blast-crisis and accelerated-phase patients had levels 
of P210 3- to 5-fold higher (+ + +) than levels of P145. Chronic- 
phase-derived cell lines had P210 levels lower than or just equivalent 
(+) to the level of P145. Myeloid and erythroid lines had levels of 
P210 5- to 10-fold higher than P145 (+ + +++). 

*Eight-kilobase bcr-abl mRNA detected as described in legend to 
Fig. 2. Symbols: ±, borderline detectable; + + + + + , level of 8-kb 
mRNA 5- to 10-fold higher than that of the 6- and 7-kb c>abl mRNA 
species; + + + , level of 8-kb mRNA 3- to 5-fold higher than that of 
the 6- and 7-kb species; +, a level approximately equivalent to that 
of the 6- and 7-kb messages. 

data not shown). There was no difference in the copy number 
of aM-related sequences as judged by Southern blot analysis 
(Fig. 4). Only the K562 cell line control showed an amplifi- 
cation of abl sequences, as previously reported (22, 23). 
These combined data suggest that differential bcr-abl mRNA 
expression from a single gene template is responsible for the 
variable levels of P210 c - ftW detected. This could be mediated 
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Fig. 2. Analysis of steady-state abl protein levels by immuno- 
blotting. Cell extracts prepared from 2 x 10 7 cells of lines SK- 
CML7BN-2 (A,-), SK-CML7Bt-33 (A,+), SK-CML8BN-10 (*,-), 
and SK-CML8Bt-3 (£,+) were concentrated by immunoprecip- 
itation with anti-pEX-2 plus anti-pEX-5. Samples were then electro- 
phorescd in a NaDodS0 4 /8% polyacrylamide gel and transferred to 
nitroceUulose, using protease-facilitated transfer (18). abl proteins 
were detected using a mixture of two monoclonal antibodies directed 
against the pEX-2 and pEX-5 aW-protein fragments produced in 
bacteria (19) as a probe and a peroxidase-conjugated goat anti-mouse 
second-stage antibody (Bio-Rad) for development. 





Fig. 3, Comparison of abl RNA levels in Ph'-positive and 
-negative B-cell lines. The levels of the normal 6- and 7-kb c-abl 
RNAs and the 8-kb bcr-abl RNA were analyzed by blot hybridization 
using a v-abl probe. RNA was extracted from Ph'-negative lines 
SK-CML7BN-2 (A,-) and SK-CML16BN-1 (*,-), from Ph'-pos- 
mve lines SK-CML6Bt-33 (A,+) and SK-CML16Bt-3 (B,+), and 
from line K562 (C.+) by the NaDodS0 4 /urea/phenol method (20). 
Polyadenylylated RNA was purified by oligo(dT) affinity chroma- 
tography, and 15 ng of each sample was electrophoresed in a 1% 
agarose/formaldehyde gel and then transferred to nitrocellulose. The 
blotted RNAs were hybridized with a nicktranslated v-abt fragment 
probe (21) and then autoradiographed for 4 days. 

by factors influencing the transcription rate of the bcr-abl 
gene or the stability of the mRNA. 

DISCUSSION 

Several lines of evidence suggest that formation of Ph 1 is not 
the primary event that affects the stem cell in CML. Patients 
have been identified that present with the clinical picture of 
CML but only later develop Ph 1 (1). This observation, 
coupled with studies of G6PD (glucose-6-phosphate dehy- 
drogenase)-heterozygous females with CML that demon- 
strate stem-cell clonality by isozyme analysis among cell 
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Fig. 4. Southern blot analysis of abl sequences in Ph^positive 
and -negative B-cell lines. High molecular weight DNA (15 pg) was 
digested with restriction endonuclease flamHI, separated in a 0.8% 
agarose gel, and then transferred to nitrocellulose. The blotted DNA 
fragments were hybridized with a nick-translated, 2.4-kb Bgl II v-abl 
fragment (1.5 x 10* cpm/^g; ref. 21) and exposed for 4 days. (A) 
Autoradiogram of aW-specific fragments in cell lines HL-60 (lane 1), 
EM2 (lane 2), K562 (lane 3), SK-CML7Bt-33 (lane 4), SK-CML8Bt-3 
(lane 5), SK-CML16BM (lane 6), SK-CML21Bt-6" (lane 7), SK- 
CML35Bt-2 (lane 8), SK-CML7BN-2 (lane 9). SK-CML8BN-2 (lane 
10), and SK-CML35BN-1 (lane 11). (B) Ethidium bromide staining of 
agarose gel prior to transfer to nitrocellulose, showing the level of 
variation in amount of DNA loaded per lane. 
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2. Between 1 993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3 . My scientific Curriculum Vita6, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 



Serial No.: * 
Filed: * 



4. • I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (i.e., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et al. Biotechnology 10:413-417 (1992) (Exhibit B); Livak et al. 9 PCR 
Methods AppL 4:357-362 (1995) (Exhibit C) and Heid et aL. Genome Res. 6:986-994 (1996) - 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et al, Proa 
Natl. Acad. Sci. USA 95(25):14717-14722 (1998) (Exhibit E); Pitti et al, Nature 
396(6712):699-703 (1998) (Exhibit F) and Bieche etal. Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et al have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et al studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et al used the assay to study gene amplification in breast cancer. 



Serial No.: * 
Filed:* 



7. It is my personal experience that the : quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i:e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereoa 

Date / 

Audrey D. Goddard, Ph.D. 



AUDREY D. GODDARD, Ph.D. 



110 Congo St. 

San Francisco, CA t 94131 

415.841.9154 

415.819.2247 (mobile) 

agoddard@pacbell.net 



PROFESSIONAL EXPERIENCE 

Gerientech, Inc. 1993-present 
South San Francisco, CA 

2001 - present Senior Clinical Scientist 

Experimental Medicine / BioOncology, Medical Affairs 

Responsibilities: 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Genentech's clinical trials for translational research 

• Translational research using clinical specimen and data for drug development and 
diagnostics 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part 1 1 Subteam 

Interests: 

• Ethical and legal implications of experiments with clinical specimens and data 

• Application of pharmacogenomics in clinical trials 

1998 - 2001 Senior Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities: 

• Management of a laboratory of up to nineteen -including postdoctoral fellow, associate 
scientist senior research associate and research assistants/associate levels 

• Management of a $750K budget 

• DNA sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery, ~ ESTs, cDNAs, and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel [genes 

• Identification and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 



Genentech, Inc. 
1 DNA Way 

South San Francisco, CA, 94080 

650.225.6429 

goddarda@gene.com 
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1993-1998 



Scientist 



Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities 

• DNA sequencing core facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into fifteen, introducing a level of middle management and additional areas of research 

• Participated in the development of the basic plan for high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene identification: 

• Design and implementation of analysis tools required for high throughput gene identification. 

• Chromosomal localization of genes encoding novel secreted proteins. 

Research: 

• Genomic sequence scanning for new gene discovery. 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

• Growth hormone receptor gene SNPs in children with Idiopathic short stature 

Imperial Cancer Research Fund 1989-1992 
London, UK with Dr. Ellen Solomon 

6/89-12/92 Postdoctoral Fellow 

• Cloning and characterization of the genes fused at the acute promyelocytic leukemia 
translocation breakpoints on chromosomes 17 and 15. 

• Prepared a successfully funded European Union multi-center grant application 

McMaster University 1983 
Hamilton, Ontario, Canada with Dr. G. D. Sweeney 

5/83 - 8/83: NSERC Summer Student 

• In vitro metabolism of p-naphthoflavone in C57BI/6J and DBA mice 



EDUCATION 



Ph.D. 



University of Toronto 
Toronto, Ontario, Canada. 
Department of Medical 
Biophysics. 



"Phenotypic and genotypic effects of mutations in 
the human retinoblastoma gene." 
Supervisor: Dr. R. A. Phillips 



1989 



Honours B.Sc 

The in vitro metabolism of the cytochrome P-448 
inducer p-naphthoflavone in C57BL/6J mice." 
Supervisor: Dr. G. D. Sweeney 



McMaster University, 
Hamilton, Ontario, Canada. 
Department of Biochemistry 



1983 
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ACADEMIC AWARDS 

Imperial Cancer Research Fund Postdoctoral Fellowship 1989-1992 

Medical Research Council Studentship 1983-1988 

NSERC Undergraduate Summer Research Award ig83 

Society of Chemical Industry Merit Award (Hons. Biochem.) 1983 

Dr. Harry Lyman Hooker Scholarship 1981-1983 

J.L.W. Gill Scholarship 1981-1982 

Business and Professional Women's Club Scholarship 1980-1981 

Wyerhauser Foundation Scholarship 1979-1980 



INVITED PRESENTATIONS 

Genentech's gene discovery pipeline: High throughput identification, cloning and 
characterization of novel- genes. Functional Genomics: From Genome to Function Litchfield 
Park.AZ, USA. October 2000 uicmiera 

High throughput identification, cloning and characterization of novel genes G2KBack to 
Science, Advances in Genome Biology and Technology I. Marco Island, FL, USA. February 

Quality control in DNA Sequencing: The use of Phred and Phrap. Bay Area Sequencing 
Users Meeting, Berkeley, CA, USA. April 1999 

High throughput secreted protein identification and cloning. Tenth International Genome 
Sequencing and Analysis Conference, Miami, FL, USA. September 1998 

The evolution of DNA sequencing: The Genentech perspective. Bay Area Sequencinq Users 
Meeting, Berkeley, CA, USA. May 1998 

Partial Growth Hormone Insensitivity: The role of GH-receptor mutations in Idiopathic Short 
Stature. Tenth Annual National Cooperative Growth Study Investigators Meetina San 
Francisco, CA, USA. October, 1996 

Growth hormone (GH) receptor defects are present in selected children with non-GH-deficient 
short stature: A molecular basis for partial GH-insensitivity. 76 th Annual Meeting of The 
Endocrine Society, Anaheim, CA, USA. June 1994 

A previously uncharacterized gene, myl, is fused to the retinoic acid receptor alpha gene in 
acute promyelocytic leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy. October 1991 
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PATENTS 

Goddard A ( Godowski PJ t Gurney AL. NL2 Tie ligand homologue polypeptide. Patent 
Number: 6,455,496. Date of Patent: Sept. 24, 2002. 

Goddard A, Godowski PJ and Gurney AL NL3 Tie ligand homologue nucleic acids. Patent 
Number: 6,426,218. Date of Patent: July 30, 2002. 

Godowski P, Gurney A, Hillan KJ, Botstein D, Goddard A, Roy M t Ferrara N, Tumas D, 
Schwall R. NL4 Tie ligand homologue nucleic acid. Patent Number: 6,4137,770. Date of 
Patent: July 2, 2002. 

Ashkenazi A, Fong S, Goddard A, Gurney AL, Napier MA, Tumas D, Wood Wl. Nucleic acid 
encoding A-33 related antigen poly peptides. Patent Number: 6,410,708. Date of Patent:: 
Jun. 25, 2002. 

Botstein DA, Cohen RL, Goddard AD, Gurney AL, Hillan KJ, Lawrence DA, Levine AJ, 
Pennica D, Roy MA and Wood Wl. WISP polypeptides and nucleic acids encoding same. 
Patent Number: 6,387,657. Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gurney AL. Tie ligands. Patent Number: 6,372,491. Date of 
Patent: April 16, 2002. 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homologue antibody. Patent 
Number: 6,350,450. Date of Patent: Feb. 26, 2002. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 
receptor tyrosine kinase ligand homologues. Patent Number: 6,348,351. Date of Patent: 
Feb. 19, 2002. 

Goddard A, Godowski Pj and Gurney AL. Ligand homologues. Patent Number: 6,348,350. 
Date of Patent: Feb. 19, 2002. 

Attie KM, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth 
hormone insensitivity syndrome. Patent Number: 6,207,640. Date of Patent: March 27, 
2001. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Nucleic 
acids encoding NL-3. Patent Number: 6,074,873. Date of Patent: June 13, 2000 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hornrione 
insensitivity syndrome. Patent Number: 5,824,642. Date of Patent: October 20, 1998 

Attie K, Carlsson LMS, Gesunheit N and Goddard A Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,646,113. Date of Patent: July 8, 1997 

Multiple additional provisional applications filed 
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PUBLICATIONS 

Seshasayee D* Dowd P, Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ, Goddard A, Grigorescu F, Lautier C, Smith RJ and Chernausek SD. Human 
IGF-1 receptor mutations resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S, Xie, M-H t Foster J, Frantz G, Stinson J, Corpuz RT, Simmons L, Hillan K, 
Yansura DG, Vandlen RL, Goddard AD and Gurney AL. FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript submitted. 

Adams SH, Chui C, Schilbach SL, Yu XX, Goddard AD, Grimaldi JC, Lee J, Dowd P t Colman 
S., Lewin DA. (2001) BFIT, a unique acyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal 360: 135-142. 

Lee J. Ho WH. Maruoka M. Corpuz RT, Baldwin DT. Foster JS. Goddard AD: Yansura DG. 
Vandlen RL. Wood Wt. Gurney AL. (2001) IL-17E, a novel proinflammatory ligand for the IL- 
17 receptor homolog IL-17RM . Journal of Biological Chemistry 276(2): 1660-1664. 

Xie M-H, Aggarwal S, Ho W-H, Foster J, Zhang Z, Stinson J, Wood Wl, Goddard AD and 
Gurney AL (2000) Interleukin (IL)-22 F a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2-4 and IL-22R. Journal of Biological Chemistry 275 
31335-31339. 

Weiss GA, Watanabe CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Natl. Acad. Sci. USA 97: 
8950-8954. 

Guo S, Yamaguchi Y, Schilbach S, Wada T.;Lee J, Goddard A, French D , Handa H, 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal 
development. Nature 408: 366-369. 

Yan M, Wang L-C, Hymowitz SG t Schilbach S, Lee J, Goddard A, de Vos AM, Gao WQ, Dixit 
VM. (2000) Two-amino acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Science 290: 523-527. 

Sehl PD, Tai JTN, Hillan KJ, Brown LA, Goddard A, Yang R, Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A, Wilson SW, Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. Neuron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W, Armanini P, Gurney A, Phillips HS, Brush, J f Goddard 
A, de Sauvage FJ and Rosenthal A. (1999) Characterization of the human suppressor of . 
fused; a negative regulator of the zinc-finger transcription factor Gli. J. Cell Sci. 112: 4437- 
4448. 

Xie M-H, Holcomb I, Deuel B, Dowd P, Huang A, Vagts A, Foster J, Liang J, Brush J t Gu Q, 
Hillan K, Goddard A and Gurney, A.L. (1999) FGF-19, a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 11: 729-735. 
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Yan M, Lee J, Schilbach S, Goddard A and Dixit V. (1999) mE10, a novel caspase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL, Marsters SA, Huang RM, Pitti RM, Mark DT, Baldwin DT, Gray AM, Dowd P, 
Brush J, Heldens S, Schow P, Goddard AD, Wood Wl, Baker KP, Godowski PJ and 
Ashkenazi A. (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR. Current Biology 9(4): 215-218. 

Ridgway JBB, Ng E, Kern JA ,Lee J, Brush J t Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 2718-2723. 

Pitti RM, Marsters SA, Lawrence DA, Roy M, Kischkel FC, Dowd P f Huang A, Donahue CJ, 
Sherwood SW, Baldwin DT, Godowski PJ, Wood Wl, Gurney AL, Hillan KJ, Cohen RL, 
Goddard AD, Botstein D and Ashkenazi A. (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and colon cancer. Nature 396(6712): 699-703. 

Pennica D, Swanson TA, Welsh JW, Roy MA, Lawrence DA, Lee J, Brush J, Taneyhill LA, 
Deuel B, Lew M, Watanabe C, Cohen RL, Melhem MF, Finley GG, Quirke P, Goddard AD, 
Hillan KJ, Gurney AL, Botstein D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors. Proc. Natl. Acad. Sci. USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A, Huang A, Xie MH, Zhang M, Goddard A t Wood Wl, Gurney AL 
and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysaccharide-induced cellular 
signalling. Nature 395(6699): 284-288. 

Merchant AM, Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficient^route to human bispecific IgG. Nature Biotechnology 16(7): 677-681. 

Marsters SA, Sheridan JP, Pitti RM, Brush J, Goddard A and Ashkenazi A. (1998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J, Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonifas JM, Lam CW, Hynes M, 
Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 
mutations in sporadic basal-cell carcinoma. Nature. 391(6662): 90-92. 

Marsters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J, Gurney A, 
Goddard AD, Godowski P and Ashkenazi A. (1997) A novel receptor for Apo2L/TRAIL 
contains a truncated death domain. Current Biology. 7(12): 1003-1006. 

Hynes M, Stone DM, Dowd M, Pitts-Meek S, Goddard A, Gurney A and Rosenthal A. (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor Gli-1. Neuron 
19:15-26. 

Sheridan JP, Marsters SA, Pitti RM, Gurney A., Skubatch M, Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood Wl, Goddard AD, Godowski P, and Ashkenazi A. (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821. 
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Goddard AD, Dowd P, Chernausek S, Geffner M, Gertner J, Hintz R, Hopwood N, Kaplan S, 
Plotnick L, Rogol A, Rosenfield R, Saenger P, Mauras N, Hershkopf R, Angulo M and Attie, K. 
(1997) Partial growth hormone insensitivity: The role of growth hormone, receptor mutations in 
idiopathic short stature. J. Pediatr. 131: S51-55. 

Klein RD, Sherman D, Ho WH, Stone D, Bennett GL, Moffat B, Vandlen R, Simmons L, Gu Q, 
Hongo JA, Devaux B, Poulsen K, Armanini M, Nozaki C, Asai N, Goddard A, Phillips H, 
Henderson CE f Takahashi M and Rosenthal A. (1997) A GPI-linked protein that interacts with 
Ret to form a candidate neurturin receptor. Nature. 387(6634): 717-21. 

Stone DM, Hynes M, Armanini M, Swanson TA, Gu Q ( Johnson RL, Scott MP, Pennica D, 
Goddard A, Phillips H, Noll M, Hooper JE, de Sauvage F and Rosenthal A. (1996) The 
tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog. Nature 
384(6605): 129-34. 

Marsters SA, Sheridan JP, Donahue CJ, Pitti RM, Gray CL t Goddard AD, Bauer KD and 
Ashkenazi A. (1996) Apo-3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and activates apoptosis and NF-kappa p. Current Biology 6(12): 
1669-76. 

Rothe M, Xiong J, Shu HB, Williamson K, Goddard A and Goeddei DV. (1996) l-TRAF is a 
novel TRAF-interacting protein that regulates TRAF-mediated signal transduction. Proc. Natl. 
Acad. ScL USA 93: 8241-8246. 

Yang M, Luoh SM, Goddard A, Reilly D, Henzel W and Bass S/ (1 996) The bglX gene 
located at 47.8 min on the Escherichia coli chromosome encodes a periplasmic beta- 
glucosidase. Microbiology 142: 1659-65. 

Goddard AD and Black DM. (1996) Familial Cancer in Molecular Endocrinology of Cancer. 
Waxman, J. Ed. Cambridge University Press, Cambridge UK, pp.1 87-21 5. 

Treanor JJS, Goodman L, de Sauvage F, Stone DM, Poulson KT, Beck CD, Gray C, Armanini 
MP, Pollocks RA, Hefti F, Phillips HS, Goddard A, Moore MW, Buj-Bello A, Davis AM, Asai N, 
Takahashi M, Vandlen R, Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GDNF. Nature 382: 80-83. 

Klein RD, Gu Q, Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteins and receptors. Proc. Natl. Acad. ScL USA 93: 7108-71 13. 

Winslow JW, Moran P, Valverde J, Shih A, Yuan JQ, Wong SC, Tsai SP, Goddard A, Henzel 
WJ, Hefti F and Caras I. (1995) Cloning of AL-1, a ligand for an Eph-related tyrosine kinase 
receptor involved in axon bundle formation. Neuron 14: 973-981 . 

Bennett BD, Zeigler FC, Gu Q, Fendly B, Goddard AD, Gillett N and Matthews W. (1995) 
Molecular cloning of a ligand for the EPH-related receptor protein-tyrosine kinase Htk. Proc. 
Natl. Acad. Sci. USA 92: 1866-1870. 

Huang X, Yuang J, Goddard A, Foulis A, James RF, Lernmark A, Pujol-Borrell R, 
Rabinovitch A, Somoza N and Stewart TA. (1995) Interferon expression in the pancreases of 
patients with type I diabetes. Diabetes 44: 658-664. 

Goddard AD, Yuan JQ, Fairbairn L, Dexter M, Borrow J, Kozak C and Solomon E. (1995) 
Cloning of the murine homolog of the leukemia-associated PML gene. Mammalian Genome 
6: 732-737. 
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Goddard AD, Covelio R, Luoh SM, Clackson T, Attie KM, Gesundheit N, Rundle AC, Wells 
JA, Carlsson LMTI and The Growth Hormone Insensitivity Study Group. (1995) Mutations of 
the growth hormone receptor in children with idiopathic short stature. N. Engl. J. Med. 333: 
1093-1098. 

Kuo SS, Moran P, Gripp J, Armanini M, Phillips HS, Goddard A and Caras IW. (1994) 
Identification and characterization of Batk, a predominantly brain-specific non-receptor protein 
tyrosine kinase related to Csk. J. Neurosci. Res. 38: 705-715. 

Mark MR, Scadden DT, Wang Z, Gu Q f Goddard A and Godowski PJ. (1994) Rse, a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels in the 
brain. Journal of Biological Chemistry 269: 10720-10728. 

Borrow J, Shipley J, Howe K, Kiely F, Goddard A, Sheer D, Srivastava A, Antony AC, 
Fioretos T, Mitelman F and Solomon E. (1994) Molecular analysis of simple variant 
translocations in acute promyelocytic leukemia. Genes Chromosomes Cancer 9: 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet. 21: 321-376. 

Borrow J, Goddard AD, Gibbons B, Katz F, Swirsky D, Fioretos T, Dube I, Winfield DA, 
Kingston J, Hagemeijer A, Rees JKH, Lister AT and Solomon E. (1992) Diagnosis of acute . 
promyelocytic leukemia by RT-PCR: Detection of PML-RARA and RARA-PML fusion 
transcripts. Br. J. Haematol. 82: 529-540. 

Goddard AD, Borrow J and Solomon E. (1992) A previously uncharacterized gene, PML, is 
fused to the retinoic acid receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
Suppl 3: 117S-119S. 

Zhu X, Dunn JM, Goddard AD, Squire JA, Becker A, Phillips RA and.Gallie BL (1992) 
Mechanisms of loss of heterozygosity in retinoblastoma. Cytogenet. Cell. Genet 59: 248-252. 

Foulkes W, Goddard A. and Patel K. (1991) Retinoblastoma linked with Seascale [letter]. 
British Med. J. 302:409. 

Goddard AD, Borrow J, Freemont PS and Solomon E. (1991 ) Characterization of a novel zinc 
finger gene disrupted by the t(15;17) in acute promyelocytic leukemia. Science 254: 1371- 
1374. 

Solomon E, Borrow J and Goddard AD. (1991) Chromosomal aberrations in cancer. Science 
254: 1153-1160. 

Pajunen L, Jones TA, Goddard A, Sheer D, Solomon E, Pihlajaniemi T and Kivirikko Kl. 
(1991) Regional assignment of the human gene coding for a multifunctional peptide (P4HB) 
acting as the p-subunit of prolyl-4-hydroxylase and the enzyme protein disulfide isomerase to 
17q25. Cytogenet. Cell. Genet. 56: 165-168. 

Borrow J, Black DM, Goddard AD, Yagle MK, Frischauf A.-M and Solomon E. (1991) 
Construction and regional localization of a Not\ linking library from human chromosome 17q. 
Genomics 10: 477-480. 

Borrow J, Goddard AD, Sheer D and Solomon E. (1990) Molecular analysis of acute . 
promyelocytic leukemia breakpoint cluster region on chromosome 17. Science 249: 1577- 
1580. 
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Myers JC, Jones TA, Pohjolainen E-R, Kadri AS, Goddard AD, Sheer D t Solomon E and 
Pihlajaniemi T. (1990) Molecular cloning of 5(IV) collagen and assignment of the gene to the 
region of the region of the X-chromosome containing the Alport Syndrome locus. Am. J. Hum. 
Genet 46: 1024-1033. 

Gallie BL, Squire JA, Goddard A, Dunn JM, Canton M, Hinton D f Zhu X and Phillips RA. 
(1990) Mechanisms of oncogenesis in retinoblastoma. Lab. Invest 62: 394-408. 

Goddard AD, Phillips RA, Greger V, Passarge E, Hopping W, Gallie BL and Horsthemke B. 
(1990) Use of the RB1 cDNA as a diagnostic probe in retinoblastoma families. Clinical 
Genetics 37: 117-126. 

Zhu XP f Dunn JM, Phillips RA, Goddard AD, Paton KE, Becker A'and Gallie BL. (1989) 
Germline, but not somatic, mutations of the RB1 gene preferentially involve the paternal 
allele. Nature 340: .312-314. 

Gallie BL, Dunn JM, Goddard A, Becker A and Phillips RA. (1988) Identification of mutations 
in the putative retinoblastoma gene. In Molecular Biology of The Eve: Genes, Vision and 
Ocular Disease . UCLA Symposia on Molecular and Cellular Biology, New Series, Volume 88. 
J. Piatigorsky, T. Shinohara and P.S. Zelenka, Eds. Alan R. Liss, Inc., New York, 1988, pp. 
427-436. 

Goddard AD, Balakier H, Canton M, Dunn J, Squire J, Reyes E, Becker A, Phillips RA and 
Gallie BL. (1988) Infrequent genomic rearrangement and normal expression of the putative 
RB1 gene in retinoblastoma tumors. Mol. Cell. Biol. 8: 2082-2088. 

Squire J, Dunn J, Goddard A, Hoffman T, Musarella M, Willard HF, Becker AJ, Gallie BL and 
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We have enhanced the polymerase chain j 
reaction (PGR) such that specific DNA 
sequences can be detected without open* 
ing the reaction tube* This enhaiijcemettt 
requires the addition of ethidium bromide 
(EtBr) to a PGR. Since the fluorescence of 
EtBr increases in the presence of double* 
stranded (ds) DNA an increase in flaores- 
cence in such a PGR indicates a positive 
amplification, which can be easily mcmi- 
tored externally. In fact, amplification can 
be continuously monitored in order to 
follow its progress. The ability to simulta- 
neously amplify specific DNA setjuenoes 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation audi more 
widespread use in the clinic or in other 
situations requiring high sample tiirough- 
put 

Although the potential benefits of PGR* to clin- 
ical diagnostics arc well known 2 * 5 , it is still not 
widely used in this setting, even diough k is 
Font- )*ean eiuoQ thcnn?*tab]e DNA pc*jrm^r* 
ase$ 4 made PCR practical Some of the reasons for its $low. 
acceptance are high cost, lack of automation of pre-* and 
post-r*CR processing steps, and false positive results, from 
carryover-contamination. The fim two points arc related 
in that labor is the largest contributor to cost at the present 
stage of PCR development. Most Current assays require 
some form of ^downstream*' processing once tbermocy* 
cfing ts done in order *o determine whether the target 
DjNA sequence was- present and has amplified* Thctit 
include DNA hybrktfwoon* 6 , gel electrophorcftis with or 
whom use of restriction digestion 7 :*,' HPIX?, or capillary 
electrophoresis 10 . These methods are labor -intense, haye. 
low throughput, and arc difi&cult to automate- The third 
point is also ctascry related to downstream processing. 
The handling of the FCR product in these downstream 
processes increases the chances that amplified DNA '.will 
spread through the typing lab, resulting in a .risk of 



"carryover" false positives in subsequent testing 11 . 

These downstream processing steps would be elimi- 
nated if specific amplification and detection of amplified 
DMA took place simultaneously within an unopened re- 
action vessel Assays m whkh such different processes take 
place without, the need to separate reaction components 
have been termed \^mogeneou5\ No truly homogc-. 
neons PGR assay has been deinonstrated to date, although 
progress towards this end has been reported.- Chehab, et 
at 1 , developed a PCR product detection scheme using 
fluorescent, primers that resulted in a fiaorcficeut PCR 
product AUdc-spedfic primers, each with different fluo- 
rescent tags, were used to indicate- the genotype of the 
DNA. However, the unincorporated primers uiust still be 
removed in a do wnstream process in order to visualize the 
result Recently, HoHaiid, et al. l? , developed an assay in 
which the endogenous 5' exonudease assay of Tdf DNA 
polymerase was exploited to cleave a labeled oligonucleo- 
tide probe. The probe would only dcave if PCR axnp£6- 
cation bad produced its complementary sequence. In 
order to detect the cleavage product?, however, a subse- 
quent process w again needed. 

We have developed a truly homogeneous assay for PCR 
and PCR product detection based upon tbc greatly in- 
creased fluorescence that efhidxura 'btonude and other 
DNA binding dyes exhibit when they are bound .to.ds- 
DNA tJ *~ 19 . As ouchncd in Figure 1; a prototypk PCR 
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1 iVindplc of jimultancoua amdQ&attxni and dctedcon of 
PCR product; The cOtnpCarnU of a PCR CO&oinW EtBr thai «rr? 
fluorescent are lisied— EtBr itselt EtBr bound to other ssDNA or 
tUDNA. Tbcrc is a Jar^e fbioroccfvoc cnhanccmcdt when EtBr is 
bound to DNA and bsnduig t* gi^c&tl/ enhanced when DNA .is 
double-stranded. AfW sunicicTU <n)..cfdcs of PGR. the .net 
increase in d^pNA residts in addrtjorol EtBr bioijEag* and a net 
iacrcajte in total flaomv^rna^ 
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fWttd 2 Gei electrophoresis of PCS. atopiifiottioo products of the 
human, nudcar gen<\ HLA DQfc, made in the presence of 
increasing amouots of EtBr (up to 9 pgftnl). The presence of 
£tBr Ivja no obvious effect on the yield or specificity of amplifi- . 
cuion. 




RGOkX % (A) ttttbtesccnte measurement* |ro«i PCRs that contain 
0.5 u^/m! EtBr and that are specific for Y-eJ)jtxno$OBoc repeat 
**M<uetice*. Five replicate FCRswere begun containing each of the 
DNA* specified. At ^ach mcUdtcd cycle, one of the five rep&catc 
PCHs for each DNA -was perooved from thennocydmg and H» 
6uoreficcncc measured. Uhha of fluorescence Art arbitrary. (B) 
UV pboKygraphy of PGR tubes (0.5 ml Eppcndorf-^tylc, polypro- 
pylene tnkro-cenirifuRc tubes) conuming; reactions, those start* 
ing from $ ng male DNA and control reactions without any DMA, 
from (A). 
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begins with primers that arc single-stranded DNA (ss- 
DNA), dNTPs, and DNA polymerase; An amount of 
dsDNA containing the target sequence (target DMA) is 
also typically present. Thb amount tan vary, depending 
on the application, from single-cell amounts oTDNA tT to 
micrograms per PCR^. If EtBr is present, the reagents 
that will fluoresce, in order ofirKrta<an£ fluorescence, are 
free EtBr itself, and EtBr bound to the singJoetrandcd 
DNA primers and to the doubles tianded target DNA (by 
its intercalation between the stacked bases of the DNA 
doubk-hchx). After the first denaturation cyde, target 
DNA will be largely single-stranded. After a PGR is 
completed the most significant change is the increase in 
the amount of dsDNA (the PGR product itself) of up to 
several micrograms- Formerly free EtBr is bound to the 
additional dsDNA* resulting in an increase in fluores- 
cence. There is also some decrease -in the amount of 
ssDNA primer, but because tbc binding of EtBr to ssDNA 
is much less than to dsDNA, the effect of this change on 
the total fluorescence of the sample is small. The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 



before and after, or even continuously during, therrnocy. 
ding. 



RESULTS 

PGR in the presence of EtBr. In order to' assess the 
affect of EtBr io PGR, amplifications of the human HLA 
DQct gene 19 were performed with the dye present at 
concentrations from 0.06 to 8,0 M-gfaal (a lypieaj concen- 
tration of EtBr used in staining of nuckk acids fajfcwing 
get electrophoresis is 0.5 pg/mf). As shown in Figure 2, gd 
electrophoresis revealed little or no djflfarcpec in the yield . 
or quahiy of the amphfkarion product whether EtBr was 
absent or present at any of these coru^trarkms, indicat- 
ing that EtBr does not inhibit PGR, 

Deteetiou of human Y-chromowra« specific st- 
cjnences- SecjuerK^-spcafk, fluorcscciKe e^ancement of 
EtBr as a result of PGR was demonstrated in a scries of 
amplifications containing 0.5 u.gfrn| EtBr and primer? 
specific to repeat DNA sequences found on the human 
Y-chromosomc 10 . These PCRs initially contained either 
60 ng male, 60 ng female, 2 ng male human or no DNA. 
Five replicate PCRs were begun for each DNA* After 0,. 
17, SI # 24 and 29 cycles of therniocyding, a PCR &>r cadi 
DNA was removed from the thermocyder, and its. fluo- 
rescence measured in a spectroftnorometer and plotted 
vs, amplification cyde number (Fig. 3A). The shape of this 
curve reflects the fact that by the rime an increase in 
fluorescence can be detected* the increase in DNA is 
becoming linear and not exrxmeniial with cyde number; 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain- 
ing human male DNA, but did not significandy increase 
for negative control PCRs, which contained cither no 
DNA or human female DNA. The more 'male DNA 
present to begin with— 60 ng versus 2 ng— the fewer 
cycle* were needed to give a detectable increase in fluo- 
rescence. Gel dectn^pftoresis oo the products of these 
amplifications showed that DNA fragments of the ex- 
pected she were made in the male DNA containing 
reactions and that utile DN A synthesis took place in the 
control samples. 

In addition the increase in fluorescence was visualized 
by simply laying the completed, unopened PCRs on a UV 
traroulujmnator and photographing them through a red' 
filter. This is shown in figure 3B lor the reactions thai 
began with 2 ng male DNA and those with no DNA- 

Detection of specific aDcJe* of the human p-globtn 
gene. In order to demonstrate that this approach has 
adequate specifkky to allow genetic screening* a direction 
of the Sicklc-cdl anemia mutation was performed- Figure 
4 shows the fluorescence from completed amplication* 

containing ttBr (0.5 ng/ml) as detected by photography 

of the reaction tubes on a UV trapsilluminator. These 
reactions were performed using primer* specific for ci- 
ther the wikJ-tvpe or skkle-cefl mutation of the human 
p-globin gene^. The srjorirkity for each allele is imparted 
by placing the sickie-m utation site at the terminal 3 
nucleotide of one primer. By using an appropriate primer 
annealing temperature* primer extension— and thus an> 
phhcatipn— -can take place only if the 3' nucleotide of the 
primer is complementary to the 0-gtobin allele prc*cnr -. 

Each pair of amrAmcaoons shown in Figure 4 consists of 
a reaction with etcher the wiMHypc allele spedfk (le« 
tube) or stckk-aUde specific (rigbt tube) primers. Three 
different DNAs were typed: DNA from a homcgygous. 
Wild-type p-globin individual <AA); from a heterozygous 
sickle £-gk>bin individual (AS); and from a homozygous 
sickle p-gioWn individual (SS). Each DNA (50 ng genon?* 
DNA to start each PGR) was Analyzed in triplicate (3 pairs 



PAGE 3/S * RCVD AT 7/1 W2004 3:10:03 PM [Pacific DayfigM TimeJ * SVR:SVCS01/0 * DNB:6638 * CSiD:650 952 9881 1 DURATION (mm-ss):0446 



JUL- 19-2004 13:51 FROM: GENENTEv -V EGRL 650 952 9881 



T ■ j246638 



P:4'6 




ess the 
l HJLA 
«rot at 

JovriW 

tc yield 
Br was 
ndicat- 

Sc «. 

nem or 
:rics of 
►rimers 
human 
either 
»DNA* 
tftet 0« 
?r each 
£5 fluo- 
plotted 
: of this 
case in 
)NA is 
umber, 
xtt-fotd 

ncreasc 
bcr no 
J>NA 
cr 

ftvO- 

f these 
the cx- 
-tai rung 
5 in the 



^rf> 



uialkcd 
naUV 
ha red 
<m that 

••gfobifi 
ich has 
ttottjon 
. Figure 
icariorts 
>Sr» P hy 
. These 
: for ct- 
human 
n parted 
wo*l 3' 
:ptirocr 
bus am* 
c of the 
^nr*'-**. 
mist* of 
ttc (left 
; Three 
jzygous, 
o*ygou* 
ozygous 

(5 pairs 



reactions each). The DMA type wai reflected in the 
y^jatrve fluorescence intensities in each pair of completed 
ggtpltficatioitt. There was a significant increase in fluores* 
only where a (^globin aQele DNA matched the 
primer set. When measured -on a spectroflnoronictcr 
(data not shown), this fluorescence was about three times 
jrtt present in a PCR where both p-dobra alleles were 
nVsmatchcd to the primer set. Gel electrophoresis (not 
frown) established that this increase in fluorescence was 
doe to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for 0-giobin. There was 
little synthesis of ds£)NA in reactions in .which the aflele- 
jjpednc primer was mismatched to both alleles. 

Continuous monitoriog of a PCR. Using a fiber optic 
deviecrk is possible to direct excitation illumination from 
p i|>ectrofluororneter to a PCR undergoing thcrmocyding 
„nd to return its fluorescence to the jqKCcroftuororweter. 
The fluorescence readout of such an arrangement, di- 
rected At an EtB r-containing amplification of Y-chromo- 
loroc specific sequences tram 25 ng of human male DNA» 
\% shown in Figure 5. The readout from a control PCR 
v'lih no target DNA is also shown. Thirty cycles of PCR 
were monitored for each. 

The fluorescence trace as a function of time clearly 
shows the efFect of the thenrjocyding. Fluorescence inten- 
sity rise* and fails inversely with tempemurc. The fluo- 
rescence intensity is minimum at the denatuxation tem- 
perature (*H°C) and maximum at die anneaUn^ekiension 
temperature (SOX). In the negative-control PCR, these 
fluorescence maxima and minima do not change, signifi- 
cantly over the thirty tbonroocyck*, indicating ihat there is 
Bttlc dBDNA synthesis without the appropriate target 
DNA, and there is little if any bleaching of JEtBr during 
the continuous ilhimination Of the sample. 

In the PCR containing male DNA, the fluorescence 
maxima at the annealmg/extjension temperature begin to 
increase at about 4000 seconds of thcrmocyding, and 
continue to increase with rime, indicating that dsDNA is 
being produced at a detectable level Note that die fluo- 
rescence minima at the derjaturatioa temperature do not 
sigruftcandy increase, presumably because al this temper- 
ature there is no d&DNA for EtBr to bind- Thus the course 
of the amplification is followed by tracking the fluorcs-. 
cence increase at the annealing temperature. Analysis bf 
the products of these two amplifications by gel electropho- 
resis showed a I>NA fragment of the expected sire for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sample. 

DISCUSSION 

Downstream processes such as hybridization to a se- 
o,uence-«pedfic probe can enhance die specificity of DNA 
dece^uvu Xtj PGR. The cluirawtkin of tho%c processes, 
means that" the specificity of this homogeneous assay 
depends solely on that of tCfL la the case of sickle-cell 
disease, we have shown that PCR ak>oc has sufficient DNA 
sequence apedficky to permit genetic screening. Using 
appropriate amplification conditions, there b Ifrde non- 
specific production of dsDNA in the absence of the 
appropriate target allele. 

The specificity required to detect pathogens can be 
more or less than that required' to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA chat must be taken with the 
sample, A difficult target is HIV, which requires detection 
of a viraJ genome that can be at the level of a few copies 
per thoit.<ands of host cells*. ComP&ttd with genetic 
screening, which is performed or ceils .containing at least 
one cony of die target sentence* HiV detection requires 
hoth more specrfiaty and the input of more total 
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UV photography of PCR tubes containing lunpfificauoju 
using EtBr tout are rpecifk to viloVtrpe (A) or licUe (S) alkies' of 
the human 0-globin gene. The left of each pair of tube* contain* 
alkle-*pcdfk: primers to the wild-type alleles, the right tube 
primers to the skWe aftek- The photograph was tafceb after SO 
cycles of PCR..*nd the input DNAs and the alleles jhev coataio 
are indicated. Fifty tag of DNA was used to bcgra PGR. Typing 
wis done in triphcatc (3 pair* of P&U) for each input DNA. . 
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nCtNS 5 Continuous, real-time monitoring of a PCR. A fiber optic 
was wed to carry excitation Rgbt to a Pt;R in progress and ako 
emitted light back to a fltiorometcr (see Expentncnal ^ocol). 
AmpHficaaon -using -human uialo-DNA spcofk priiiicrf in » PCR 
starting with tO ng of human male DNA (too), or in a control 
PCR mthout DNA (bottom), were rnonhorrd. Thirty cydes of 
PCR were foRowed for each. The temperature cyded between 
94*C (denaturatiota) And 50X (annealing and extension). Note in 
the male DNA PCR, the cyde M depenxfcot tricreuc in 
ftuorcsccncc at the aoueaBttg^extcnsion tcmpewture, 
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DNA — up to microgram amc*uiu»~ia order to have suf- 
ficient numbers of target sequences. This large amount of 
starting DNA tn an ampub^ooa signibc^nUy increases 
the bacKground fluowceuce over which any additional 
fluorescence produced by PGR must be detected. An 
additional complication, that occurs whh targets in low 
copy-number is the formation of the "primer-dimer" 
artifact. This is the result of the extension of one primer 
using the other primer as a template. Although this occurs 
infrequently, once it occur* the extension product is a 
substrate for PCR amplification* and can compete whh 
true PCR targets if those targets are rare. The primex- 
dimer product i* of course dsDN A and thus i* a potential 
source of false signal in this homogeneous a*$ay. 

To increase PGR specificity and reduce the effect of 
primer-dimcT arnplification, we are investigating a num- 
ber of approaches, including the use of Dcsted-prkner 
amplification* that take place in a single tube 8 , and the 
**hot-start** ( in which nonspecific amplification is reduced 
by raiding the temperature of the reaction before DNA 
synthesis begins 25 . Prefaunary results using these ap- 
proaches suggest tbat^nmcr^tmcT is effectively reduced 
and it is possible to detect the increase in Ecfir fluores- 
cence in a PCR instigated by a single HIV genome in a 

background of 10* celts. With larger number* of cells, the 
background fluorescence contributed by genomic DNA 
becomes problcinatic- To. reduce thw background, it may 
be possible to use se^fuence-specific DNA-binding dyes 
that can be made to preferentially bind PCR product over 
genomic DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a 5' "add-on" to . 
the oligonuckotide pruncr 4 ' 1 . 

We nave shown that the detection of fluorescence 
generated by an EtBr-contaming PCR is straightforward, 
both once PCR is completed and continuously during 
therrnocycHng. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. Hie fluorescence analysis 
of completed PCRs is alrcadyjxjsstbJc with existing instru- 
mentation in 96-well format**. In tliis format,, the fluores- 
cence in each PCR can be cruantitated before* after, and 
even at selected points during therrnocyciirtg by moving 
the rack of PCR* to a 96^microwcll plate fluorescence 
reader 40 . 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A direct extension of the apparatus used here is to have 
multiple fiberopdes transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target l)NA copy number. Figure 3 shows that 
the larger the amount of starting target DNA, the sooner 
during PCR a fluorescence increase w detected. Prelimi- 
nary experiments {Wiguchi and Dollinger, manuscript in 
preparation) with continuous raorutoriiig have shown a 
sensitivity to two-fold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules is 
known — as* it can be in genetic sereening-^continuous 
monitoring may provide a means of detecting false posi- 
tive and false negative result*. With a known number of 
target roolecules, a true positive would exhibit detectable 
fluorescence by 9 predictable number of cycles of PCR- 
Increases in fluorescence detected before or after that 
cycle would indicate potential artifacts. False negative 
results due to* for example,, inhibition of DNA polymer- 
ase, may be detected by including within each PCR an 
inefficiently amplifying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
cles — many more than are necessary co detect a true 
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positive- If a sample fails to bare a fluorescence increase 
after this many- cycles, wihnViuon may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event before any test based on this 
principle is ready for the clime, an assessment of its false 
positive/false negative rates will need to be obtained'*! sing 
a large number of known saroples- 

In summary, inclusion in PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect specific DNA amplification from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR, 
in applications that demand the high throughput of 
sample** 

EXPERIMENTAL PROTOCOL 

Human HLA-DQn geoe *uptiSaUKms containing Etffc. 
PCRs were set «p in 100 volumes containing 1 0 mM Trb-HQ, 
pH 8.3; 50 mM KCI; 4 rati MgC^: tS units of Ton DNA 
polymerase (Perkm^EImcr Cctua, Norwalk, CT); 20 pm ote each 
of human HtA-DQa ' gene specific ol«onudeoaoe primers 
<>H$6 and CH27 19 and approMinattly W copies of DQfc PCR 
product diluted from a prttriou* reaction. EuSidium bromide 
folVr; Stgma) was used at the concenu^uons indicated in Figure. 
2. TherrnocycHng proceeded for 20 cycles in a mode! 480 
uVcrmocrdcr (Perkjn-Elmcr Cerna, Norwalk, CT) uanga 'itcp- 
cydc M program of 94*C for 1 mnxdciiaturation and 60aT» for )0 
sec anacafing and 72°C for 30 tec. extension, 

Y-chromowwnc specific PCR. PCRi (100 uJ total reaction 
volume) containing t)J» pgtoJ EtBr were prq>arcd as described 
for HLA-DQcr, except wi<h different primers and target DNAs. 
These PCRs contained I $ pmotc each male DNA-ipccujc primen 
Y J. 1 and V IJF*, slid cither 60 ng male, €0 ugfemale, 2! ng male, 
or no human DNA. TherniOCTClmg was WHJTor 1 nun. and SVC 
for 1 man using a "step-cycle* program The number of cycles for 
a sample were as indicated m figure 3. fluorescence measure- 
ment is described below. 

Allcfc-epccific, human frtfabhi pewe PGR* AropUncauons of 
100 pi volume using 05 u^Anl of £xBr were prepared a* 
described for HLA^DQa above except with different pruncrs and 
target DNAs. These PCRs contained eUtrer. primer pair HGF*/ 
H0HA <wBdHype globin speouc primers) or HGTMtyUS (sick- 
le-giobin specific primers) at 10 pmole each primer per PCR, 
These primers wrc developed by Wu ct al 2 \ Three different 
tacget DNAs Were used in separate amptifkatiOnsr-^SO ng each of 
human ON A that was homozygous for the l*cWc trait <$S)» DMA 
that was ttaernzrcous for the tickle trait (A$X or DNA that wsu 
homo^rg^us for Uw w.i- ^lobin (AA). ThcxmocycBng was far i0 
cycles at MX for 1 mm. and 65*C (or 1 min. using a "step-crefc" 
program. An anneatiog temperature of 55*C b*il nccn shown vy 
Wo ei al sl to pwide »Uc3c<pocific ampliation. Completed 
PCRs were phertogranhed through a red fiber (Wratten 23A) 
after placing the reaction twbes atop a model TM-36 (ranaiHuCfti- 
nator (UV-producu San Gabnd, CK). 

Fhtoresecnee nteasuronem. Fluore*cet>oe Tncasurcmenti vrcte 
rro<le on PCRs containing EtBT in a *1uorolog»2 OUoromCtcr 
(SFEX Edison, NJ). Excitation was at the &00 nm band with 
about 2 nra bandwidth with a OG 435 nm cut-off uherJfMefles 
Crist, Inc*. Irvine. CA) to exclude second-order light. Erorttcd 
Bght was detected a< 5^0 nm with a band width of about 7 nm. An 
OG 530 urn cut -off filter was used to remove the excitation hght 

ContftHtoaft naoreftcencc xntmftormg of PCR. Continuous 
monitoring or a PCR in progress was accomplished u«ng aic 
spcetrofluorometer and setonga described above as well as a 
fiberoptk accessory <SPEX cat no. 1950) to both send excuauon 
6ght m. aod receive emitted Ught from, a PCR placed in a well of 
a model 480 iherroocyclcr (Pcrim-EImer Cetws). The probe e«d 
of the fiberoptic cable was attached with "5 nwutc-cpoxy*' to tbt 
open top of a PCR tube (a 0^ ml potoropyiene centrifuge robe 
with Hs cap removed) effectively wading rt- The exposed top ot 
the PCR tuoe and the end of the nberoptk caWc were rfiJeldcd 
from room light and the rOOCO lights were kept dimmed during 
- each tup- The monitored PCR was an ampftnciiidn of V-cbio- 
rno?6me-«pcdrk repeat scquetices as described above, c*ccpt 
using.an anneatiug/extenskm temperature of 5CC The reacuoa 
was covered wkh mii>era} oil (2 drops) to prevent cvaporaaon- 
Thcrrocxycliny and Quorcsccncc roc^uccmeut were started 
multancously. A time-base scat*, witfc a 10 second mlegradotf tnnc 
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was uftfd aod ihc emUuoa signal was radocd to" tfvr excitation 
Vigo*) to control for change* in Jkht-fourcc intensity. D^C* were 
Reeled using the dro3O0Of, version $.5 (SPEX) data system. 
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IMMUNO BIOLOGICAL LABORATORIES 



sCD-14 EUSA 



Trauma, Shock and Sepsis 




The CD- 14 molecule is expressed on the surface of 
fnonocytes and some macrophages. Membrane- 
bound .CD-1.4 is a, receptor for lipopotysacchaiide 
(LPS) complexed to LPS-Binding-Protein (LBP). The 
concentrailon of \\s soluble form is aftered under 
certain pathological conditions. There, is evidence lor 
an important rate of $CXM4.with polytrauma, sepsis, 
burnings and inflammafcns. 
During septic conditions and acute infections it seems 
to be a prognostic marker and is therefore of value in 
monitoring these patients. 



IBL offers an EUSA for quantitative determinauon of 
soluble CD-14 in human serum, -piasma, ceil-cutture 
supernatants and other biological fluids. 
Assay features: 12x8 determinations 
(microtJter strips), 
precoated swtth a specific 
monoclonal antibody, 
2x1 hour incubation, 
standard range: 3-96 ng/mi 
detection limit: 1 ng/ml 
CV: intra- and tnterassay < 8% 

For more information call or fax. 



GESELLSCHAFT FUR IMMUNCHEMIE UND -BIOLOGiFmbh 

OSTERSTRASSE 86 • D- 2000 HAMBURG 20 - GERMANY TEL. +40/491 00 61-64 FAX +40/40 u 98 



PAGE 6/6 1 RCVD AT 7/19/2004 3:10:03 PM [Pacific Daylight TimeJ ' SVR:SVCS01/0 1 DN1S:6638 1 CS©:650 952 9881 ' DURATION (mnv$$):IM 



From (613) 991-5695 Order ;^811209DP04816741 Mon 24 Oct 2005 0' :35 PM EDT Page 2 of 7 



IIIIResearch 



Oligonucleotides with Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kenneth J. Livak, Susan J.A. Flood, Jeffrey Marmaro, William Giusti, and Karin Deetz 

Perkin-EImer, Applied Biosystems Division, Foster City, California 94404 



a 



The S' nuclease PCR assay detects the 
accumulation of specific PCR product 
by hybridization and cleavage of a 
double-labeled fluorogenlc probe 
during the amplification reaction. 
The probe Is an oligonucleotide with 
both a reporter fluorescent dye and a 
quencher dye attached. An increase 
In reporter fluorescence intensity In- 
dicates that the probe has hybridized 
to the target PCR product and has 
been cleaved by the 5' -+3' nucle- 
olytlc activity of Tag DNA polymerase. 
In this ttudy, probes with the 
quencher dye attached to an Internal 
nucleotide were compared with 
probes with the quencher dye at- 
tached to the 3 '-end nucleotide. In all 
cases, the reporter dye was attached 
to the 5' end. All Intact probes 
showed quenching of the reporter 
fluorescence* In general, probes with 
the quencher dye attached to the 3'- 
end nucleotide exhibited a larger sig- 
nal In the 5' nuclease PCR assay than 
the Internally labeled probes. It Is 
proposed that the larger signal Is 
caused by Increased likelihood of 
cleavage by Taq DNA polymerase 
when the probe is hybridized to a 
template strand during PCR. Probes 
with the quencher dye attached to 
the 3 '-end nucleotide also exhibited 
an Increase in reporter fluorescence 
Intensity when hybridized to a com- 
plementary strand. Thus, oligonucle- 
otides with reporter and quencher 
dyes attached at opposite ends can 
*>e used as homogeneous hybrldiza- 
lon probes. 



t\ homogeneous assay for detecting 
the accumulation of specific PCR prod- 
uct that uses a double-labeled fluoro- 
genic probe was described by Lee et al. (1 > 
The assay exploits the 5'->3' nucle- 
olytic activity of Taq DNA poly- 
merase (2 3) and is diagramed in Figure 1. 
The fluorogenic probe consists of an oli- 
gonucleotide with a reporter fluorescent 
dye, such as a fluorescein, attached to 
the 5' end; and a quencher dye, such as a 
rhodamine, attached internally. When 
the fluorescein is excited by irradiation, 
its fluorescent emission will be 
quenched if the rhodamine is close 
enough to be excited through the pro- 
cess of fluorescence energy transfer 
(F ET).< 4 ' 5) During PCR, if the probe is hy- 
bridized to a template strand, Taq DNA 
polymerase will cleave the probe be- 
cause of its inherent 5' -* 3' nucleolytlc 
activity, if the cleavage occurs between 
the fluorescein and rhodamine dyes, it 
causes an increase in fluorescein fluores- 
cence intensity because the fluorescein 
is no longer quenched. The increase in 
fluorescein fluorescence intensity indi- 
cates that the probe-specific PCR product 
has been generated. Thus, FET between a 
reporter dye and a quencher dye is criti- 
cal to the performance of the probe in 
the 5' nuclease PCR assay. 

Quenching is completely dependent 
on the physical proximity of the two 
dyes. (6) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5' end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3' end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



Oligonucleotides 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used in this 
study. Linker arm nucleotide (LAN) 
phosphoramidite was obtained from 
Glen Research. The standard DNA phos- 
phoramidites, 6-carboxyfluorescein (6- 
FAM) phosphoramidite, 6-carboxytet- 
ramethylrhodamine succinimidyl ester 
(TAMRA NHS ester), and Phosphalink 
for attaching a 3'-blocking phosphate, 
were obtained from Perkin-EImer, Ap- 
plied Biosystems Division. Oligonucle- 
otide synthesis was performed using an 
ABI model 394 DNA synthesizer (Applied 
Biosystems). Primer and complement 
oligonucleotides were purified using 
Oligo Purification Cartridges (Applied 
Biosystems). Double-labeled probes were 
synthesized with 6-FAM-labeied phos- 
phoramidite at the 5' end, LAN replacing 
one of the T's in the sequence, and Phos- 
phalink at the 3' end. Following de- 
protection and ethanol precipitation, 
TAMRA NHS ester was coupled to the 
IAN-containlng oligonucleotide in 250 
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PCR assay. Furthermore, cleavage of this 
type of probe is not required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3' end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 
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FIGURE 1 Diaeram of 5' nuclease assay. Stepwise representation of the 5' - 3' ™*™^J*- 
5 of ' wX^ymerase acting on a fluorogenic probe during one extension phase of PGR. 



mM Na-bicarbonate buffer (pH 9.0) at 
room temperature. Unreacted dye was 
removed by passage over a PD-10 Sepha- 
dex column. Finally, the double-labeled 
probe was purified by preparative high- 
performance liquid chromatography 
(HPLC) using an Aquapore C 8 220x4.6- 
mm column with 7-u,m particle size. The 
column was developed with a 24-min 
linear gradient of 8-20% acetonitrile in 
0,1 m. TEAA (triethylamine acetate). 
Probes are named by designating the se- 
quence from Table 1 and the position of 
the LAN-TAMRA moiety. For example, 
probe Al-7 has sequence Ai with LAN- 
TAMRA at nucleotide position 7 from the 
5* end. 



PCR Systems 

All PCR amplifications were performed 
in the Perkin-Elmer GeneAmp PCR Sys- 
tem 9600 using 50-uJ reactions that con- 
tained 10 mM Tris-HCl (pH 8.3), 50 him 
KC1, 200 jjlm dATP, 200 u.m dCTP, 200 \M 
dGTP, 400 \lm dUTP, 0.5 unit of AmpEr- 
ase uracil N-glycosylase (Perkin-Elmer), 
and 1.25 unit of AmpHTaq DNA poly- 
merase (Perkin-Elmer). A 295-bp seg- 
ment from exon 3 of the human p-actin 



gene (nucleotides 2141-2435 in the se- 
quence of Nakallma-Hjima et al.) (7) was 
amplified using primers AFP and ARP 
(Table 1), which are modified slightly 
from those of du Breuil et al. <B) Actin am- 
plification reactions contained 4 mM 
MgCI 2 , 20 ng of human genomic DNA, 
50 nM Al or A3 probe, and 300 nM each 



primer. The thermal regimen was 50°C 
(2 min), 95°C (10 min), 40 cycles of 95°C 
(20 sec), 60°C (1 min), and hold at 72°C. 
A 515-bp segment was amplified from a 
plasmid that consists of a segment of X 
DNA (nucleotides 32,220-32,747) in- 
serted in the Sma\ site of vector pUC119. 
These reactions contained 3.5 mM 
MgCl 2 , 1 ng of plasmid DNA, 50 nM P2 or 
P5 probe. 200 nM primer F119, and 200 
nM primer R119. The thermal regimen 
was 50°C (2 min), 95°C (10 min), 25 cy- 
cles of 95°C (20 sec), 57°C (I min), and 
hold at 72°C 



Fluorescence Detection 

For each amplification reaction, a 40-pJ 
aliquot of a sample was transferred to an 
individual well of a white, 96-well micro- 
ti ter plate (Perkin-Elmer). Fluorescence 
was measured on the Perkin-Elmer Taq- 
Man LS-50B System, which consists of a 
luminescence spectrometer with plate 
reader assembly, a 485-nm excitation fil- 
ter, and a 515-nm emission filter. Excita- 
tion was at 488 nm using a 5-nm slit 
width. Emission was measured at 518 
nm for 6-FAM (the reporter or R value) 
and 582 nm for TAMRA (the quencher or 
Q value) using a 10-nm slit width. To 
determine the increase in reporter emis- 
sion that is caused by cleavage of the 
probe during PCR, three normalizations 
are applied to the raw emission data. 
First, emission intensity of a buffer blank 
is subtracted for each wavelength. Sec- 
ond, emission intensity of the reporter is 



TABLE 1 Sequences of Oligonucleotides 



Name 

F119 
R119 
P2 
P2C 
PS 
P5C 
AFP 
ARP 
Al 
A1C 
A3 
A3C 



Type 



Sequence 



ACCCACAGGAACTGATCACCACTC 

ATGTCGCGTTCCGGCTG ACGTTCTG C 

TCGCATTACTGATCGTIGCCAACCAGS> 
GTACTGGTTGGCAACGATCAGTAATGCGATG 

CGGA'ITTGCTGGTATCTATGACAAGGATp 
TTCATCCITGTCATAGATACCAGCAAATCCG 

TCACCCACACTGTGCCCATCTACGA 

CAGCGGAACCGCrCATTGCCAATGG 

ATGCCCICCCCCATGCCATCCIGCGlp 
AGACGCAGGATGGCATGGGGGAGGCCATAC 

CGCCCrGGACTTCGAGCAAGAGATp 
CCATCTCTTGCTCGAAGTCCAGGGCGAC 

For each oligonucleotide used in this study, the nucleic acid sequence is ^^Tro^^ 
y -f 3 ' direction. There are three type, of oligonucleotides: PC* pnmer, A««o^!cp« ^ £ 
fn the 5' nuclease assay, and complement used to h f"<^ 
probes the underlined base indicates a position where LAN with TAMRA attached was 
tuted for a T. (p) The presence of a 3' phosphate on each probe. 



primer 
primer 
probe 

complement 
probe 

complement 
primer 
primer 
probe 

complement 
probe 

complement 



n. 
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A 1 -2 RAQGCCCTCCCCCATGCCATCCTGCGTp 

A1 -7 RATGCCCQCCCCCATGCCATCCTGCGTp 

A1 -1 4 RATGCCCTCCCCCAQGCCATCCTGCGTp 

A1-19 jRATGCCCTCCCCCATGCCAQCCTGCGTp 

A1-22 RATGCCCTCCCCCATGCCATCCQGOGTp 

A1-26 RatgccctcccccatgccatcctgcgQp 



Probe 


518 nm 


582 nm 


RQ- 


RQ+ 


ARQ 




no temp. 


+ temp. 


no temp. 


+ temp. 








Al-2 


25.5 ±2J 


32.71 1.9 


38.2 ± 3,0 


38.2 ±2.0 


0.67+0.01 


0.8610.06 


0.1910-06 


A1-7 


53.5 ±4.3 


395.1 ±21.4 


108.5 1 6.3 


110,3 + 5.3 


0.4910.03 


3.5810.17 


3.0910.18 


A1-14 


127.Q±4.9 


403.5119.1 


109.715.3 


93.116.3 


1.1610.02 


4.3410.15 


3.1810.15 


A1-19 


187.5117.9 


422.717.7 


70.317.4 


73.012.8 


2.6710.05 


5.8010.15 


3.13 ±0.16 


A1-22 


224.6 + 9.4 


482.2 ± 43.6 


100.014.0 


96 2 1 9.6 


2.25 1 0.03 


5.0210.11 


2.7710.12 


A1-26 


160.2 ±8.9 


454.1 + 18.4 


93.1 ±5.4 


90.7 ±3.2 


1.7210.02 


5.0110.08 


3.29 ±0.08 



FIGURE 2 Results of 5' nuclease assay comparing p-actin probes with TAMRA at different nucle- 
otide positions. As described in Materials and Methods, PCR amplifications containing the in- 
dicated probes were performed, and the fluorescence emission was measured at 518 and 582 nm. 
Reported values are the average ±1 s.d. for six reactions run without added template (no temp.) 
and six reactions run with template ( + temp.). The RQ ratio was calculated for each individual 
reaction and averaged to give the reported RQ~ and RQ + values. 



divided by the emission intensity of the 
quencher to give an RQ ratio for each 
reaction rube. This normalizes for well- 
to-well variations in probe concentra- 
tion and fluorescence measurement Fi- 
nally, ARQ is calculated by subtracting 
the RQ value of the no-template control 
(RQ~) from the RQ value for the com- 
plete reaction including template 
(RCT), 

RESULTS 

A series of probes with increasing dis- 
tances between the fluorescein reporter 
and rhodamine quencher were tested to 
investigate the minimum and maximum 
spacing that would give an acceptable 
performance in the 5' nuclease PCR as- 
say. These probes hybridize to a target 



sequence in the human 0-actin gene. 
Figure 2 shows the results of an experi- 
ment In which these probes were in- 
cluded in PCR that amplified a segment 
of the p-actin gene containing the target 
sequence. Performance in the 5' nu- 
clease PCR assay is monitored by the 
magnitude of ARQ, which is a measure 
of the increase in reporter fluorescence 
caused by PCR amplification of the 
probe target. Probe Al-2 has a ARQ value 
that is close to zero, indicating that the 
probe was not cleaved appreciably dur- 
ing the amplification reaction. This sug- 
gests that with the quencher dye on the 
. second nucleotide from the 5' end, there 
is insufficient room for Taq polymerase 
to cleave efficiently between the reporter 
and quencher. The other five probes ex- 
hibited comparable ARQ values that are 



clearly different from zero. Thus, all five 
probes are being cleaved during PCR am- 
plification resulting in a similar increase 
in reporter fluorescence. It should be 
noted that complete digestion of a probe 
produces a much larger increase in re- 
porter fluorescence than that observed 
in Figure 2 (data not shown). Thus, even 
in reactions where amplification occurs, 
the majority of probe molecules remain 
uncleavedi It is mainly for this reason 
that the fluorescence intensity of the 
quencher dye TAMRA changes little with 
amplification of the target. This is what 
allows us to use the 582-nm fluorescence 
reading as a normalization factor. 

The magnitude of RQ" depends 
mainly on the quenching efficiency in- 
herent in the specific structure of the 
probe and the purity of the oligonucle- 
otide. Thus, the larger RQ~ values indi- 
cate that probes Al-14, Al-19, Al-22, and 
A 1-26 probably have reduced quenching 
as compared with A 1-7. Still, the degree 
of quenching is sufficient to detect a 
highly significant increase in reporter 
fluorescence when each of these probes 
is cleaved during PCR. 

To further investigate the ability of 
TAMRA on the 3' end to quench 6-FAM 
on the 5' end, three additional pairs of 
probes were tested in the 5' nuclease 
PCR assay. For each pair, one probe has 
TAMRA attached to an internal nucle- 
otide and the other has TAMRA attached 
to the 3' end nucleotide. The results are 
shown in Table 2. For all three sets, the 
probe with the 3' quencher exhibits a 
ARQ value that is considerably higher 
than for the probe with the internal 
quencher. The RQ" values suggest that 
differences in quenching are not as great 
as those observed with some of the Al 
probes. These results demonstrate that a 
quencher dye on the 3' end of an oligo- 
nucleotide can quench efficiently the 



TABLE 2 Results of 5' Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or 3'-terminal Nucleotide 



518 nm 



582 nm 



Probe 


no temp. 


+ temp. 


no temp. 


+ temp. 


RQ" 




RQ + 


ARQ 




A3-6 


54.6 ± 3.2 


84.8 ± 3.7 


116.2 ±6.4 


115.6 ±2.5 


0.47 


±0.02 


0.73 ± 0.03 


0.26 ± 


0.04 


A3-24 


72.1 ± 2.9 


236.5 ± 11.1 


84.2 ± 4.0 


90.2 ± 3.8 


0.86 


±0.02 


2.62 ± 0.05 


1.76 ± 


0.05 


P2-7 


82.8 ±4.4 


384.0 ± 34.1 


105.1 ± 6.4 


120.4 ± 10.2 


0.79 


±0.02 


3.19 ±0.16 


2.40 ± 


0.16 


P2-27 


113.4 ± 6.6 


555.4 ±14.1 


140 7 ± 8.5 


118.7 ± 4.8 


O.R1 


±0.01 


4.68 ±0.10 


3.88 ± 


0.10 


P5-10 


77.5 ± 6.5 


244.4 ± 15.9 


86.7 ± 4.3 


95.8 ± 6.7 


0.89 


±0.05 


2.55 ± 0.06 


1.66 ± 


0.08 


P5-28 


64.0 ± S.2 


333.6 ±12.1 


100.6 ± 6.1 


94.7 ± 6.3 


0.63 


± 0.02 


3,53 ±0.12 


2.89 ± 


0.13 



Reactions containing the indicated probes and calculations were performed as described in Material and Methods and in the legend to Fig. 2. 
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fluorescence of a reporter dye on the 5' 
end. The degree of quenching is suffi- 
cient for this type of oligonucleotide to 
be used as a probe in the 5' nuclease PCR 
assay. 

To test the hypothesis that quenching 
by a 3' TAMRA depends on the flexibility 
of the oligonucleotide, fluorescence was 
measured for probes in the single- 
stranded and double-stranded states. Ta- 
ble 3 reports the fluorescence observed 
at 518 and 582 nm. The relative degree 
of quenching is assessed by calculating 
the RQ ratio. For probes with TAMRA 
6-10 nucleotides from the 5' end, there 
is little difference in the RQ values when 
comparing single-stranded with double- 
stranded oligonucleotides. The results 
for probes with TAMRA at the 3' end are 
much different. For these probes, hy- 
bridization to a complementary strand 
causes a dramatic increase in RQ. We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
stranded DNA, which prevents the 5' 
and 3' ends from being in proximity. 

When TAMRA is placed toward the 3' 
end, there is a marked Mg 2 * effect on 
quenching. Figure 3 shows a plot of ob- 
served RQ values for the Al series of 
probes as a function of Mg 2 * concentra- 
tion/With TAMRA attached near the 5' 
end (probe Al-2 or Al-7), the RQ value at 
0 him Mg 2 * is only slightly higher than 
RQ at 10 mM Mg 2+ . For probes Al-19, 
Al-22, and Al-26, the RQ values at 0 mM 
Mg 2 * are very high, indicating a much 



reduced quenching efficiency. For each 
of these probes, there is a marked de- 
crease in RQ at 1 mM Mg 2 * followed by 
a gradual decline as the Mg 2f ^ concen- 
tration Increases to 10 niM. Probe Al-14 
shows an intermediate RQ value at 0 mM 
Mg 2+ with a gradual decline at higher 
Mg 2 * concentrations. In a low-salt en- 
vironment with no Mg 2+ present, a sin- 
gle-stranded oligonucleotide would be 
expected to adopt an extended confor- 
mation because of electrostatic repul- 
sion. The binding of Mg 2+ ions acts to 
shield the negative charge of the phos- 
phate backbone so that the oligonucle- 
otide can adopt conformations where 
the 3' end is close to the 5' end. There- 
fore, the observed Mg 2 "*" effects support 
the notion that quenching of a 5' re- 
porter dye by TAMRA at or near the 3' 
end depends on the flexibility of the oli- 
gonucleotide. 

DISCUSSION 

The striking finding of this study is that 
it seems the rhodamlne dye TAMRA, 
placed at any position in an oligonucle- 
otide, can quench the fluorescent emis- 
sion of a fluorescein (6-FAM) placed at 
the 5' end. This implies that a single- 
stranded, double-labeled oligonucle- 
otide must be able to adopt conforma- 
tions where the TAMRA is close to the 5' 
end. It should be noted that the decay of 
6-FAM in the excited state requires a cer- 
tain amount of time. Therefore, what 



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and 
Double-stranded Fluorogenic Probes 



518 nra 



582 nm 



RQ 



Probe 


ss 


ds 


ss 


ds 


ss 


ds 


Al-7 


27.75 


68.53 


61.08 


138.18 


0.45 


0.50 


Al-26 


43.31 


509.38 


53.50 


93.86 


0.81 


5.43 


A3-6 


16.75 


62.88 


39.33 


165.57 


0.43 


0.38 


A3-24 


30.05 


578.64 


67.72 


140.25 


0.45 


3.21 


P2-7 


35.02 


70.13 


54.63 


121.09 


0.64 


0.58 


T2-27 


39.89 


320.47 


65.10 


61.13 


0.61 


5.25 


P5-10 


27.34 


144.85 


61.95 


165.54 


0.44 


0.87 


P5-28 


33.65 


462.29 


72.39 


104.61 


0.46 


4.43 



(ss) Single-stranded. The fluorescence emissions at 518 or 582 nm for solutions containing a final 
concentraUon of SO nM indicated probe, 10 mM Tris-HCl (pH 8.3), 50 mM KC1, and 10 mM MgCl 2 . 
(ds) Double-stranded. The solutions contained, in addition, 100 nw A1C for probes Al-7 and 
Al-26, 100 nM A3C for probes A3-6 and A3-24, 100 nM P2C for probes P2-7 and P2-27, or 100 nM 
PSC for probes P5-10 and P5-28. Before the addition of MgCl 2 . 120 »xl of each sample was heated 
at 9S°C for 5 mhi. Following the addition of 80 jtl of 25 mM MgCl 2 , each sample was allowed to 
cool to room temperature and the fluorescence emissions were measured. Reported values are 
the average of three determinations. 



matters for quenching is not the average 
distance between 6-FAM and TAMRA 
but, rather, how close TAMRA can get to 
6-FAM during the lifetime of the 6-FAM 
excited state. As long as the decay time of 
the excited state is relatively long com- 
pared with the molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMRA at the 3' 
end, or any other position, can quench 
6-FAM at the 5' end because TAMRA is in 
proximity to 6-FAM often enough to be 
able to accept energy transfer from an 
excited 6-FAM. 

Details of the fluorescence measure- 
ments remain puzzling. For example, Ta- 
ble 3 shows that hybridization of probes 
Al-26, A3-24, and P5-28 to their comple- 
mentary strands not only causes a large 
increase in 6-FAM fluorescence at 518 
nm but also causes a modest increase in 
TAMRA fluorescence at 582 run. If 
TAMRA is being excited by energy trans- 
fer from quenched 6-FAM, then loss of 
quenching attributable to hybridization 
should cause a decrease in the fluores- 
cence emission of TAMRA. The fact that 
the fluorescence emission of TAMRA in- 
creases indicates that the situation is 
more complex. For example, we have an- 
ecdotal evidence that the bases of the 
oligonucleotide, especially G, quench 
the fluorescence of both 6-FAM and 
TAMRA to some degree. When double- 
stranded, base-pairing may reduce the 
ability of the bases to quench. The pri- 
mary factor causing the quenching of 
6-FAM in an intact probe is the TAMRA 
dye. Evidence for the importance of 
TAMRA is that 6-FAM fluorescence 
remains relatively unchanged when 
probes labeled only with 6-FAM are used 
in the S' nuclease PCR assay (data not 
shown). Secondary effectors of fluores- 
cence, both before and after cleavage of 
the probe, need to be explored further. 

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a 
double-labeled fluorescent probe in the 
5' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the 
value of RQ , which is the ratio of re- 
porter to quencher fluorescent emis- 
sions for a no template control PCR. In- 
fluences on the value of RQ" include 
the particular reporter and quencher 
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FIGURE 3 Effect of Mg^concentrationon RQ ratio for the Ai series of probes. The fluorescence 
emission intensity at 518 and 582 nm was measured for solutions containing 50 nM probe, 10 mM 
Tris-HCI <pH 8.3), 50 mM KC1, and varying amounts (0-10 mM) of MgCi 2 - The calculated RQ 
ratios (518 nm intensity divided by 582 nm intensity) are plotted vs. MgCl 2 concentration (mM 
Mg). The key (upper right) shows the probes examined. 




dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 
context effects, presence of structure or 
other factors that reduce flexibility of 
the oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe T m , presence of secondary struc- 
ture in probe or template, annealing 
temperature, and other reaction condi- 
tions. The third factor is the efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter arid quencher 
dyes drastically reduce the cleavage of 
probe. (1) 

The rise in RQ~ values for the Al se- 
ries of probes seems to indicate that the 
degree of quenching is reduced some- 
what as the quencher is placed toward 
the 3' end. The lowest apparent quench- 
ing is observed for probe A 1-1 9 (see Fig. 
3) rather than for the probe where the 
TAMRA is at the 3' end (Al-26). This is 
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 
of an internal position. In effect, a 
quencher at the 3' end is freer to adopt 
conformations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQ~ values 
is less clear-cut. The A3 probes show the 
same trend as Al, with the 3* TAMRA 
probe having a larger RQ~ than the in- 
ternal TAMRA probe. For the P2 pair, 
both probes have about the same RQ~ 
value. For the PS probes, the RQ" for the 
3' probe is less than for the internally 
labeled probe. Another factor that may 
explain some of the observed variation is 
that purity affects the RQ~ value. Al- 
though all probes are HPLC purified, a 
small amount of contamination with 
unquenched reporter can have a large ef- 
fect on RQ". 

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe A 1-2, where place- 
ment of the TAMRA on the second nu- 
cleotide reduces the efficiency of cleav- 
age to almost zero. For the A3, P2, and P5 
probes, ARQ is much greater for the 3' 
TAMRA probes as compared with the in- 
ternal TAMRA probes. This is explained 
most easily by assuming that probes 
with TAMRA at the 3' end are more likely 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached internally. For the Al probes, 
the cleavage efficiency of probe A 1-7 
must already be quite high, as ARQ does 
not increase when the quencher is 
placed closer to the 3' end. This illus- 



trates the importance of being able to 
use probes with a quencher on the 3' 
end in the 5' nuclease PCR assay. In this 
assay, an increase In the intensity of re- 
porter fluorescence is observed only 
when the probe Is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes oh the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (A 1-7) and other times 
not so well (A3-6). The relatively poor 
performance of probe A3-6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between the 
reporter and quencher. Therefore, the 
best chance of having a probe that reli- 
ably detects accumulation of PCR prod- 
uct in the 5' nuclease PCR assay is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the T m 
of a probe. In fact, a 2°C-3°C reduction 
in T m has been observed for two probes 
with internally attached TAMRAs. (9) This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of Increased cleav- 
age and hybridization efficiencies means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This, tol- 
erance of mismatches can be advanta- 
geous, as when trying to use a single 
probe to detect PCR-amplified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations In an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disad- 
vantage is for allelic discrimination. Lee 
et al. <l> demonstrated that allele-specific 
probes were cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target. This al- 
lowed them to distinguish the normal 
human cystic fibrosis allele from the 
AF508 mutant. Their probes had TAMRA 
attached to the seventh nucleotide from 
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the 5' end and were designed so that any 
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target. Thus, probes 
with a quencher attached to an internal 
nucleotide may still be useful for allelic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, could 
also be used to determine whether hy- 
bridization has occurred or not. Thus, 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes. The ability to detect hybridiza- 
tion In real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybrid- 
ization assays for diagnostics or other ap- 
plications. Bagwell et al.°°> describe just 
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two imperfect 
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PCR arnplifi- 
. cation. 
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Wc have developed a novel "real time" quantitative PCR method. The method measure* PCR product 
accumulation thraigh a duaHahded nubrownlc probe {Le„ TaqMan Prot*). This nutfhorl PrwU«^vwy 
accurate and reprodudble quantitation of f «nc copies. Unlike other quantitative PCR methods, real-time PCR 
does nor require poa-PCR sample handling, preventing potential PCR product carry-over contamination and 
resulting In much faster and higher throughput assays. Th* real-time PCR method has a very large dynamic 
rantre of starting target molecule determination (at fea>t five orders of magnitude). Real-time auantltarlvc 
PCR is extremely accurate and iess-labor-intensWe than current quantitative PCR methods* 



Quantitative nucleic acid sequence analysis Mas 
had an important role in many fields of biologi- 
cal research. Mcasuiciiicni of geue. expression 
(RNA) has been used extensively in monitoring 
biological responses to various stimuli Clan et al. 
1994; Huang el al. I995a,b; Prud'homme et al. 
1995), Quantitative gene analysis (DNA) has 
lH-cn toed tw determine the genome quantity of a 
particular gene, as iu the case, ot the human HER2 
gene, which Is amplified in -30% of breast tu- 
mors (Slarnon ut ah 1987). Gene and genome 
quantitation (DNA and UNA) also have been used 
for analysis of human tnummodeficiency virus 
(JlJV) burden demonstrating changes in the lev- 
ds of virus throughout the different phases of the 
disease (Connor et al. 1993; Pltttak ct al. jvv:tb; 
J'urtado et al. 199S). 

Many methods have been described for the 
quantitative analysis ot nucleic acid sequences 
(both for RNA and DNA; Southern 1V/6; Sharp ct 
al. 1980; Thomas 1980). Recently, PCR has 
proven to be a powerful tool fOT quantitative 
nucleic acid analysis. PCR and reverse transcrip- 
tase (KT)-PCR have permitted the analysis of 
minimal starting quantities of nucleic acid (as 
little as one cell equivalent). This has made ]>os- 
slble many experiments that could not have been 
performed with traditional methods. Although 
PCR has provided a powerful tool, it is imperative 
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thai It be um?U properly f«r quantitation <tt»uy- 
macKro 1995). Many early reports of quantita- 
tive PCR and RT-PCB described quantitation of 
the PCR product but did not measure the Initial 
target sequence quantity. Il is essentia] to design 
proper controls for Ihe quantitation of the initial 
target sequences (Pcrrc 1992; Clement! ei al. 
1003) 

Researchers have developed several methods 
of quantitative PCR and RT-PCR, One approach 
measures PCR product quantity in the log phase 
of the reunion before, the plateau (Kellogg et al. 
1990; Pang et al. 1990). This method requires 
that each sample has equal Input amounts of 
nucleic add and that each sample undev analysis 
amplifies with kJenlical efficiency up to the. point 
of quantitative analysis. A gene sequence (con- 
tained in ;iH samples at relatively constant quan- 
tities, such ax p-aclln) can.be us«d for sample 
wnipliTication efficiency normalization. Using 
conventional methods of PCR detection and 
quantitation (gel electrophoresis or plate capture 
hybridization), it is extremely laborious to assure 
that all samples are analyzed during the log phase 
of the reaction (for both the target gene and the 
normalization gene). Another method, quantita- 
tive competitive (QC) -PCR, has been developed 
and is used widely for PCR quantitation. QC-PCR 
n:lies on the inclusion of an internal control 
competitor in each reaction (Becker-Andre 1991; 
Hatak cl al. 199J«,1>). Th<* efficiency of oach 
action is normalised to the internal competitor. 
a known amount of lnl«--maJ competitor can be 
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added to each snmpkvTo obtain relative n»'^"*- 
ration, the unknown target I'CR product is com- 
parcel with the known competitor KIK product. 
Success of a quantitative competitive I'CR assay 
relies on aevcloplng an Internal control dial am- 
plifies with the same efficiency «s the Uugct mol- 
ecule. TJlC design of the coiupctltox «ud the vali- 
dation of amplification efficiencies jcquirc n 
dedicated effort. However, because QC-l'CR dues 
not require (liiit PCRpioduCls be analysed during 
the loe, phase of Die simplification, it is tint easier 
<.if the two methods to use. 

Severjil detection Kystcius Hie usvd_for quan 
Utative PCR and RT-UCU analysis: (1) agarose 
&els, (2) imunrscenl label! 1 115 uf PC *R products and 
detection with -Ui.tiTr-iiKluced fluorc.ncencc using 
capillary e.lcctrnphoTcsi» (hwsco et al. 1995; Wll- 
Hams ci al. 1996) or acrylamklc gels, imJ (3) plate 
capture and sandwich probe liyl>ridl^ot»t*it (Mul- 
der et al. 1994). Although these method* jm»vwJ 
successful, each method requires posl-)*CR ma- 
uipulaTions That ;uJd Time to Hie analysis <uid 
may lead to hibouitoiy 1 on I « 111 i nation. The 
sain pie throughput of these method* t* limited 
(with the vxccpllon of the plate capture ap- 
proach), wnd, thi:rufore, these methods ore not 
well suited foj u»o demanding high sample 
Throughput (i.e., screening of targe number? of 

hlwiliOtc^ulc:* wi analysing :WMUplea fwi didgiictt- 
lie* or c-hnicol truiUs). 

Here we report the development of a novel 
assay for quantitative DNA analysis. The assay is 
based on the tisf. of the .5* nuclease assay first 
described l>y Holland et al. (1991). The method 
uses the 5' nuclease Activity of 7Vi</ polymerase to 
cleave a noncxtcndlblc hybridization probe dur- 
ing the extension phnsc of I'CR- The. approach 
uses dual-Iobcicd fluorogenic hybrid! /.tit ion 
probes (Lcc et id. 3 993; Mossier ct al. 1993; Uvok 
et «1, 199oo,b). One fluorescent dye serves its ft 
reporter |PAM (i.e., <^carboxyfluorc*vcin)| and its_ 
emission spectra is quenched by the second fluo- 
resc.cn t dye, TAMRA o<:arboxy-tetr«methyl- 
rhodarninc). The nuclease degradation of the hy- 
brklixatton probe releases the quenching of Ihe 
I'AM fluorescent emission, resulting jn an In- 
crease in peak fluorescent emission at Sjy nin. 
The use Of o sequence detector (AUi Prism) allows 
measurement of fluorescent spectra of ail yo welb 
of the 1 norma I Cycler continuously during the 
1*CR amplification. Therefore,. the rem lions aje 
liionltorvU in real lime, TJlC output data is de- 
scribed and quantitative analysis of input tuiget 
I )NA sequences is discussed below. 



RESULTS 



PCR Produce DerectlOn in Real Time 

The goal was to develop a high-throughput', sen- 
sitive, unci accurate gene quantitation assay for 
use In monitoring lipid mediated therapeutic 
gene delivery. A plasm Id encoding human factor 
VIII gene sequence, pl'8TM (see Methods). w;is 
used as a model therapeutic R«nc. The assay use* 
fluorescent Taqman methodology and an instru- 
ment cajwbk- of measuring fluorescence jn real 
time (A151 Prism 7700 Sequence Detector). The. 
T«qn»ao reaction requires 11 hybridization probe 
labeled witli two different fluorescent dyes. One 
dye Is a reporter dy« (I'AM), the other is x quench- 
ing dye (TAMRA). When the pruln: i.\ intact, fluo* 
icsccjiI energy transfer occurs and the reporter 
dye fluorescent emission is absorbed by the 
quenching dye (TAMRA). During Die extension 
phase of the PCK cycle, ih« fluorescent hybrid! 
1/^'jlion probe Is cleaved by the S'-T nuclcolytic 
octlvity of the DNA polymerase. On cleavage of 
the probe, the reporter dye emission is no longer 
transferred efficiently to the quenching dye, re 
suiting l 1 ' " 11 increase of the reporter dyu fluores- 
ce" I emission upectra. PCR primers and probun 
were designed foi the human factor VI 1 J se- 
quence and human p-actln gene (as described in 
Methods). Optimization reactions were per- 
formed 10 choose the appropriate probe u"d 
magnesium concentration* yielding the hight^t 
Intensity of reporter fluorescent signal without 
sacrificing specificity. The Inst rumen I u;;es a 
charge -cot J pled device (i.e., CCD camera) for 
measuring the fluorescent emission apectm from 
, r jOO to fi$0 nm. ICach PC^R tube was monitored 
sequentially for 2ft tnsec with continuous moni- 
toring lhrc>Ughottt tlit: -amplificittioii. liach tube 
was rr.-exandned every 8.5 m.*c. Computer K>ft- 
wnre was designed to examiiir the fluorescent In- 
tensity of both the reporter <iye (PAW) . and 
the quenching dye (TAMRA). The 'fluorescent 
intensity of the quenching dye, TAMRA, changes 
very little over tlie course of the PCR amplify 
cation (data not shown). Therefore, the Intensity 
of TAMRA dye emission serves as an Internal 
standard with which to normnHy« the reporter 
dyt: (FAM) emission variations. 11 >e software cal- 
culates a value termed ARn (or A HQ) usln£ the. 
following equation: ARn - (Rn 4 ) (Rii"). where 
Kn -1 emission intensity «>f i'0|X^rter/emission in- 
tensity of quencher al any given time In ft reac 
tton tube, and Ru -emission intensility of re- 
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poncr/emlssion Jlilemily wf quencher measured 
prior to 1'CK iimplilication in ihnt same reaction 
tube. For the purpose of quantitation,, the 
three data points (ARi)S) collected during the ex* 
tension step for cadi K:k cycle were analysed. 
The nucleolytic degradation of the Hybridization 
probe occurs during the extension phase or i't:u» 
and, there-fore, reporter fluorescent cnuaMon In- 
creases during tills time, 'Hie tlnce data point* 
were averaged for each KJK cycle and the uivtui 
value for each was plotted in an "amplification 
plot" shown in Wtfure 1 A. The AKn mean value is 
plotted on the ^axis, and time, represented hy 
cycJc number, is plotted on the A-axix. Dtirlngthe 
early cycles «f (he T'CM amplification, tin- Attn 



value remains at base J J no When sufficlenl hy- 
bridl/atlon probe has been cleaved by the TtU] 
jxilymcnise miflftAftt activity, the intensity of re- 
porter Hucnxvutcnt emission inereutie^ Mo.nl PCU 
ainplifiv^tjnns reach a plateau phnNe of reporter 
fJuorc>cv.ul cmifision if the reaction is carried nut 
lo high cycle ijujiiIhmn. The amplification plot li* 
cxtmimetl vaily in th* reaction, ut a. point Ihat 
icprcsonte the log phase of product arnwtula* 
lion. 'This Is done by ussignlng an arbitrary 
threshold thai is based on the variability of the 
base-line dma. In Figure 1 A, the threshold whs set 
at It) standard deviations above the mean of 
base line emiayion iralculated from cycles 1 lo 1 fv 
Once the threshold is chosen, the point at which 
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Figure 1 PCR product detection in real time. {A) The Model 7700 software will construct amplification plots 
from the extension phase fluorescent emission data collected during the PCR amplification. The standard de- 
viation is determined Irom the data points collected from the base line of the amplification ploL values are 
calculated by determining the point at which the fluorescence exceeds a threshold limit (usually 10 times the 
Standard deviation of the base line). (S) Overlay ot amplification plots of serially (1:2) diluted human genomic 
DNA samples amplified with B-actin primers. (Q Input DNA concentration of the samples plotted versus C x . All 
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the amplification plot croMco the threshold ivdc 
fined as C r , C, is reported as I he cycle number ;ii 
thl* point. Ar will be demonstrated, lh« C, .value 
lit piediclive of the quantity of input target. 

Cj Values Provide a Quantitative Measurement, of 
Inpur Target Sequences 

Figure IB shows amplification plol* of l^oYffev. 
cut.PCR amplifications overlaid. *n^c amplify- 
Hons were performed on a 1:2 serial dilution itf 
human genomic JWA. The amplified target vwu 
human p actln. The amplification plotK Khifl to 
the right (to higher threshold cycles) ns the input 
(argot quantity n reduced, i ntf. is exported ho- 
eatlKU rmcttom with fewer starting rnpmx of the 
target molecule require greater amplification to 
degrade enough probe to attain the Threshold 
fluorescence. An arbitrary threshold of JO stan- 
dard deviations above the base line was used to 
determine the O r values. Figure 1C represents the 
O r values plotted versus the sample dilution 
value. Each dilution was amplified in trlpiicate 
Pc:r Amplifications and plotted as mean values 
with error bars representing one standard devia- 
tion. The C T values decrease linearly with increas- 
ing target quantity. Thus, c r values can be used 
as a quantitative measurement of the input target 
number. It should bo noted that the amplifica- 
tion plol for the 15.6-ng sample shown In Mgurc 
IB does not reflect the same fluorescent rate of 
Increase exhibited by most of the other samples. 
The 15,6-ng sample also achieves endpoint pla- 
teau at a lower fluorescent vaJuc than would he 
expected based on the input DNA. This phcuom* 
cnon has been observed, occasionally with other 
samples (data not shown) and may be attribut- 
able to late cycle inhibition; this hypothesis is 
sill] under investigation. It is important to note 
that the flattened slope and early plateau do not 
impact significantly the calculated O, value as 
demonstrated by the flt on iht» iim* .shown )n 
Figure, 1 C. All triplicate amplifications resulted in 
very similar O v values— the standard deviation 
did not exceed 0.5 for any dilution. This experi- 
ment contains a >1 00,0G0-fold range of Input tar- 
get molecules. Using C v values for quantitation 
permits a much larger assay range than directly 
using total fluorescent emission intensity for 
quantitation. The linear range. oJ iluorcsccnl in- 
tensity measurement of ihc A1H Prism 7700 Sc- 



mwit« over n very large ranj»«« of relive Qt^»rf tnp, 
target quantities. 

Sample Preparation Validation 

Several parameters influence the efficiency nf 
PC.R amplification: magnesium and salt coriccu* 
initio"*, reaction conditions (he., time- and tem- 
perature), PCK target size and composition, 
primer sequences, and sample purity. All of the 
.above factors are common to a single VCR assay, 
except sample lo sample purity, in an effort to 
validate the method of sample preparation for 
the iaeior Viil assay, WJR amplification reproduo 
ibility and oificlency oi 10 repJicate sample 
prrjw rat ions were, examined. After genomic DNA 
was prepared from the 10 replicate samples, ihc 
DNA wasquamiiatcd by ultraviolet spectroscopy. 
Amplifications were performed analyzing p-aciln 
gene, content in 100 and 26 n>; of total genomic 
UNA. Each I'CK amplification was performed in 
triplicate. Comparison of G,* values for each trip', 
ltcate sample show minimal variation based on 
standard deviation and coefficient of variance 
(Tabic 1). :i*hercfore, each ol the triplicate PCU 
amplifications was highly reproducible, demon- 
Stratlxig that real time PCK* using this insfn/mcn- 
trition introduces minimal variation into the. 
quantitative. J'CK analysis. Comparison of tlie 
mean C n values of the 10 replicate sample prepa- 
rations also showed minimal variability, indicat- 
ing that each sample preparation yielded similar 
results for ft-nctln gene quantity. The highest (V 
difference between any of the samples was 0.S5 
and 0.73 for the IfX) and 25 ng samples, respec- 
tively. Additionally, the amplification- of each 
sample, exhibited an equivalent rate of Huoro 
cent emission intensity change per amount of 
DNA target analyzed as indicoicd by similar 
slopes derived from the sample dilutions (Pig. 2). 
Any sample containing an excess of a I'Ck inhibi- 
tor would exhibit a greater jneasurcd 0-actln O r 
value for a given quantity of DNA. In addition, 
the inhibitor would be diluted along with the 
sample in the dilution analy.si.% (hi#. 2), altering 
the expected C r value change. Each sample am* 
pJJfication yielded a similar result in the analysts, 
demonstrating that tins mathod of sample prepa- 
ration is highly reproducible with regard lo 
sample purity. 

Quantitative Analysis of a Plasmid After 
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Table 1. lloproducthflUty of S«mpl« Preparation Method 



1 
2 
3 
4 



8 
9 
10 

Mean 



100 ng 



Samplo 

no. C T 



standard 
m£i*n deviation 



CV 



18,24 

18.23 

10.33 

1833 

18.35 

18.44 

18.3 

18.3 

18.42 

18.15 

18.23 

18.32 

18.4 

18.38 

18.46 

18.54 

18.67 

18.28 

18.36 

18.52 

18.45 

18.7 

18.73 

18.18 

18.34 

18.26 

18.42 

18.57 

18.66 

0 10) 



1«.27 0.06 



18A7 0.06 



18.34 0.07 



18.23 0.O& 



1U.42 0.04 



18.74 0.21 



18.39 



18.55 
18.-12 



0.12 



18.63 0.16 



18.29 0.1 



0.12 
0.17 



0.32 

0.V 

0.36 

0.46 

0.23 

1.26 

0.66 

0.83 

0,55 

0.65 
0,90 



20.48 

20.55 

20,5 

20.61 

20.59 

P0.41 

20.54 

20.6 

20,49 

20.48 

20,44 

20.38 

20.68 

20.87 

20,63 

21,09 

21.04 

21,04 

20.67 

20,73 

20.65 

20.98 

20.84 

20.75 

20.46 

20.54 

20.48 

20.79 

20.78 

20.62 



25 ng 



standard 
mean deviation 



20,51 0.03 



20.54 



20.43 



20.73 
20.66 



0,11 
0.06 
0.05 



20.73 0.13 



21.06 0.03 



20.68 0.04 



20.86 0.12 



20.51 0,07 



0.1 
0.19 



cv 

0.17 

0.54 

0,28 

0.26 

0.61 

0.15 

0.2 

0.57 

0.46 
0.94 



(or containing a partial cONA for human factor 
VI H, p) ; 8TM. A scries of transections was ant 
up using a decreasing amount of the plasmid v (40, 
A, 0.5, and o.l ng). IVv^niy-four hours poM- 
tn*nsf<:t*tion, total IWA wtts purified from each, 
flask of-irlh, p-Acliu gene quantity was vliuxi i a* 
a value for no* mali^t twn of genomic DN A con- 
centration from ctteli sample. In this cxpeiimcnt, 
|5-actin gene content should remain constant 
relative to total genomic DNA. Figure 3 .show:* the 
result of the p-actin DNA measurement (100 tig 
total DNA determined by ultraviolet spectros- 
copy) Ot each scimple. Kach &&rvipte was analyzed 
in triplicate and the mean p-actin values of 
the triplicates were plotted (error bars represent 

^.^rt «'+**<"»i<tfrt fipviai^oni I h#» nli»ttf*sr rtif former 



bctw<tan any iwn sample* moans was 0.95 C,.. Ten 
nanograms of total UNA of yacli sample were also 
examined for fl-actln. The results ognJu .sl«>wd 
that very similar amounts of genomic DNA were 
present; tin- maximum mean p actio C: t value 
difference wa.s 1 .0. As Figure 3 shows, the rate of 
p-actin C r change Ixriwccn the 100 and 10-ng 
sample* was similar (slope values rang« b«twoon 
3.56 and - 3.45). Tht.% verifies again thai "the 
method of .sample preparation yields samples of 
identical PCR integrity (i,0-, no sample contained 
an excessive amoujil of a PCR inhibitor). How- 
ever, these results indicate that each sample con 
talned slight diffeiencex in the actual amount of 
genomic 1>NA analyrxd. Determination of actual 
tfcuouiiv DNA concentration wos nccompllshed 
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Figure 2 Simple preparation purity. 1 he replicate 
samples shown In Table 1 wore also amplified In 
tripicate using 25 ng of each DNA sample. The fig* 
uifc shows die input DNA concentration (TOO And 
25 ng) vs. C, In irw* figure, ih«* 100 ;ind ng 
points for each sample are connected by a line. 



by p] oiling the mean p-actin O, value obtained 
for eat:! i 100 ng sample wu a p-actin standard 
i.-uive (shown In J'$h- 40). T* 1C «eUi»l genomic 
1>NA concent r;i<l««i of each sirmpU:, a, was ob 
talncd t>y extrapolation to thu x<u*U, 

Figure 4 A shows the measured (I.e., i\m\* 
normalised) qua u till e.N ut factor VI) J ploMinul 
DNA (pJtn'M) from each of tlic four transient cell 
InnNsfeelionii. Each reaction contained J GO tig of 
total sample DNA (as determined by UV spectros- 
copy)- llach sample was analyzed in triplicate 



|>C ;r< a7npHficatioi)!(. As shown, pl : 8TM purified 
,fioic J be 20H cells decreases (mean C, values in- 
cuui.«£j with decreasing amounts uf plasmld 
Urumh'UCCL The' inc»n C L values obtained for 
p*WM in 'Figure 4A were plotted uu a sUmdyrd 
eurvc comprised uf seiially dihilcd pFHTM, 
shown .in figure 4R. The quantity uJ pFKTM, b, 
found in each of the four iranfifectlonK was de- 
termined by extrapolation to the or vxk of the 
standard curve In Figure 4H. Thttsc uncorrected 
m values, h, for pKWM were nonnallx^d to deter- 

mine the actual amount of pF81M found per 100 
n« « r genomic DNA by using ihc equ&lion:. 

h X 10 0 o« actual plWM copies ner 
^ r 100 ng of genomic DNA 

where a - actual genomic DNA in u sample and 
fcwpPHTM copies from the standard curve. The 
normalised quantity of pl'STM per 100 ng of ge- 
nomic DNA for each o/ the four irana'fccilons Is 
snown in Figure 4JJ. 'H ie.se re.Null* Miow thai the 
quantity of factor Vlll plasiuld assuvJuted wiih 
the 293 cells, 24 lir wftvr truiusfwlMin, dei.ie.isev 
with decreasing pJttMuul eoiieeniiatJon u.sed In 
the transection, Tlie quuntlty of pFtiTM associ- 
ated with 293 ceils, after trunsfealon with 40 ng 
of pi;ismid t was 35 pg per 1 00 ng genomic DNA. 
Tills results in -520 plasinid copies per cell, 



9R 



25 



h 94. 



23 




V » 27.73 * Wr H- 1 



1.4 1-3 1-8 
1*9 (ng **put DNA) 

Hgure 3 Analybb of ticinslectcd cell DNA quantity 
and purity- I he DNA preparations of Hie lour 293 
cell transfections (40, 4, 0.5, and 0.1 jxg of pF8TM) 
were analysed for the 0-actln gene. 100 and 10 ng 
(determined by ultraviolet spectroscopy) of each 
sample were amplified in triplicate. For each 
amount of pF8TM that was transfected, the f*-aciln 
C T values are plotted versus the total Input DNA 



DISCUSSION 

We have described a new method for qua n turn- 
ing- gene copy numbers using /edMlrnc anulysls 
of PCJR amplifications. ReaMimc FCH in compat- 
ible with cither of the two PCK.(KT-PCR) ap- 

proaclie>: (1) quantitative competitive where An 
inteuiul cuni|X£tiLor for each target sequence b 
used for nornialixotJon (data not shown) or (2) 
quautiiaUve comparative PCH using ti imiiintliM- 
tion gene coniained withbi the sample (i.e., (3-nc- 
tin) or a "housekeeping" gene for RT-PC-K- ft" 
equal amounts of nucleic add are analyzed for 
each sample and if the amplification cf/kiency 
before quantitative analyst i> identical for each 
sample, ihe iTUernai conlnii (nojmnli^iioH gene 
or competitor) Should give equal Mgnah for flIJ 
samples. 

The real-lime PCH method <;ffers several ad- 
vantages over the other two methods <:urrcntly 
employed (see the Introduction). First, the real- 
time PCR method Is performed in a doscd-tube 
system and requires no post-PCR ntanipulatlon 
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Figu™ 4 Quantitative Analyst* of pF8TM in tiWccicd cells. <^ Amount of 
plasmtd DNA used for the transaction plotted against the invun C, value deter- 
T^fi^j^™ remaining M hr alter transection. (0,C) Standard curvr* of 
P<*H1M ana 1 P-actln, respectively. p r 0TM DNA (0) and genomic PNA (Q were 
dllutftd itftrlally 1;S before amplification with the appropriate primers. The p-oclin 
standard curve wav usod lo normalise the results of A to 1 00 rig of oenornic DNA. 
(0) The amount of pf8TM present p<:r 100 ng of genomic DMA. 



of sa.mplv. Therefore,. tin- potcmiti) for TCK con- 
tamination in the laboratory is reduced because 
amplified product*: can l w ; 4 ijHly/.ed and disposed 
of without opening the ruaetion tubes. Second, 
this method siippoiU the u»« of •* iit/rni;i1i* ( itk>H 
gene (i.e., P-actin) for quantitative PCR or house- 
keeping genes for quantitative RT-I'CU controls. 
Analysis Is performed in real time during the Jog 
phase of product accumulation. Analysis during 
k*K phase permits many different genes (over a 
wide input target range) to be analyzed simulta- 
neously, without concern of reaching reaction 
plateau at different cycles. Tliis will moke jjiuUI- 
analysis assays much CAMei to develop, be- 
cnuac individual internal i.ompetUois will not be 
needed for each gene under analysts. Third, 
sample throughput will hu.it:a>c diamaticalty 
with the new method because, theve is no post- 
PCR processing time. Additionally, wniking In a 
^6-wel) format In highly compatible with auto- 
mation technology. 

The real-time 1>CR method is highly repro. 
ducible. Replicate amplifications can be analysed 



for «ach sample minimizing jnttcnuai emir. The. 
sysntm allow* for a very Jar^e assay dynamlj: 
ruiigi- <uppro;u:bm$ 1,000,000- fold starting tai- 
gel). UmIn K a .standard curve for the target ol in- 
terest, relative copy number values can be deter- 
mined for any unknown sample, Fluorescent 
threshold values, C, v cojiehite linearly with rela- 
tive DNA copy numt>ers. UeaJ time quantitative 
KT-PCK methodology (Cilbwii et al., this Issues) 
has also been developed, finally, real lime quan- 
titative PCU methodology can be used to develop 
high-throughput &crcching assay* for n variety of 
applications [quantitative gene CApjca&ioii (KT- 
rCR)i ficne copy nnxayz <Mcr2, 131V, CtO), gcm> 
1 yP* n B (knockout mouse, analysis), and J mm urn r 

pouj. 

Real-time PGR may &\m t«t jTcrformed using 
intercalating dyes (Htguchi et al. Mich as 

cfJUdium bromide. The fluorogenic probe, 
method offers a major advantage over inter- 
calating dyes- greater specificity (i.e./ primer 
dimers and nonsperJfle PCR product.s are not de- 
tuned). 
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METHODS 

Generation of <t ttasmld Cunlalnlng a Partial 
cDNA for Human Factor VIII 

Total KNA w<u harveMcd (UNAko* » from Tel Tc$t, Inc., 
*Tje:nCl3WOOd, TX) from cell*. li*n*ic<ied with a factor Vlll 
expression vector. pCIS2.tk4&0 (Katun vl ftl. 1V86; Cor. 
mon r.t ah 1990). A factor VIII partial chNA wpieiitv W«S 
^•,„.rnud uy irr PC :it jcioneAmp I'-Z tTth UNA Ur*U 
(part NW/K-ov/y, l'h Applet) IsiosystcmiC I'ostei City, CA)J 
uslnf, ihe VCllt pritm-rs »i»*l Fflrev (primer sequences 

arc shown below), tin* ampllcon was feampliOi-il aMnfc 
modified I'flfor and Wrcv primers (qipeiulrd with H«m\\\ 
and HhtdWl restriction site sequences at thv vend* and 
clonal Into jXiKM- 3Z (lVonH^a Corp., MikIlhoii, Wl). The 
resulting d«nr f pPftlTvt, Was used lur transient traosfocUon 
oJ "2W ccJU. 



Amplification of Target DN A ant! Dvlcctlon of 
Amplicpn Factor VIII Masmid DNA 

(pr'HTM) was (tinpllfUTil wllh die ptimei* PBfor 5 '-C<X 

<rr(K;(;AACiAu:j'CiAt:tiicnc-3* and rarev .s*-AAA<:crr- 

t^OC;CrOGATC;Ci l AtiO-^MlM; rvwctUm piodnevd « '122- 
np k:k product. The forward primer was debited to icv 
ngitlxe u unique MMpiemc ft mi id In the 5' untranslated 
region of Hit* pa i tf ill uC152.b\Z$l> pi ^ mi ml <iml llmicforc 
does not rvoiffpiUu itnd amplify llie human factor VII) 
gem*. I'rimnrit w«ro chojion with the jtvwrtimrf'of I he com- 
puter program Oli^o 1,0 (N<jitim;il ItUiAeicoccs, Inc. Ply. 
month, MN). The human p-aetl« gpne whs amplified with 

the urixucr> fJ-m-tSn forward primer A' ■TCACOOAC.lAt 7I"< IT 
GCCCAT^rACX'iA-.V and P-actiu reverse p#imcr S'.CAf;. 

CGCAACCX;crj*<;A nxx:c^AJ'GG-3'. 'fhc' reaction pro- 

OUCCCI 3 ?,V5 np I'OK product. 

Amplification reactions (50 pj) contained a DNA 
sample. K)x VCM Huffur II (a p.)) t 200 jtM dATl', dCTl\ 
dGTP, And 400 p,M dt)IT # 4 m\i MgCI,, l.ZS Unlls AmpH 
7W<; DNA polymerase, 0.5 unit AmpKrosc uracil N-Rty- 

coayluMc (UNO), £0 pin oil.' of each facKot Vlll jtrliiwi/ und IS 
inmiAv of uuoh p >ct In pdnifir. 71m ioavtlo»v* h1m> 4H>nlaltic<J 
one of the following dtirrtimt proiws (W«J hm mrU): 
i't»j»n.i><- A'(KAM>Ac:c v *r< , ri , <:cAc:trr<;frn , (ri-rr<:rcri*- 

GCCTT(TAMRA)p 3' <mtf p-ni-tui probe 5 f (FAM)ATGf.X:c:- 
X(J*AMKA)CCCCr;ATGGCATCp-3' where p indicates 
phoiphoryl/iiiAii nrvd X indicates a linker arm nucleotide. 
Reaction tuln;? w«tx* Mi«:njAft\p Optit^nl 1*ur>es (part ntl/TV 
bc-rNKOI OO.t.H, rcrkJn lUoivr) wufe f rocto<l (v*t IVrklvi 
Elmer) (o pre wnl Uylit from /cflcctlnp. Tube cap* were 
Simitar in Micro Amp t;npa I>t4l Apcciftlly dcsiftilCCt to pre- 
vent light scattering. All <»l \\t<* Vt'M ^m/twiu(»l>U-* were 5U>w 
pliv:d by PK Applied UjOtfyKtem^ (|!<»»ter CMy, CA) CKCcpl 
the* factor Vlll primers, wlileb weie %y niheslzrd ut Cenen 
tech, Inc. (South Francisco, CA). Prober w«t** dc-sl^ncnl . 
u.Ung the Oligo 4.<> software, folloWlMK giitilellneif 

j»csieo in tnc Model 7 700 .sequence iH*tan>r iii^tiunirnt 
manual. Hr lefty, prube T m s)ttm\d hr a I Jcaal 5 U C hlftlier 
mail XUv aane«nnx iei(i|^*Mlnrc u.>td during the mini cy- 
rlmg/ primers sho\ihl jjoi fuim M*iblv duplexed with the 
probe. 

The tlieruj**! i'ycllng euitdiUo»vs Inetuded 2 niln ftl 
SO'Caild 10 ruin «il 95 ,, C. 'Iliejmal cycling procrrdrd with 

mm 



reaction* were performed if* lh< ( Model 7700Scqitcn«r IV- 
l<Htor (PE ApplU'd Ulusyvtviuv), which conuiin ».f.1cne- 
Amp l*<:U SyMvm POOO. kc:a<:Oon conditions w*rr- pro. 
Rruilliltexl mii .< I'iiww Macaut«»b 7100 (Apple C>.in*pntr*r, 

Santa Clara, CA) linked dirvuly to the Model V7fw> 
ijucnw iXiloclor* Anwly* u i,r daU w * v nlKC> performed on 
the Mcintosh computer C'.nllnctUm and amdynU enftwaro 
wju devvlo|wl Ht PK Applied Blosy^tuins. 

Traiufectton of Cells with Factor Vlll Conrfrucl 

Pour T17S flaski of 293 cells (ATCC ClKl. 1573), n human, 
feiol Ultbicy sutipetifcion cell line, were n r " w " lo IWMi.con- 
llueney and tranjfceted pt : fri*M. Cells were Krowu In the 
following medio! 5()% HAM^ 1^12 wllhoul GMT, 50% IniM 
glucose JHdlKtXN^ modified B«le medium (UMUM) wiU). 
out glyrint; wiUi sodium bicadxmate, 1(1% Iclal btmnc 
seruui, 2 him L-Klutantmv <>nr1 1% pcnkilliii-ylrrpiomy- 

VlO, 'I'llC mcdlfl WW dld"gcd 30 mln Mo«* the tronsfee 

lion. plWM DMA amountt* of 40, 4, 0..S, nnd 0.1 m were 
«iUitr;<l to 1..S ml of ft solution containing 0.125 m CiiCl^- 
Jtnd t X MKPUS. The four mixtures were left at rvom teiti- 
pe.mOirf f<« TO mln and then acUlett rlmpwliw U> the cells. 
The fl»»k> wv»* ;iMi.iil^tod at 37°C and ClX for 24 hr, 
washed with PltS, and «ui M :ipcndcd in PUS. The re.tiiH 
)n'iuK:d cdh were divided into aliquot* ur»d UMA wa4 ctf- 
tr««U'd lmincdlutclyu»iii« IheQIAamp KUhkI Kit (0Uc,en. 
aitttwortli, <.^). l>NA w<is eJuted Into 2(K) ol 30 mU 
1VM1CJ ut pll 8.0. 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISPS, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISPS, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (/) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyline repressible promoter, and (ii) Wnt-1 transgenic 
mice. The WISP-I gene was localized to human chromosome 
8q24.1-8q24J. WISP-I genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISPS 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to >40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISPS mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to > 30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 

Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnl-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsn) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3/3 (GSK-30) resulting in an increase in 
/3-catenin levels. Stabilized /3-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (A PC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
0-catenin levels (9). APC is phosphorylated by GSK-3/3, binds 
to 0-catenin, and facilitates its degradation. Mutations in 
either APC or /3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-j3 superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn) y and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and retractile ceils that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP- J 
and WISPS, and a third related gene, WISPS. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov y a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778. AF100779, AF100780, and AF100781). 
*To whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 u.g of P ol y( A )* RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 /xg 
of poly(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WlSP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
. coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 jiM of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW.403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation fluorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2<A«) where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
8-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The WAS/'-specific signal was 
normalized to thai of the glyceraIdehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 



mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on 0-catenin levels (13, 14). Expression oiWlSP-1 was 
up-regulated approximately 3-fold in the C57MG/ Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-I gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-I were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of *=40,000 (A/ r 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 2A). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of **27,000 (M r 27 K) (Fig. IB). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 
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Fig. 1. WISP-1 and WISP-2 arc induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WISP-2 (B) expression in C57MG, C57MC/Wnt-1, and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 pg) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP-1- specific probe 
(amino acids 278-300) or a 190-bp WISP- 2- specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human /3-actin probe. 
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FiC. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 (A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (1GF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-1 . 

Identification of WISPS. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and" 32% identity with WISP-3 (Fig. 3A). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISPS are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Eiml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogen ic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule, that promotes ceil adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overex pressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. AH are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (1GF)- 
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Fic. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerizatiori and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown).. 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 



14720 Cell Biology, Medical Sciences: Pennica et al. 



Proc. Natl. Acad. Scu USA 95 (1998), 



analysis on adult and fetal multiple tissue cDNA panels. 
WISP- 1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, cojon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISPS was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISP-3 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP- 1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 £-//). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. (A, C, £, and G) Representative hematoxylin/ eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
O. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power {A and B), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas lumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISP-3 mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISP-3 is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (21, 29). 

Amplification and Aberrant Expression ofWISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression, 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus.. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis. (Fig, 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr ceil lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISP-3 was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 




Genomic DNA 




Fig. 5. Amplification of WISP-I genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 jtg) 
digested with £coRI (WISP-1) or Xbal (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901^2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-foId) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISP-3 RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 1. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient- 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative. PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means i SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISP-3 ranged from 
4- to > 40-fold. 

DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of. 
a letracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, . 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., 0-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-1 -transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through )3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v p3 serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
- tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-pl, which is the stimulus for 
stromal proliferation (34), TGF-j31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the. Writ- 1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply. WISP-1 and 
W1SP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP^2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 ampHcon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and 0-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. . 
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methods. Peptides AENK or AEQK were dissolved in water, made isotonic with 
NaCl and diluted into RPMI growth medium. T-cell-proliferation assays were 
done essentially as described 20,21 . Briefly, after antigen pulsing (30u,gmr' 
TTCF) with tetrapeptides (l-2mgmr'). PBMCs or EBV-B cells were 
washed in PBS and fixed for 45 s in 0.05% glu tar aldehyde. Glycine was added 
to a final concentration. of 0.1M and the cells were washed five times in RPMI 
1640 medium containing 1% FCS before co-culture with T- cell clones in 
round-bottom 96-welI microtitre plates. After 48 h, the cultures were pulsed 
with 1 u,Ci of 3 H -thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 u.g TTCF with 0.25 u.g 
pig kidney legumain in 500 u.1 50 mM citrate buffer, pH 5.5, for 1 h at 37 °C. 
Glycopeptide digestions. The peptides HIDNEEDI, HIDN(AT-glucosamine) 
EEDI and HIDNESDI, which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic ( Promega) digestion 
of 5mg reduced, carboxy-methylated human transferrin followed by 
concanavalin A chromatography". Olycopeptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N- terminal sequencing. The lyophilized transferrin - 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol. Digestions were performed for 3 h at 30 °C with 
.5-50 mUmF 1 pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of lOmgrnl" 1 a- 
cyanocinnamic acid in 50% acetonitrile/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal standar- 
dization was obtained with a matrix ion of 568.13 mass units. 
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Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target ceils through the death receptor Fas/Apol/CD95 (ref. 1). 
One important role of FasL and Fas is to mediate immune- 
cytotoxic killing of cells that are potentially harmful to the 
organism, such as virus-infected or tumour cells 1 . Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL. 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG)\ 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy-terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and. lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected with individual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL 4 (Fig. 2a), but not to cells transfected with 
TNF 5 , Apo2L/TRAIL 67 , Apo3 L/TWE AK HV , or OPGL/TRANCE/ 
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RANKL 10 "' 2 (data not shown). DcR3-Fc immunoprecipitated shed 
FasL from FasL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c ), as did the Fc-tagged ectodomain of Fas but not 
TNFRL Gel- filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (K d = 0.8 ± 0.2 and 
l.l±0.InM, respectively; Fig. 2e),. and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig: 3a). DcR3- 
Fc and Fas-Fc blocked soluble- FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
—0.1 fjLgmF 1 . Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (AICD) of mature T lymphocytes, a FasL- 
dependent process'. Consistent with previous results 13 , activation 
of interleuicin-2-stimulated CD4-positive T .cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 
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Figure 1 Primary structure and expression of human DcR3. a, Alignment of the 
amtno-acid sequences of DcR3 and of osteoprotegerin {OPG); the C-ierminal 101 
residues of OPG are not shown. The putative signal cleavage site (arrow), the 
cysteine-rich domains (CRD 1 -4), and the A/-linked glycosylation site (asterisk) are 
shown, b. Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A)* RNA (Clontech) from 
human fetal and adult tissues or cancer cell lines. PBL, peripheral blood 
lymphocyte. 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer ceUs and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes M4 ""\ Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from —65% to 
—30%, with half-maximal inhibition at —lu-gml" 1 ; the residua] 
killing was probably mediated by the perforin/granzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and Fas-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL 17 . 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequendy contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 



t50 



c 0 
§150 
o 



Vector 




FasL 



0 10* 



bAfr(K ) Vector 



102 103 104 
PE 

FasL 



30 
22 



I 



c/Wr(K) 



•«— n to i- ro oo 

or or ™ or o: « 

2 O Z Q 



30- 



t- CO 

or or 



,? -4 




10- 1 10 1 
sFasL (nM) 



Figure 2 Interaction of DcR3 with FasL a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (soitd line, shaded area), TNFRl-Fc (dotted line) or buffer control 
(dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference (P < 0.001 ) between the 
binding of DcR3-Fc to cells transfected with FasL or pRK5. PE, phycoerythrtn- 
labelled cells, b. 293 cells were transfected as in a and metabolically labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFR1. DcR3 or Fas. 
c. Purified soluble FasL (sFasL) was immunoprecipitated with TNFRl-Fc, 0cR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d. Flag-tagged sFasL was 
incubated with DcR3-Fc or with buffer and resolved by gel filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag ; e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Flag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PCR) 18 in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18- fold (Fig. 4a f b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3-based PCR primers and probes; we observed nearly the 
same amplification (data not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 
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Figure 3 Inhibition of FasL activity by DcR3. a, Human Jurkat T leukaemia cells 
were incubated with Rag-tagged soluble FasL (sFasL;. 5ng ml -1 ) oligomerized 
with anti-Flag antibody (0.1 iigmP 1 ) in the presence of the proposed inhibitors 
DcR3-Fc f Fas-Fc or human IgG! and assayed for apoptosis (mean * s.e.m. of 
triplicates), b. Jurkat cells were incubated with sFasL-Flag plus anti-Flag antibody 
as in a. in presence of 1 u,g rnt -1 DcR3-Fc (filled circles). Fas-Fc (open circles) or 
human IgGi (triangles), and apoptosis was determined at the indicated time 
points, c. Peripheral blood T cells were stimulated with PHA and interteukin-2, 
followed by control (white bars) or anti-CD3 antibody (rifled bars), together with 
phosphate-buffered saline (PBS), human IgG 1. Fas-Fc. or DcR3-Fc (l0u.g ml' 1 ). 
After 16 h. apoptosis of CQ4* cells was determined (mean ± s.e.m. of results from 
five donors), d. Peripheral blood natural killer cells were incubated with 5, Cr- 
labelled Jurkat cells in the presence of DcR3-Fc (filled circles). Fas-Fc (open 
circles) or human IgGi (triangles), and target-cell death was determined by 
release of 5l Cr (mean i s.d. for two donors, each in triplicate). 
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we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM2 1 8xe7 (T160), which maps to 
chromosome position 20ql3. Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic clone that 
carries DcR3 v and sequenced the ends of the clone's insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3-linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre' of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNF- 
ligand- family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG 2 ' 9 . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface' 7 . DcR3 competes with Fas for 
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Figure 4 Genomic amplification of DcR3 in tumours, a. Lung cancers, comprising 
eight adenocarcinomas (c. d. f, g. h, j. k. r). seven squamous-cell carcinomas (a. e. 
m, n. o, p, q). one non-small-cell carcinoma (b), one small-cell carcinoma (i). and 
one bronchial adenocarcinoma (I). The data are means ±r s.d. of 2 experiments 
done in duplicate, b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means ± s.e.m. of five experiments done in duplicate, c. in situ hybridization 
analysis of DcR3 mRNA expression in a squamous-cell carcinoma of the lung. A 
representative bright-held image (left) and the corresponding dark-field image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S), blood vessel (V) and necrotic tumour tissue 
(N) are also shown, d. Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward. Rev and Fwd). the 
DcR3-linked marker T160, and other chromosome-20 markers, in the nine colon 
tumours showing DcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's r-test 
comparing each marker with DcR3. 
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FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin- 1 has been 
described 2 '. In addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL- related apoptosis- 
inducing molecule Apo2L 22 . Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR- family member is 
OPG\ which shares greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L 19 . Thus, DcR3 and OPG define a new subset of 
TNFR-family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. □ 



Methods 

Isolation of DcR3 cDNA. Several overlapping ESTs in Gen Bank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyte 
Pharmaceuticals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-generated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the codirig'regibhs of these clones were identical in size to that of 
the initial clone (data not shown). ; 
Fc-fusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas or TNFR1, was fused to the hingearid Fc region of human 
IgGl, expressed in insect SF9 cells or in human 293 cells, and purified as 
described". 

Fluorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using calcium phosphate orEffectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human FasL 4 (2 u,g), together with pRK5 encoding CrmA 
(2u,g) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-Fc orTNFRl-Fc and then with phycoerythrin-conjugated 
streptavidiri (GibcoBRL). and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected cells by DcR3-Fc; as these cells express little FasL (data 
not shown),, it is possible that DcR3 recognized some other factor that is 
expressed constitutiyely on 293 cells. 

Immunoprecipitatron. Human 293 cells were transfected as above, and 
meUbolicaUy labelled with [ J5 Slcysteine and [ 55 S] methionine (0.5 mCi; 
Amersham). After 16 h of culture in the presence of z-VAD-fmk (10u.M), 
the medium was immunoprecipitated with DcR3-Fc ? Fas-Fc or TNFRl-Fc 
(5u,g), followed by protein A-Sepharose (Repligen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified, Flag-tagged soluble FasL (1 (xg) (Alexis) was incubated 
with each Fc-fusion protein (1 ixg), precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by unmunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysts of complex formation. Flag-tagged soluble FasL (25p.g) was 
incubated with buffer or with DcR3-Fc (40 jig) for 1.5 h at 24 °C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6-ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 100 >xl aliquots into microtitre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc, 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin-horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homotrimers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 



IgG, blocked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
above, and the wells were blocked with excess IgGl before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell A1CO. CD3* lymphocytes were isolated from peripheral blood of 
individual, donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 u,g ml" 1 ) for 24 h, and cultured 
in the presence of interleukin-2 ( 100 U ml"') for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later.by FACS analysis of annexin-V-binding of CD4* cells". 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltenyi 
Biotech), and incubated for 16 h with 5, Cr-loaded Jurkat cells at an effector- 
to- target ratio of 1:1 in the presence of DcR3-Fc, Fas-Fc or human IgGl. 
Target-cell death was determined by release of ?*Cr in effector- target co- 
cultures relative to release of 51 Cr by detergent lysis of equal numbers of Jurkat 
cells. 

Gene-amplification analysis. Surgical specimens were provided by |. Kern 
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the concentration was determined using Hoechst dye 33258 
intercalation fluorometry. Amplification was determined by quantitative PCR" 
using a TaqMan instrument (ABI ). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER- 2 oncogenes (data 
not shown). Gene-specific primers and fluorogenic probes were designed on 
the basis of the sequence of DcR3 or of nearby regions identified on a BAC 
carrying the human DcR3 gene; alternatively, primers and probes were based 
on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest 
available marker which maps to -500 kilobases from T160, and five extra 
markers that span chromosome 20. The DcR3-specific primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the 
fluorogenic probe sequence was 5 ' - ( FAM -ACACGATGCGTGCTCCAAGCAG 
AAp-(TAMARA), where FAM is 5' -fluorescein phosphoramidite. Relative 
gene-copy numbers were derived using the formula 2 <ACT) , where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotes 1 . The recently completed Escherichia coli genome 
sequence revealed that the largest family of paralogous E. coli 
proteins is composed of ABC transporters 2 . Many eukaryotic 
proteins of medical significance belong to this: family, such as 
the cystic fibrosis transmembrane conductance regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter assodated: with antigen processing 
(Tapl-Tap2). Here we report the crystal structure at 1.5 A resolu- 
tion of HisPt the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from Salmonella typhimurium. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and. of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains'. In 
prokaryotes these domains are often separate subunits which are 
assembled into a membrane-bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain. The 
periplasmic histidine permease of S. typhimurium and E, co/i 1 * 3 "* is a 
well-characterized ABC transporter that is a good model for this 
superfamily. It consists of a membrane-bound complex, HisQMP 2 , 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins 9 , is accessible from both sides of the membrane, 
. presumably by its interaction with HisQ and HisM 6 . The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
activity*, and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 . HisP has been purified 
and characterized in an active soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex 8 . 

The overall shape of the crystal structure of the HisP monomer is 
that of an T with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig. 1). A six-stranded P- 
sheet (p3 and p8-0 12) spans both arms of the L, with a domain of a 
a- plus p-type structure (pi, p2, p4-p7, al and a2) on one side 
(within arm I) and a domain of mostly a-helices (a3-a9) on the 




Figure 1 Crystal structure of HisP. a, View of the dimer along an axis 
perpendicular to its two-fold axis. The top and bottom of the dimer are suggested 
to face towards the periplasmic and cytoplasmic sides, respectively (see text). 
The thickness of arm II is about 25 A, comparable to that of membrane. a-Helices 
are shown in orange and p-sheets in green, b, View along the two-fold axis of the 
HisP dimer, showing the relative displacement of the monomers not apparent in 
a. The p-strands at the dimer interface are labelled, c. View of one monomer from 
the bottom of arm I, as shown in a. towards arm II. showing the ATP-binding 
pocket, a-c, The protein and the bound ATP are in 'ribbon' and 'ball-and-stick' 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOLSCRIPT 2 *. N. amino terminus; C. C 
terminus. 
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Gene amplification is a common event in the progression of 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase-chain-reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor DNA. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase, rather than the amount of PCR 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndl and 
erbB2) in breast tumors. Extra copies of myc, ccndl and erbB2 
were observed in 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18.6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique will make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings. Int. J. 
Cancer 78:661 -666, 1998. 
© 1998 Wiley-Liss, Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosomal double minutes 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kallioniemi et al., 1 994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include myc (8q24), 
ccnd\ ( 1 1 q 1 3), and erbB2 ( 1 7q 1 2-q2 1 ) (for review, see Bieche and 
Lidereau, 1995). 

Amplification of the. myc, ccndl, and erbB2 proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Berns et ai, 1992; 
Schuuring et al., 1992; 9tamon et al. 1987). Muss et al. (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance, 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon et al. (1987) between 
erbBl amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
ug/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The methodjs relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard cup«?e- is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product. Interest in the kinetic method has been stimulated by a 
novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et ai. 1996; Heid et 
ai. 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et ai (1991). The latter uses the 5' 
nuclease activity of Taq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The 
approach uses dual-labeled fluorogenic hybridization probes (Lee 
et ai. 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (i.e., 6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA (i.e., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydrolyzed by the 
5'-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye (i.e., ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et al, 1994): ft) the 
probe-based homogeneous assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (it) the C, (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why Q is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (Hi) use of C, 
values gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the need for serial dilution; (iv) The real-time PCR 
method is run in a closed-tube system and requires no post-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96 wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available just one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 nr. 

Here, we applied this semi -automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
tumors (myc, ccndl and er£B2), as well as 2 genes (alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 108 breast tumors 
were compared with previous Southern-blot data for the same 
samples. 



MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 1 08 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular- weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-rime PCR 

Theoretical basis. Reactions are characterized by the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter C, (threshold cycle) is 
defined as the fractional cycle number at which the fluorescence 
generated by cleavage of the probe passes a fixed threshold above 
baseline. The target gene copy number in unknown samples is 
quantified by measuring C, and by using a standard curve to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e., lack 



of extensive degradation) are both difficult to assess. We therefore 
also quantified a control gene (alb) mapping to chromosome region 
4qll-ql3. in which no. genetic alterations have been found in 
breast-rumor DNA by means of CGH (Kallioniemi et al., 1 994). 

Thus, the ratio of the copy number of the target gene io the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N", and is determined as follows: 

copy number of target gene (app, myc. ccndl, erbS2) 

N — ■ . 

copy number of reference gene (alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA). 

Primers were purchased from DNAgency (Malvern, PA) and 
probes from Perkin-Elmer Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, MicroAmp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Standardrcurve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et al. (1993). In 
practice, each specific PCR product was obtained by amplifying 20 
rig of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real-time quantitative PCR. The 5 PCR products were purified 
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- 
den) electrophorezed through an acrylamide gel and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectrophotometrically and pooled, and serially 
diluted 10-fold in mouse genomic DNA (Clontech, Palo Alto, CA) 
at a constant concentration of 2 ng/ul. The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from 10" 7 (10 5 copies of each gene) to 
10~ 10 (10 2 copies). This series of diluted PCR products was 
aliquoted and stored at -80°C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 ng). 

PCR. amplification. Amplification mixes (50 pi) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
10X TaqMan buffer (5 ul), 200 uM dATR dCTP, dGTP, and 400 
uM dUTP, 5 mM MgCI 2 , 1 .25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 nM probe. The thermal cycling conditions comprised 2 min at 
50°C and 1 0 min at 95°C. Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65°C for 1 min. Each assay included: a standard 
curve (from 10 5 to 10 2 copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96-weIl microplate). All 
samples with a coefficient of variation (CV) higher than 10% were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates Q and determines the starting copy number in the 
samples. 
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Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erbB2 proto-oncogenes, 
and the p-amyloid precursor protein gene (app), which maps to a 
chromosome region (21q21.2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi et ai, 1994). The 
reference disomic gene was the albumin gene (alb, chromosome 
4qll-ql3). 



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/uh It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as 10 3 copies or as 
many as I0 5 copies. 

Copy-number ratio of the 2 reference genes (app and alty 

The app to alb copy-number ratio was determined in 1 8 normal 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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signal established ir. the first 15 PCR cycles. ARn increases during PCR as alb PCR product copy number increases until the reaction reaches a 
plateau. L, (threshold cycle) represents the fractional cycle number at which a significant increase in Rn above a baseline signal (horizontal black 
line) can first be detected. Two replicate plots were performed for each standard sample, but the data for only one are shown here Bottom- 
Standard curve plotting log starting copy number vj. C, (threshold cycle). The black dots represent the data for standard samples plotted in 
duplicate and the red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders of linear dynamic 
range. . J 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions (app. 2iq21.2; alb, 4qll-ql3) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi et ai, 1994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean. 1.02 ± 0.21), and was similar for the 108 primary- breast- 
tumor DNA samples (0.6 to 1.6, mean 1.06 ± 0.25), confirming 
that alb and app are appropriate reference disomic genes for 
breast-tumor DNA. The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl and crbB2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 1 8 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0.5 to 1.3 
(mean 0.84 ± 0.22) for myc, 0.7 to 1.6 (mean 1.06 ± 0.23) for 
ccndl and 0.6 to 1 .3 (mean 0.9 1 ± 0. 1 9) for erbB2. Since N values 
for myc, ccndl and crbBl in normal leukocyte DNA consistently 
fell between 0.5 and 1 .6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 

myc, ccndl and erb52 gene dose in breast-tumor DNA 

myc, ccndl and erbB2 gene copy numbers in the 108 primary 
breast tumors are reported in Table 1. Extra copies of ccndl were 
more frequent (23%, 25/108) than extra copies of erbB2 (15%, 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccndl, 2 to 15.1 for er£B2, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table II represent tumors in which the ccndl gene was 
amplified 16-fold (T145), 6-fold (T133) and non-amplified (T118). 
The 3 genes were never found to be co-amplified in the same tumor. 
erbB2 and ccndl were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc and erbB2 in 1 case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% in the 
erbB2 copy number (N < 0.5), suggesting that they bore deletions 
of the 17q21 region (the site of erbB2). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southern-blot analysis of myc, ccndl and erbB2 amplifications 
had previously been done on the same 108 primary breast tumors. A 
perfect correlation between the results of real -time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N ^ 5). However, there were cases (1 myc, 6 ccndl and 4 erbBT) 
in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2,9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE 1 - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc. 
ccndl AND erAB2 GENES IN 108 HUMAN BREAST TUMORS 



Gene 




Amplification 1 


level (N) 




<0.5 


0.5-1.9 


2-4.9 


>5 


myc 

ccndl 

erbBl 


0 

0 

5 (4.6%) 


97 (89.8%) 
83 (76.9%) 
87 (80.6%) 


11 (10.2%) 
17(15.7%) 

8 (7.4%) 


0 

8(7.4%) 
8 (7.4%) 



reagents and requires relatively large amounts of high-quality 
genomic DNA, which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DNA (small early-stage rumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a PCR method developed for the 
quantification of gene over-representation in rumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et ai, 1 994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-glycosylase (UNG) 
(Longo et ai, 1990). The second advantage is the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the number of 
copies of a target gene in more than 100 rumors per day. Third, the 
system has a linear dynamicrange of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin-fixed, paraffin-embedded tissues. Fourth, and above all, 
real-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables C, to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the C, value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
Q ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disa vantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
ern blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained in 
heterogeneous tissue specimens. Only FISH and immunohistbehem- 
istry can measure alterations on a cell-by-cell basis (Pauletti et ai, 
1996; Slamon et ai, 1989). However; FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature. (0 Chromosome regions 4q 1 1 -q 1 3 and 21q21.2 
(which bear alb and app, respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et ai, 1994). (ii) We found that 
amplifications of these 3 oncogenes were independent events, as 
reported by other teams (Berns et ai, 1992; Borg et ai, 1992). (///') 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and erbB2 amplifica- 
tion, confirming the findings of Borg et ai ( 1 992) and Courjai et ai 
(1997). (iv) The maxima of ccndl and erbB2 over-representation 
were 18-fold and 15-fold, also in keeping with earlier results (about 
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Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: Tl 1 8 (El 2, C6, black squares), Tl 33 (Gil. B4, red squares) 
and T 1 45 (A8, C8, blue squares). Given the C, of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experiment Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table H. 



30-fold maximum) (Berns et aL, 1992; Borg et aL. 1992; Courjal et 
aL, 1997). (v) The erbBl copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et aL, 1995; Deng et aL, 1 996; Valeron 



et aL, 1 996). Our results also correlate well with those recently 
published by Gelmini et aL ( 1 997), who used the TaqMan system to 
measure erbBl amplification in a small series of breast tumors 
(n = 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perkin-Elmer Applied Biosystems) which only allows end- 
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TABLE 11 - EXAMPLES OF ccndl GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS' 



Tumor 




ccndl 






alb . 




Kccndl/alb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


T118 


4525 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








. T133 


59821 






9787 










61659 


61100 


1111 


10092 


10137 


375 


6.03 




61821 






10533 . 








T145 


128563 






7321 










125892 


125392 


3448 


7762 


7672 


316 


16.34 




121722 






7933 









'For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gene 
amplification (Hccndl/alb) is determined by dividing the average ccndl 
copy namber value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(> 5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and erb&2) observed by means of real -time 
quantitative PCR as compared with Southern-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et al, 1992; 
Slamone/tf/., 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erbB2 (but not of the other 2 proto- 
oncogenes) in several tumors; erbB2 is located in a chromosome 
region (17q21) reported to contain both deletions and amplifica- 
tions in breast cancer (Bieche and Lidereau, 1 995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is riot sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 
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8. Ashkenazi, A .; and Chamow, S. M. Immunoadhesins: an alternative to human 
monoclonal antibodies. Immunomethods: A companion to Methods in 
EnzimologyS y 104-115 (1995). 
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